Spectra Net: Hybrid Time and Frequency-Domain Modeling for Sustainable Cloud CPU Prediction

Loading...
Thumbnail Image

Journal Title

Journal ISSN

Volume Title

Publisher

Department of Computer Science and Engineering(CSE), Islamic University of Technology(IUT), Board Bazar, Gazipur-1704, Bangladesh

Abstract

The proliferation of cloud computing has created an urgent need for sustainable resource management strategies to mitigate the growing energy consumption and carbon footprint of data centers. Accurate CPU workload forecasting is a cornerstone of this effort, en abling proactive resource allocation that minimizes energy waste from over-provisioning, and prevents performance degradation from under-provisioning. This study introduces SpectraNet, a lightweight hybrid model that advances sustainable cloud computing by delivering highly accurate CPU usage predictions with minimal computational overhead. By integrating time domain and frequency-domain analysis, SpectraNet effectively cap tures both transient and periodic workload patterns. Experimental results on the Azure VM CPU Readings and Alibaba Cloud Workload datasets demonstrate that this dual-domain approach significantly improves forecasting accuracy. SpectraNet demonstrates an com petitive balance of performance and efficiency, achieving a Mean Absolute Error (MAE) of 0.0549 on long-range forecasts. Critically, the model achieves this accuracy while being up to 19 times smaller (with only 89,805 parameters) and 5 times faster at inference than larger, more complex architectures. The model’s efficiency and small footprint make it a practical and scalable solution for real world, resource-constrained cloud environments. By enabling more precise and energy-efficient resource management, SpectraNet contributes a valuable tool for building more sustainable and cost-effective cloud infrastructures.

Description

Supervised by Dr. Md. Azam Hossain, Associate Professor, Mr. Aashnan Rahman, Junior Lecturer, Department of Computer Science and Engineering (CSE) Islamic University of Technology (IUT) Board Bazar, Gazipur, Bangladesh This thesis is submitted in partial fulfillment of the requirement for the degree of Bachelor of Science in Computer Science and Engineering, 2025

Citation

[1] M. Alhartomi, A. Salh, L. Audah, S. Alzahrani, and A. Alzahmi, “Enhancing sus tainable edge computing offloading via renewable prediction for energy harvesting,” IEEE Access, vol. 12, pp. 74 011–74 023, 2024. [2] Alibaba, Alibaba cluster trace data v2017, https://github.com/alibaba/ clusterdata / tree / master / cluster - trace - v2017, Accessed: 2025-05-28, 2017. [3] Alibaba, Alibaba cluster trace data v2018, https://github.com/alibaba/ clusterdata / tree / master / cluster - trace - v2018, Accessed: 2025-05-28, 2018. [4] M. Alonge and O. Isreal, “Time series forecasting with deep learning: New frontiers in predictive analytics,” [5] M. Amiri and L. Mohammad-Khanli, “Survey on prediction models of applica tions for resources provisioning in cloud,” Journal of Network and Computer Applications, vol. 82, pp. 93–113, 2017. [6] A.-F. Antonescu and T. Braun, “Simulation of sla-based vm-scaling algorithms for cloud-distributed applications,” Future Generation computer systems, vol. 54, pp. 260–273, 2016. [7] A. Anwar et al., “Improving docker registry design based on production workload analysis,” in 16th USENIX Conference on File and Storage Technologies (FAST 18), 2018, pp. 265–278. [8] D. Ardagna, S. Casolari, and B. Panicucci, “Flexible distributed capacity allocation and load redirect algorithms for cloud systems,” in 2011 IEEE 4th International Conference on Cloud Computing, IEEE, 2011, pp. 163–170. [9] S. Bai, Z. Kolter, and V. Koltun, “An empirical evaluation of generic convolutional and recurrent networks for sequence modeling,” arXiv preprint arXiv:1803.01271, 2018. 62 [10] F. J. Baldan, S. Ramirez-Gallego, C. Bergmeir, F. Herrera, and J. M. Benitez, “A forecasting methodology for workload forecasting in cloud systems,” IEEE Transactions on Cloud Computing, vol. 6, no. 4, pp. 929–941, 2016. [11] K. Bandara, C. Bergmeir, and S. Smyl, “Forecasting across time series databases using recurrent neural networks on groups of similar series: A clustering approach,” Expert Systems with Applications, vol. 140, p. 112 896, 2020. [12] M. Barati and S. Sharifian, “A hybrid heuristic-based tuned support vector regres sion model for cloud load prediction,” The Journal of Supercomputing, vol. 71, no. 11, pp. 4235–4259, 2015. [13] K. Benidis et al., “Deep learning for time series forecasting: Tutorial and literature survey,” ACM Computing Surveys, vol. 55, no. 6, pp. 1–36, 2022. [14] S. Bharany et al., “A systematic survey on energy-efficient techniques in sustainable cloud computing,” Sustainability, vol. 14, no. 10, p. 6256, 2022. [15] J. Bi, S. Li, H. Yuan, and M. Zhou, “Integrated deep learning method for workload and resource prediction in cloud systems,” Neurocomputing, vol. 424, pp. 35–48, 2021. [16] J. Bi, H. Ma, H. Yuan, and J. Zhang, “Accurate prediction of workloads and resources with multi-head attention and hybrid lstm for cloud data centers,” IEEE Transactions on Sustainable Computing, vol. 8, no. 3, pp. 375–384, 2023. [17] J. Bi, H. Yuan, S. Li, K. Zhang, J. Zhang, and M. Zhou, “Arima-based and multi application workload prediction with wavelet decomposition and savitzky–golay filter in clouds,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 54, no. 4, pp. 2495–2506, 2024. [18] J. Bi, L. Zhang, H. Yuan, and M. Zhou, “Hybrid task prediction based on wavelet decomposition and arima model in cloud data center,” in 2018 IEEE 15th Inter national Conference on Networking, Sensing and Control (ICNSC), IEEE, 2018, pp. 1–6. [19] T. Bollerslev, “Generalized autoregressive conditional heteroskedasticity,” Journal of Econometrics, vol. 31, no. 3, pp. 307–327, 1986. [20] G. E. Box and D. A. Pierce, “Distribution of residual autocorrelations in autoregressive integrated moving average time series models,” Journal of the American statistical Association, vol. 65, no. 332, pp. 1509–1526, 1970. [21] R. N. Calheiros, E. Masoumi, R. Ranjan, and R. Buyya, “Workload prediction using arima model and its impact on cloud applications’ qos,” IEEE transactions on cloud computing, vol. 3, no. 4, pp. 449–458, 2014. 63 [22] R. N. Calheiros, R. Ranjan, A. Beloglazov, C. A. F. D. Rose, and R. Buyya, “Cloudsim: A toolkit for modeling and simulation of cloud computing environ ments and evaluation of resource provisioning algorithms,” Software: Practice and Experience, vol. 41, no. 1, pp. 23–50, 2011. DOI: 10.1002/spe.995 [23] M. C. Calzarossa, M. L. Della Vedova, L. Massari, D. Petcu, M. I. Tabash, and D. Tessera, “Workloads in the clouds,” in Principles of Performance and Reliability Modeling and Evaluation: Essays in Honor of Kishor Trivedi on his 70th Birthday, Springer, 2016, pp. 525–550. [24] J. Cao, J. Fu, M. Li, and J. Chen, “Cpu load prediction for cloud environment based on a dynamic ensemble model,” Software: Practice and Experience, vol. 44, no. 7, pp. 793–804, 2014. [25] A. Casolaro, V. Capone, G. Iannuzzo, and F. Camastra, “Deep learning for time series forecasting: Advances and open problems,” Information, vol. 14, no. 11, p. 598, 2023. [26] V. Cerqueira, L. Torgo, and C. Soares, “Machine learning vs statistical methods for time series forecasting: Size matters,” arXiv preprint arXiv:1909.13316, 2019. [27] K. Cetinski and M. B. Juric, “Ame-wpc: Advanced model for efficient workload prediction in the cloud,” Journal of Network and Computer Applications, vol. 55, pp. 191–201, 2015. [28] T. Chai and R. R. Draxler, “Root mean square error (rmse) or mean absolute error (mae)? – arguments against avoiding rmse in the literature,” Geoscientific Model Development, vol. 7, no. 3, pp. 1247–1250, 2014. [29] B. Chen, M. Fang, and X. Li, “Denoising matrix factorization for high-dimensional time series forecasting,” Neural Computing and Applications, vol. 36, no. 2, pp. 993–1005, 2024. [30] C. Chen, N. Lu, B. Jiang, and C. Wang, “A risk-averse remaining useful life estimation for predictive maintenance,” IEEE/CAA Journal of Automatica Sinica, vol. 8, no. 2, pp. 412–422, 2021. [31] Z. Chen, J. Hu, G. Min, A. Y. Zomaya, and T. El-Ghazawi, “Towards accurate prediction for high-dimensional and highly-variable cloud workloads with deep learning,” IEEE Transactions on Parallel and Distributed Systems, vol. 31, no. 4, pp. 923–934, 2019. [32] C. Clemm, L. Stobbe, K. Wimalawarne, and J. Druschke, “Towards green ai: Current status and future research,” in 2024 Electronics Goes Green 2024+(EGG), IEEE, 2024, pp. 1–11. 64 [33] E. Cortez, A. Bonde, A. Muzio, M. Russinovich, M. Fontoura, and R. Bianchini, “Resource central: Understanding and predicting workloads for improved resource management in large cloud platforms,” in Proceedings of the 26th Symposium on Operating Systems Principles, 2017, pp. 153–167. [34] V. Cozzolino, L. Tonetto, N. Mohan, A. Y. Ding, and J. Ott, “Nimbus: Towards latency-energy efficient task offloading for ar services,” IEEE Transactions on Cloud Computing, vol. 11, no. 2, pp. 1530–1545, 2022. [35] F. Dama and C. Sinoquet, “Time series analysis and modeling to forecast: A survey,” arXiv preprint arXiv:2104.00164, 2021. [36] T. DeStefano, R. Kneller, and J. Timmis, “Cloud computing and firm growth,” The Review of Economics and Statistics, pp. 1–47, Nov. 2023, ISSN: 0034-6535. DOI: 10.1162/rest_a_01393 eprint: https://direct.mit.edu/rest/ article-pdf/doi/10.1162/rest\_a\_01393/2186107/rest\ _a\_01393.pdf. [Online]. Available: https://doi.org/10.1162/ rest%5C_a%5C_01393 [37] S. Di, D. Kondo, and W. Cirne, “Host load prediction in a google compute cloud with a bayesian model,” in SC’12: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, IEEE, 2012, pp. 1–11. [38] B. Dietrich, S. Nunna, D. Goswami, S. Chakraborty, and M. Gries, “Lms-based low-complexity game workload prediction for dvfs,” in 2010 IEEE International conference on computer design, IEEE, 2010, pp. 417–424. [39] J. Dogani, F. Khunjush, and M. Seydali, “Host load prediction in cloud computing with discrete wavelet transformation (dwt) and bidirectional gated recurrent unit (bigru) network,” Computer Communications, vol. 198, pp. 157–174, 2023. [40] H. Drucker, C. J. Burges, L. Kaufman, A. Smola, and V. Vapnik, “Support vector regression machines,” Advances in neural information processing systems, vol. 9, 1997. [41] J. Duggan, Y. Chi, H. Hacigümü¸s, S. Zhu, and U. Cetintemel, “Packing light: Portable workload performance prediction for the cloud,” in 2013 IEEE 29th International Conference on Data Engineering Workshops (ICDEW), IEEE, 2013, pp. 258–265. [42] P. Duhamel and M. Vetterli, “Fast fourier transforms: A tutorial review and a state of the art,” Signal processing, vol. 19, no. 4, pp. 259–299, 1990. 65 [43] Entony, Cloud computing performance metrics, 2023. DOI: 10.34740/KAGGLE/ DSV/6165137 [Online]. Available: https://www.kaggle.com/dsv/ 6165137 [44] B. Feng and Z. Ding, “Application-oriented cloud workload prediction: A survey and new perspectives,” Tsinghua Science and Technology, vol. 30, no. 1, pp. 34–54, 2024. [45] J. Gama et al., “A survey on concept drift adaptation,” ACM computing surveys (CSUR), vol. 46, no. 4, pp. 1–37, 2014. [46] A. Gandhi, Y. Chen, D. Gmach, M. Arlitt, and M. Marwah, “Minimizing data center sla violations and power consumption via hybrid resource provisioning,” in 2011 International Green Computing Conference and Workshops, IEEE, 2011, pp. 1–8. [47] J. Gao, H. Wang, and H. Shen, “Machine learning based workload prediction in cloud computing,” in 2020 29th international conference on computer communica tions and networks (ICCCN), IEEE, 2020, pp. 1–9. [48] E. S. Gardner Jr, “Exponential smoothing: The state of the art,” Journal of fore casting, vol. 4, no. 1, pp. 1–28, 1985. [49] E. S. Gardner Jr, “Exponential smoothing: The state of the art—part ii,” Interna tional journal of forecasting, vol. 22, no. 4, pp. 637–666, 2006. [50] E. Gomede, The fourier transform and its application in machine learning, Medium, TDS Archive, Accessed: 2025-08-14, Nov. 2023. [Online]. Available: https:// medium.com/the-modern-scientist/the-fourier-transform and-its-application-in-machine-learning-edecfac4133c [51] J. A. González Martínez, M. Bote-Lorenzo, E. Gómez-Sánchez, and R. Cano Parra, “Cloud computing and education: A state-of-the-art survey,” Computers & Education, vol. 80, pp. 132–151, Jan. 2015. DOI: 10.1016/j.compedu. 2014.08.017 [52] P. Goodwin, “Forecasting arma processes with systematic errors when estimating the parameters over a short series,” International Journal of Forecasting, vol. 15, no. 4, pp. 405–414, 1999. [53] K. Greff, R. K. Srivastava, J. Koutník, B. R. Steunebrink, and J. Schmidhuber, “Lstm: A search space odyssey,” IEEE transactions on neural networks and learn ing systems, vol. 28, no. 10, pp. 2222–2232, 2016. 66 [54] J. Guo, J. Wu, J. Na, and B. Zhang, “A type-aware workload prediction strategy for non-stationary cloud service,” in 2017 IEEE 10th Conference on Service-Oriented Computing and Applications (SOCA), IEEE, 2017, pp. 98–103. [55] S. Gupta and D. A. Dinesh, “Resource usage prediction of cloud workloads using deep bidirectional long short term memory networks,” in 2017 IEEE international conference on advanced networks and telecommunications systems (ANTS), IEEE, 2017, pp. 1–6. [56] E. Hai-hong et al., “Host load prediction in cloud based on classification methods,” The Journal of China Universities of Posts and Telecommunications, vol. 21, no. 4, pp. 40–46, 2014. [57] H. Han et al., “Time series forecasting model for non-stationary series pattern extraction using deep learning and garch modeling,” Journal of Cloud Computing, vol. 13, no. 1, p. 2, 2024. [58] A. C. Harvey, “Forecasting, structural time series models and the kalman filter,” 1990. [59] G. E. Hinton, S. Osindero, and Y.-W. Teh, “A fast learning algorithm for deep belief nets,” Neural computation, vol. 18, no. 7, pp. 1527–1554, 2006. [60] S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural computation, vol. 9, no. 8, pp. 1735–1780, 1997. [61] M. Hossin and M. N. Sulaiman, “A review on evaluation metrics for data classifica tion evaluations,” International journal of data mining & knowledge management process, vol. 5, no. 2, p. 1, 2015. [62] C.-J. Hsu, V. Nair, V. W. Freeh, and T. Menzies, “Arrow: Low-level augmented bayesian optimization for finding the best cloud vm,” in 2018 IEEE 38th Interna tional Conference on Distributed Computing Systems (ICDCS), 2018. [63] J. Hu, L. Shen, and G. Sun, “Squeeze-and-excitation networks,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 7132– 7141. [64] R. Hu, J. Jiang, G. Liu, and L. Wang, “Cpu load prediction using support vector regression and kalman smoother for cloud,” in 2013 IEEE 33rd international conference on distributed computing systems workshops, IEEE, 2013, pp. 88–92. [65] R. Hu, J. Jiang, G. Liu, and L. Wang, “Kswsvr: A new load forecasting method for efficient resources provisioning in cloud,” in 2013 IEEE International Conference on Services Computing, IEEE, 2013, pp. 120–127. 67 [66] R. Hu, J. Jiang, G. Liu, and L. Wang, “Efficient resources provisioning based on load forecasting in cloud,” The Scientific World Journal, vol. 2014, no. 1, p. 321 231, 2014. [67] P. J. Huber, “Robust estimation of a location parameter,” The Annals of Mathemati cal Statistics, vol. 35, no. 1, pp. 73–101, 1964. [68] R. J. Hyndman and G. Athanasopoulos, Forecasting: principles and practice. OTexts, 2018. [69] R. J. Hyndman and A. B. Koehler, “Another look at measures of forecast accuracy,” International journal of forecasting, vol. 22, no. 4, pp. 679–688, 2006. [70] R. J. Hyndman and A. B. Koehler, “Another look at measures of forecast accuracy,” International Journal of Forecasting, vol. 22, no. 4, pp. 679–688, 2006. [71] S. Islam, J. Keung, K. Lee, and A. Liu, “Empirical prediction models for adaptive resource provisioning in the cloud,” Future Generation Computer Systems, vol. 28, no. 1, pp. 155–162, 2012. [72] M. A. Jahin, A. Shahriar, and M. A. Amin, “Mcdfn: Supply chain demand fore casting via an explainable multi-channel data fusion network model,” Evolutionary Intelligence, vol. 18, no. 3, p. 66, 2025. [73] D. Janardhanan and E. Barrett, “Cpu workload forecasting of machines in data centers using lstm recurrent neural networks and arima models,” in 2017 12th international conference for internet technology and secured transactions (ICITST), IEEE, 2017, pp. 55–60. [74] A. Javeed, A. Borg, H. Grahn, L. Lundberg, D. Patel, and S. Shirinbab, “Improving cloud efficiency: A machine learning-based stacking model for cpu utilization prediction,” in 2025 8th International Conference on Data Science and Machine Learning Applications (CDMA), IEEE, 2025, pp. 120–125. [75] J.-J. Jheng, F.-H. Tseng, H.-C. Chao, and L.-D. Chou, “A novel vm workload prediction using grey forecasting model in cloud data center,” in The International Conference on Information Networking 2014 (ICOIN2014), IEEE, 2014, pp. 40– 45. [76] J. Jiang, J. Lu, G. Zhang, and G. Long, “Optimal cloud resource auto-scaling for web applications,” in 2013 13th IEEE/ACM international symposium on cluster, Cloud, and Grid Computing, IEEE, 2013, pp. 58–65. [77] M. E. Karim, M. M. S. Maswood, S. Das, and A. G. Alharbi, “Bhyprec: A novel bi-lstm based hybrid recurrent neural network model to predict the cpu workload of cloud virtual machine,” IEEE Access, vol. 9, pp. 131 476–131 495, 2021. 68 [78] A. Khan, X. Yan, S. Tao, and N. Anerousis, “Workload characterization and prediction in the cloud: A multiple time series approach,” in 2012 IEEE Network Operations and Management Symposium, IEEE, 2012, pp. 1287–1294. [79] I. K. Kim, W. Wang, Y. Qi, and M. Humphrey, “Forecasting cloud application work loads with cloudinsight for predictive resource management,” IEEE Transactions on Cloud Computing, vol. 10, no. 3, pp. 1848–1863, 2020. [80] F. Kluge, S. Uhrig, J. Mische, B. Satzger, and T. Ungerer, “Dynamic workload prediction for soft real-time applications,” in 2010 10th IEEE International Con ference on Computer and Information Technology, IEEE, 2010, pp. 1841–1848. [81] A. S. Kumar and S. Mazumdar, “Forecasting hpc workload using arma models and ssa,” in 2016 International conference on information technology (ICIT), IEEE, 2016, pp. 294–297. [82] J. Kumar, D. Saxena, A. K. Singh, and A. V. Vasilakos, “A quantum controlled-not neural network-based load forecast and management model for smart grid,” IEEE Systems Journal, vol. 17, no. 4, pp. 5714–5725, 2023. [83] J. Kumar, R. Goomer, and A. K. Singh, “Long short term memory recurrent neural network (lstm-rnn) based workload forecasting model for cloud datacenters,” Procedia computer science, vol. 125, pp. 676–682, 2018. [84] J. Kumar and A. K. Singh, “Workload prediction in cloud using artificial neural network and adaptive differential evolution,” Future Generation Computer Systems, vol. 81, pp. 41–52, 2018. [85] A. Lackinger, A. Morichetta, and S. Dustdar, “ Time Series Predictions for Cloud Workloads: A Comprehensive Evaluation,” in 2024 IEEE International Conference on Service-Oriented System Engineering (SOSE), Los Alamitos, CA, USA: IEEE Computer Society, Jul. 2024, pp. 36–45. DOI: 10.1109/SOSE62363.2024. 00011 [Online]. Available: https : / / doi . ieeecomputersociety . org/10.1109/SOSE62363.2024.00011 [86] G. Lai, W.-C. Chang, Y. Yang, and H. Liu, “Modeling long- and short-term tem poral patterns with deep neural networks,” The 41st International ACM SIGIR Conference, 2018. [87] C. Lea, M. D. Flynn, R. Vidal, A. Reiter, and G. D. Hager, “Temporal convolutional networks for action segmentation and detection,” in proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 156–165. [88] Y. LeCun, Y. Bengio, and G. Hinton, “Deep learning,” nature, vol. 521, no. 7553, pp. 436–444, 2015. 69 [89] Y. C. Lee and A. Y. Zomaya, “Energy efficient utilization of resources in cloud computing systems,” The Journal of Supercomputing, vol. 60, pp. 268–280, 2012. [90] J. Li, J. Yao, D. Xiao, D. Yang, and W. Wu, “Evogwp: Predicting long-term changes in cloud workloads using deep graph-evolution learning,” IEEE Transactions on Parallel and Distributed Systems, vol. 35, no. 3, pp. 499–516, 2024. [91] S. Li, Y. Wang, X. Qiu, D. Wang, and L. Wang, “A workload prediction-based multi-vm provisioning mechanism in cloud computing,” in 2013 15th Asia-Pacific network operations and management symposium (APNOMS), IEEE, 2013, pp. 1–6. [92] S. Li, J. Bi, H. Yuan, M. Zhou, and J. Zhang, “Improved lstm-based prediction method for highly variable workload and resources in clouds,” in 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC), IEEE, 2020, pp. 1206–1211. [93] X. Li, D. Zhong, B. Ren, G. Fan, and B. Cui, “Prediction of curtain grouting efficiency based on anfis,” Bulletin of Engineering Geology and the Environment, vol. 78, no. 1, pp. 281–309, 2019. [94] Y. Li et al., “Spatial-temporal fusion graph neural networks for traffic flow fore casting,” Proceedings of AAAI, 2021. [95] Y. Li, R. Yu, C. Shahabi, and Y. Liu, “Diffusion convolutional recurrent neural network: Data-driven traffic forecasting,” in International Conference on Learning Representations (ICLR), 2018. [96] B. Lim and S. Zohren, “Time-series forecasting with deep learning: A survey,” Philosophical Transactions of the Royal Society A, vol. 379, no. 2194, p. 20 200 209, 2021. [97] C. Liu, C. Liu, Y. Shang, S. Chen, B. Cheng, and J. Chen, “An adaptive prediction approach based on workload pattern discrimination in the cloud,” Journal of Network and Computer Applications, vol. 80, pp. 35–44, 2017. [98] Y. Liu, B. Gong, C. Xing, and Y. Jian, “A virtual machine migration strategy based on time series workload prediction using cloud model,” Mathematical Problems in Engineering, vol. 2014, no. 1, p. 973 069, 2014. [99] T. Lorido-Botran, J. Miguel-Alonso, and J. A. Lozano, “A review of auto-scaling techniques for elastic applications in cloud environments,” Journal of grid comput ing, vol. 12, no. 4, pp. 559–592, 2014. [100] Y. Lu, J. Panneerselvam, L. Liu, and Y. Wu, “Rvlbpnn: A workload forecasting model for smart cloud computing,” Scientific Programming, vol. 2016, no. 1, p. 5 635 673, 2016. 70 [101] S. Luo et al., “Characterizing microservice dependency and performance: Alibaba trace analysis,” in Proceedings of the ACM Symposium on Cloud Computing, 2021, pp. 412–426. [102] S. Luo et al., “The power of prediction: Microservice auto scaling via workload learning,” in Proceedings of the ACM Symposium on Cloud Computing, 2022. [103] H. Lyu, P. Li, R. Yan, A. Masood, B. Sheng, and Y. Luo, “Load forecast of resource scheduler in cloud architecture,” in 2016 International Conference on Progress in Informatics and Computing (PIC), IEEE, 2016, pp. 508–512. [104] A. I. Maiyza, N. O. Korany, K. Banawan, H. A. Hassan, and W. M. Sheta, “Vtgan: Hybrid generative adversarial networks for cloud workload prediction,” Journal of Cloud Computing, vol. 12, no. 1, p. 97, 2023. [105] S. Makridakis, “Accuracy measures: Theoretical and practical concerns,” Interna tional Journal of Forecasting, vol. 9, no. 4, pp. 527–529, 1993. [106] S. S. Manvi and G. K. Shyam, “Resource management for infrastructure as a service (iaas) in cloud computing: A survey,” Journal of network and computer applications, vol. 41, pp. 424–440, 2014. [107] M. Masdari and A. Khoshnevis, “A survey and classification of the workload forecasting methods in cloud computing,” Cluster Computing, vol. 23, no. 4, pp. 2399–2424, 2020. [108] M. Masdari, S. Nabavi, and V. Ahmadi, “An overview of virtual machine placement schemes in cloud computing,” Journal of Network and Computer Applications, vol. 66, Jan. 2016. DOI: 10.1016/j.jnca.2016.01.011 [109] M. Masdari, S. ValiKardan, Z. Shahi, and S. Azar, “Towards workflow scheduling in cloud computing: A comprehensive analysis,” Journal of Network and Computer Applications, vol. 66, Feb. 2016. DOI: 10.1016/j.jnca.2016.01.018 [110] R. P. Masini, M. C. Medeiros, and E. F. Mendes, “Machine learning advances for time series forecasting,” Journal of economic surveys, vol. 37, no. 1, pp. 76–111, 2023. [111] K. Mason, “Advances in evolutionary neural networks with applications in energy systems and the environment,” Ph.D. dissertation, PhD thesis, NUI Galway, 2018. [112] S. Mazumdar and A. S. Kumar, “Forecasting data center resource usage: An experimental comparison with time-series methods,” in International Conference on Soft Computing and Pattern Recognition, Springer, 2016, pp. 151–165. 71 [113] B. K. Nelson, “Time series analysis using autoregressive integrated moving average (arima) models,” Academic emergency medicine, vol. 5, no. 7, pp. 739–744, 1998. [114] H. M. Nguyen, S. Woo, J. Im, T. Jun, and D. Kim, “A workload prediction approach using models stacking based on recurrent neural network and autoencoder,” in 2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS), IEEE, 2016, pp. 929–936. [115] T. Nguyen, T. Nguyen, B. M. Nguyen, and G. Nguyen, “Efficient time-series fore casting using neural network and opposition-based coral reefs optimization,” Inter national Journal of Computational Intelligence Systems, vol. 12, no. 2, pp. 1144– 1161, 2019. [116] A. Y. Nikravesh, S. A. Ajila, and C.-H. Lung, “Towards an autonomic auto-scaling prediction system for cloud resource provisioning,” in 2015 IEEE/ACM 10th International Symposium on Software Engineering for Adaptive and Self-Managing Systems, IEEE, 2015, pp. 35–45. [117] B. N. Oreshkin, D. Carpov, N. Chapados, and Y. Bengio, “N-beats: Neural ba sis expansion analysis for interpretable time series forecasting,” arXiv preprint arXiv:1905.10437, 2019. [118] A. R. S. Parmezan, V. M. Souza, and G. E. Batista, “Evaluation of statistical and machine learning models for time series prediction: Identifying the state-of-the-art and the best conditions for the use of each model,” Information sciences, vol. 484, pp. 302–337, 2019. [119] J. Pomeroy, “Transforming business intelligence: Leveraging generative ai and predictive analytics in cloud environments,” 2024. [120] O. Project, Opencloud hadoop workload dataset, https://opencloudproject. example.org/hadoop-workloads, Accessed: 2025-07-20, 2010. [121] K. Qazi, Y. Li, and A. Sohn, “Power: Prediction of workload for energy efficient relocation of virtual machines,” in Proceedings of the 4th annual Symposium on Cloud Computing, 2013, pp. 1–2. [122] K. Qazi, Y. Li, and A. Sohn, “Workload prediction of virtual machines for harness ing data center resources,” in 2014 IEEE 7th International Conference on Cloud Computing, IEEE, 2014, pp. 522–529. [123] F. Qiu, B. Zhang, and J. Guo, “A deep learning approach for vm workload predic tion in the cloud,” in 2016 17th IEEE/ACIS International Conference on Software 72 Engineering, Artificial Intelligence, Networking and Parallel/Distributed Comput ing (SNPD), IEEE, 2016, pp. 319–324. [124] S. Qiu, “Global weighted average pooling bridges pixel-level localization and image-level classification,” arXiv preprint arXiv:1809.08264, 2018. [125] L. R. Rabiner, C.-H. Lee, B.-H. Juang, and J. G. Wilpon, “Hmm clustering for connected word recognition,” in International Conference on Acoustics, Speech, and Signal Processing,, IEEE, 1989, pp. 405–408. [126] F. Ramezani and M. Naderpour, “A fuzzy virtual machine workload prediction method for cloud environments,” in 2017 IEEE International conference on fuzzy systems (FUZZ-IEEE), IEEE, 2017, pp. 1–6. [127] R. Reddy, “Sustainable computing: A comprehensive review of energy-efficient algorithms and systems,” Authorea Preprints, 2024. [128] C. Reiss, J. Wilkes, and J. L. Hellerstein, “Google cluster-usage traces: Format+ schema,” Google Inc., White Paper, vol. 1, pp. 1–14, 2011. [129] A. Rossi, A. Visentin, S. Prestwich, and K. N. Brown, “Bayesian uncertainty mod elling for cloud workload prediction,” in 2022 IEEE 15th International Conference on Cloud Computing (CLOUD), IEEE, 2022, pp. 19–29. [130] M. S. Sahay, S. Ganta, B. N. V. Vardhan, and K. Srilakshmi, “A comprehensive study on improving time series forecasting precision,” in Seminars in Medical Writing and Education, AG Editor (Argentina), vol. 3, 2024, p. 7. [131] J. Sahni and D. P. Vidyarthi, “Heterogeneity-aware adaptive auto-scaling heuristic for improved qos and resource usage in cloud environments,” Computing, vol. 99, no. 4, pp. 351–381, 2017. [132] D. Salinas, V. Flunkert, J. Gasthaus, and T. Januschowski, “Deepar: Probabilis tic forecasting with autoregressive recurrent networks,” International journal of forecasting, vol. 36, no. 3, pp. 1181–1191, 2020. [133] S. Sarikaa, S. Niranjana, and K. Sri Vishnu Deepika, “Time series forecasting of cloud resource usage,” in 2021 IEEE 6th International Conference on Computing, Communication and Automation (ICCCA), IEEE, 2021, pp. 372–382. [134] D. Saxena and A. K. Singh, “A comprehensive survey on sustainable resource management in cloud computing environments,” Authorea Preprints, 2024. [135] R. Sen, H.-F. Yu, and I. S. Dhillon, “Think globally, act locally: A deep neural network approach to high-dimensional time series forecasting,” Advances in neural information processing systems, vol. 32, 2019. 73 [136] A. Setayesh, H. Hadian, and R. Prodan, “An efficient online prediction of host workloads using pruned gru neural nets,” arXiv preprint arXiv:2303.16601, 2023. [137] M. Shahrad et al., “Serverless in the wild: Characterizing and optimizing the serverless workload at a large cloud provider,” in 2020 USENIX Annual Technical Conference (USENIX ATC 20), 2020, pp. 205–218. [138] R. Shariffdeen, D. Munasinghe, H. Bhathiya, U. Bandara, and H. D. Bandara, “Adaptive workload prediction for proactive auto scaling in paas systems,” in 2016 2nd International Conference on Cloud Computing Technologies and Applications (CloudTech), IEEE, 2016, pp. 22–29. [139] Z. Shen, S. Subbiah, X. Gu, and J. Wilkes, “Cloudscale: Elastic resource scaling for multi-tenant cloud systems,” in Proceedings of the 2nd ACM Symposium on Cloud Computing, 2011, pp. 1–14. [140] Q. Shi et al., “Block hankel tensor arima for multiple short time series forecasting,” in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, 2020, pp. 5758–5766. [141] N. Singh and S. Rao, “Ensemble learning for large-scale workload prediction,” IEEE Transactions on Emerging Topics in Computing, vol. 2, no. 2, pp. 149–165, 2014. [142] P. Singh, P. Gupta, and K. Jyoti, “Tasm: Technocrat arima and svr model for workload prediction of web applications in cloud,” Cluster Computing, vol. 22, no. 2, pp. 619–633, 2019. [143] M. Smendowski and P. Nawrocki, “Optimizing multi-time series forecasting for enhanced cloud resource utilization based on machine learning,” Knowledge-Based Systems, vol. 304, p. 112 489, 2024. [144] B. Song, Y. Yu, Y. Zhou, Z. Wang, and S. Du, “Host load prediction with long short-term memory in cloud computing,” The Journal of Supercomputing, vol. 74, no. 12, pp. 6554–6568, 2018. [145] W. Sun et al., “Lstm based link quality confidence interval boundary prediction for wireless communication in smart grid,” Computing, vol. 103, no. 2, pp. 251–269, 2021. [146] X. Tang, X. Liao, J. Zheng, and X. Yang, “Energy efficient job scheduling with workload prediction on cloud data center,” Cluster Computing, vol. 21, no. 3, pp. 1581–1593, 2018. 74 [147] S. J. Tarsa, A. P. Kumar, and H. Kung, “Workload prediction for adaptive power scaling using deep learning,” in 2014 IEEE International Conference on IC Design & Technology, IEEE, 2014, pp. 1–5. [148] S. J. Taylor and B. Letham, “Forecasting at scale,” The American Statistician, vol. 72, no. 1, pp. 37–45, 2018. [149] C. Tian et al., “Minimizing content reorganization and tolerating imperfect work load prediction for cloud-based video-on-demand services,” IEEE Transactions on Services Computing, vol. 9, no. 6, pp. 926–939, 2015. [150] M. Tirmazi et al., “Borg: The next generation,” in Proceedings of the fifteenth European conference on computer systems, 2020, pp. 1–14. [151] M. P. Toopchinezhad and M. Ahmadi, “Machine learning approaches for active queue management: A survey, taxonomy, and future directions,” Computer Net works, p. 111 174, 2025. [152] J. F. Torres, D. Hadjout, A. Sebaa, F. Martínez-Álvarez, and A. Troncoso, “Deep learning for time series forecasting: A survey,” Big data, vol. 9, no. 1, pp. 3–21, 2021. [153] K. Valarmathi and S. Kanaga Suba Raja, “Resource utilization prediction tech nique in cloud using knowledge based ensemble random forest with lstm model,” Concurrent Engineering, vol. 29, no. 4, pp. 396–404, 2021. [154] N. Vasiloglou, A. Chandrayan, V. Kuznetsov, et al., “Meta-learning for time-series forecasting,” arXiv preprint arXiv:2004.11702, 2021. [155] P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P.-A. Manzagol, “Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion,” Journal of machine learning research, vol. 11, pp. 3371– 3408, 2010. [156] G. M. Wamba, Y. Li, A.-C. Orgerie, N. Beldiceanu, and J.-M. Menaud, “Cloud workload prediction and generation models,” in 2017 29Th international sympo sium on computer architecture and high performance computing (SBAC-PAD), IEEE, 2017, pp. 89–96. [157] H. Wang, K. J. Mathews, M. Golec, S. S. Gill, and S. Uhlig, “Amazonaicloud: Proactive resource allocation using amazon chronos based time series model for sustainable cloud computing,” Computing, vol. 107, no. 3, p. 77, 2025. [158] W. Wei et al., “Efficient model selection for time series forecasting via llms,” arXiv preprint arXiv:2504.02119, 2025. 75 [159] J. Wilkes, “Google cluster-usage traces v3,” Google Inc., Mountain View, CA, USA, Technical Report, Apr. 2020, Posted at https://github.com/google/ cluster-data/blob/master/ClusterData2019.md. [160] C. J. Willmott and K. Matsuura, “Advantages of the mean absolute error (mae) over the root mean square error (rmse) in assessing average model performance,” Climate research, vol. 30, no. 1, pp. 79–82, 2005. [161] C. J. Willmott and K. Matsuura, “Advantages of the mean absolute error (mae) over the root mean square error (rmse) in assessing average model performance,” Climate Research, vol. 30, no. 1, pp. 79–82, 2005. [162] H. Wu, J. Xu, J. Wang, and M. Long, “Autoformer: Decomposition transform ers with auto-correlation for long-term series forecasting,” Advances in neural information processing systems, vol. 34, pp. 22 419–22 430, 2021. [163] M. Xu and R. Buyya, “Brownoutcon: A software system based on brownout and containers for energy-efficient cloud computing,” Journal of Systems and Software, vol. 155, pp. 91–103, 2019. [164] J. Yang et al., “A cost-aware auto-scaling approach using the workload prediction in service clouds,” Information Systems Frontiers, vol. 16, no. 1, pp. 7–18, 2014. [165] Q. Yang, Y. Zhou, Y. Yu, J. Yuan, X. Xing, and S. Du, “Multi-step-ahead host load prediction using autoencoder and echo state networks in cloud computing,” The Journal of Supercomputing, vol. 71, no. 8, pp. 3037–3053, 2015. [166] Y. Yang, C. Fan, and H. Xiong, “A novel general-purpose hybrid model for time series forecasting,” Applied Intelligence, vol. 52, no. 2, pp. 2212–2223, 2022. [167] Z. Yang et al., “Fftnet: Fusing frequency and temporal awareness in long-term time series forecasting,” Electronics, vol. 14, no. 7, p. 1303, 2025. [168] P. Yazdanian and S. Sharifian, “E2lg: A multiscale ensemble of lstm/gan deep learning architecture for multistep-ahead cloud workload prediction,” The Journal of Supercomputing, vol. 77, no. 10, pp. 11 052–11 082, 2021. [169] J. Ye et al., “A survey of time series foundation models: Generalizing time series representation with large language model,” arXiv preprint arXiv:2405.02358, 2024. [170] K. Yemets, I. Izonin, and I. Dronyuk, “Enhancing the fft-lstm time-series forecast ing model via a novel fft-based feature extraction–extension scheme,” Big Data and Cognitive Computing, vol. 9, no. 2, p. 35, 2025. 76 [171] K. Yi et al., “Frequency-domain mlps are more effective learners in time series fore casting,” Advances in Neural Information Processing Systems, vol. 36, pp. 76 656– 76 679, 2023. [172] H. Yu, Z. Peng, X. Ma, C. Hu, and K. Li, “Temporal clustering for predicting workload intensity of virtual machines in cloud computing environments,” in IEEE Trustcom/BigDataSE/ISPA, 2016, pp. 1243–1248. [173] L. Zhang, Y. Zhang, P. Jamshidi, L. Xu, and C. Pahl, “Workload patterns for quality-driven dynamic cloud service configuration and auto-scaling,” in 2014 IEEE/ACM 7th International Conference on Utility and Cloud Computing, IEEE, 2014, pp. 156–165. [174] Q. Zhang, L. T. Yang, Z. Yan, Z. Chen, and P. Li, “An efficient deep learning model to predict cloud workload for industry informatics,” IEEE transactions on industrial informatics, vol. 14, no. 7, pp. 3170–3178, 2018. [175] X. Zhang, R. R. Chowdhury, R. K. Gupta, and J. Shang, “Large language models for time series: A survey,” arXiv preprint arXiv:2402.01801, 2024. [176] W. Zhong, Y. Zhuang, J. Sun, and J. Gu, “A load prediction model for cloud computing using pso-based weighted wavelet support vector machine,” Applied Intelligence, vol. 48, no. 11, pp. 4072–4083, 2018. [177] W. Zhong, Y. Zhuang, J. Sun, and J. Gu, “Load forecasting for cloud comput ing based on wavelet support vector machine,” International Journal of High Performance Computing and Networking, vol. 14, no. 3, pp. 315–324, 2019. [178] H. Zhou, L. Cheng, S. Yu, and C. Chen, “Tasktransfer: A meta-learning framework for few-shot cloud workload forecasting,” IEEE Transactions on Cloud Computing, 2022. DOI: 10.1109/TCC.2022.3186429 [179] H. Zhou et al., “Informer: Beyond efficient transformer for long sequence time series forecasting,” in Proceedings of AAAI, 2021. [180] X. Zhou et al., “Load balancing prediction method of cloud storage based on analytic hierarchy process and hybrid hierarchical genetic algorithm,” SpringerPlus, vol. 5, no. 1, p. 1989, 2016. [181] Y. Zhu, W. Zhang, Y. Chen, and H. Gao, “A novel approach to workload predic tion using attention-based lstm encoder-decoder network in cloud environment,” EURASIP Journal on Wireless Communications and Networking, vol. 2019, no. 1, p. 274, 2019.

Collections

Endorsement

Review

Supplemented By

Referenced By