AI-ASSISTED MEMORY AND CACHE OPTIMIZATION IN MODERN OPERATING SYSTEMS AND MULTICORE PROCESSORS

Authors

  • Vasuki Shankar Graduate Student, The University of Texas - Dallas, USA Author
  • Nanditha Muralidhar Graduate Student, Vellore Institute of Technology, TN, India Author
  • Vinay Krishnamurthy Member of Technical Staff, AMD, USA Author

DOI:

https://doi.org/10.34218/IJCET_16_02_018

Keywords:

Artificial Intelligence (AI), Memory Management, Cache Optimization, Machine Learning, Deep Learning, Reinforcement Learning, Multicore Processors, Operating Systems, Cache Hit Rate, Memory Latency, Power Efficiency

Abstract

Memory and cache optimization are crucial in various areas of computing, including multicore processors and operating systems handling diverse workloads. Traditional memory management techniques such as Least Recently Used (LRU) and First-In First-Out (FIFO) are still widely used today. However, these methods do not adapt to changing workload patterns, resulting in inefficiencies such as increased cache misses and memory latency. Artificial Intelligence (AI) offers a promising solution to memory and cache management by dynamically adapting to workload variations, thereby enabling maximum resource utilization. Studies have shown that AI-based optimization can lead to a 40% reduction in memory latency, a 15–20% increase in cache hit rates, and lower energy consumption. These improvements make AI particularly well-suited for modern high-performance computing platforms. AI can enhance computational efficiency, eliminate bottlenecks, and improve system responsiveness. Future research should focus on real-time implementation of AI models, hybrid techniques combining heuristic and AI-driven approaches, and ensuring the security of AI-based memory management systems. As AI continues to evolve, its role in optimizing computing architectures will expand significantly, paving the way for smarter and more efficient memory and cache management solutions.

References

Zhang, Y., Zhao, X., Yin, J., Zhang, L. and Chen, Z., 2024. Operating system and artificial intelligence: A systematic review. arXiv preprint arXiv:2407.14567.

Wang, J., Cao, Y., Zhai, Y., Gao, J., Wu, L., Tang, J., Wang, H., Cai, X., Shan, H., Yu, Y. and Wang, S., Machine Learning System Architecture and Training Method Optimization for Financial Data Center Based on Heterogeneous Ai Chips.

Reza, M.F., 2023. Machine learning enabled solutions for design and optimization challenges in networks-on-chip based multi/many-core architectures. ACM Journal on Emerging Technologies in Computing Systems, 19(3), pp.1-26.

Tadi, S.R.C.C.T., AI-Based Automated Memory Profiling And Optimized Backend Data Synchronizations.

V. Shankar, "Advancements in AI-Based Compiler Optimization Techniques for Machine Learning Workloads," International Journal of Computer Sciences and Engineering, vol. 13, no. 3, pp. 70–77, 2025.

Geng, T., Amaris, M., Zuckerman, S., Goldman, A., Gao, G.R. and Gaudiot, J.L., 2022. A profile-based AI-assisted dynamic scheduling approach for heterogeneous architectures. International Journal of Parallel Programming, 50(1), pp.115-151.

Li, S., Lin, M.S., Chen, W.C. and Tsai, C.C., 2024. High-Bandwidth Chiplet Interconnects for Advanced Packaging Technologies in AI/ML Applications: Challenges and Solutions. IEEE Open Journal of the Solid-State Circuits Society.

Brewer, W., Gainaru, A., Suter, F., Wang, F., Emani, M. and Jha, S., 2024. AI-coupled HPC workflow applications, middleware and performance. arXiv preprint arXiv:2406.14315.

Kwon, J., 2022. Machine Learning for AI-Augmented Design Space Exploration of Computer Systems. Columbia University.

Wang, X. and Jia, W., 2025. Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies. arXiv preprint arXiv:2501.03265.

Tol, M.C., 2024. AI-Assisted Detection and Mitigation of Microarchitectural Vulnerabilities (Doctoral dissertation, Worcester Polytechnic Institute).

V. Shankar, M. M. Deshpande, N. Chaitra, and S. Aditi, "Automatic Detection of Acute Lymphoblastic Leukemia Using Image Processing," in Proceedings of the 2016 IEEE International Conference on Advances in Computer Applications (ICACA), Coimbatore, India, 2016, pp. 186–189, doi: 10.1109/ICACA.2016.7887948.

Lnu, D.K., 2025. AI-Driven Verification for Compute Express Link (CXL): Challenges, Innovations, and Future.

Souza, M.A. and Freitas, H.C., 2024. Reinforcement Learning-Based Cache Replacement Policies for Multicore Processors. IEEE Access.

Li, L., Gou, F., Long, H., He, K. and Wu, J., 2022. Effective data optimization and evaluation based on social communication with AI‐assisted in opportunistic social networks. Wireless Communications and Mobile Computing, 2022(1), p.4879557.

V. Shankar, "Machine Learning for Linux Kernel Optimization: Current Trends and Future Directions," International Journal of Computer Sciences and Engineering, vol. 13, no. 3, pp. 56–64, 2025.

Bhaskaran, S.V., 2023. Resilient real-time data delivery for ai summarization in conversational platforms: Ensuring low latency, high availability, and disaster recovery. Journal of Intelligent Connectivity and Emerging Technologies, 8(3), pp.113-130.

Wu, N. and Xie, Y., 2022. A survey of machine learning for computer architecture and systems. ACM Computing Surveys (CSUR), 55(3), pp.1-39.

Abdellatif, A.A., Abo-Eleneen, A., Mohamed, A., Erbad, A., Navkar, N.V. and Guizani, M., 2023. Intelligent-slicing: An AI-assisted network slicing framework for 5G-and-beyond networks. IEEE Transactions on Network and Service Management, 20(2), pp.1024-1039.

V. Shankar, "Edge AI: A Comprehensive Survey of Technologies, Applications, and Challenges," in 2024 1st International Conference on Advanced Computing and Emerging Technologies (ACET), Ghaziabad, India, 2024, pp. 1–6, doi: 10.1109/ACET61898.2024.10730112.

Li, H., Sun, J. and Xiong, K., 2024. AI-Driven Optimization System for Large-Scale Kubernetes Clusters: Enhancing Cloud Infrastructure Availability, Security, and Disaster Recovery. Journal of Artificial Intelligence General science (JAIGS) ISSN: 3006-4023, 2(1), pp.281-306.

Wilkins, G., 2024. Online Workload Allocation and Energy Optimization in Large Language Model Inference Systems.

Downloads

Published

2025-04-16

How to Cite

Vasuki Shankar, Nanditha Muralidhar, & Vinay Krishnamurthy. (2025). AI-ASSISTED MEMORY AND CACHE OPTIMIZATION IN MODERN OPERATING SYSTEMS AND MULTICORE PROCESSORS. INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING AND TECHNOLOGY, 16(2), 250-264. https://doi.org/10.34218/IJCET_16_02_018