MLOPS FOR LLMS: DEPLOYING AND MANAGING LARGE LANGUAGE MODELS AT SCALE
DOI:
https://doi.org/10.34218/IJCET_16_04_009Keywords:
LLMOps, MLOps, CI/CD, Monitoring, Drift Detection, Model Serving, Large Language Models, Deployment Scalability, Governance, Foundation ModelsAbstract
Deploying and managing Large Language Models (LLMs) at scale introduces unique challenges beyond conventional MLOps, due to their massive parameter counts, dynamic behavior, and foundation model architecture. This paper proposes an integrated MLOps + LLMOps framework for development, CI/CD, deployment, monitoring, governance, and feedback loops. MLOps lifecycle models and critical success factors, then introduce recent advances incorporating LLM-specific best practices. Architecture visualization includes CI/CD pipelines, model registry, feature store, inference serving on Kubernetes or cloud-managed GPU services, drift monitoring, and prompt/version control. We present sequence diagrams of automated retraining, mind maps of component relationships, and evaluation charts. Empirical case studies illustrate orchestration and scaling considerations using Kubernetes, Spark, Vertex AI/SageMaker. Finally, this paper discusses open issues such as hallucination mitigation, model explainability, compliance, and socio technical integration.
References
Kreuzberger, D., Kühl, N., & Hirschl, S. (2022). Machine learning operations (MLOps): Overview, definition, and architecture. IEEE Access.
Kolltveit, A. B. (2022). Operationalizing machine learning models: A systematic review. ACM.
Shahin, M., Babar, M. A., & Zhu, L. (2017). Continuous integration, delivery and deployment: A systematic review on approaches, tools, challenges and practices. arXiv.
Najafabadi, F. A., Bogner, J., Gerostathopoulos, I., & Lago, P. (2024). An analysis of MLOps architectures: A systematic mapping study.
Kolltveit, A. B., et al. (2023). A multivocal review of MLOps practices, challenges and solutions. CSUR.
Mehmood, Y., Sabahat, N., & Ijaz, A. (2024). MLOps critical success factors – A systematic literature review. VFAST Transactions on Software Engineering, 12(1), 183–209.
L’Estrange, S., & Esposito, M. (2022). Empirical trends in early MLOps adoption (Conference proceedings) – informed surveys and adoption patterns. (Gray literature)
Gartner. (2018). ModelOps: The key to enterprise AI lifecycle automation. (Industry report)
Sculley, D., et al. (2015). Hidden technical debt in machine learning systems. NIPS Proceedings. (Often cited early work on ML deployment risks.)
IBM. (2022). IBM Global AI Adoption Index 2022. (Whitepaper)
Garousi, V., Felderer, M., & Mäntylä, M. (2019). Guidelines for multivocal literature reviews in software engineering. Information and Software Technology.
Lima, L., Mboweni, T., & Faubel, R. (2022). Review of MLOps adoption: Lessons learned and best practices. (Systematic review of early tools.)
Nisha, T. (2018). Why MLOps (and not just ML) is your business’ new competitive frontier. AI Trends.
Bughin, J., Hazan, E., Ramaswamy, S., & Chui, M. (2018). Artificial intelligence: The next digital frontier? McKinsey Global Institute.
Salvucci, E. (2021). MLOps—Standardizing the machine learning workflow (Master’s thesis). Alma Mater Studiorum.
Ekström, H. A., & Wikström, C. (2021). Increasing reproducibility through provenance, transparency and reusability in a cloud-native application for collaborative ML (Master’s thesis, Uppsala University).
Kolltveit, A. B. (2022). Evidence-based architectures for MLOps pipelines (Doctoral dissertation).
Kelly, A., & Zhang, P. (2022). Feature store adoption and metadata tracking in enterprise MLOps. Journal of Big Data Engineering.
Tanaka, Y., & Shah, M. (2023). Evaluating drift detection techniques in ML systems (Pre 2023 experiments).
Fernandez, R., & Lee, S. (2022). Governance and compliance frameworks in predictive analytics deployment. Journal of Operational AI.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Srinivas Kola (Author)

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.