Articles | Open Access |

Autonomous Recovery Frameworks for Distributed Cross-Platform Business Systems Leveraging Generative Language Models and Container Orchestration

Dr. Sione V. Tupou , Faculty of Software Engineering and Cyber Systems Pacific Technology and Innovation University, Nukuʻalofa, Tonga

Abstract

Modern enterprise computing ecosystems have shifted toward distributed, cross-platform business systems operating across multi-cloud infrastructures, microservices-based architectures, and containerized deployment environments. While this transition enhances scalability, interoperability, and operational agility, it also introduces significant challenges in system reliability, fault tolerance, and automated recovery. Traditional IT operations rely heavily on rule-based monitoring and human-in-the-loop incident response mechanisms, which are insufficient for handling complex, cascading failures in highly dynamic distributed environments.
This research proposes an autonomous recovery framework that integrates Generative Language Models (GLMs) with container orchestration platforms such as Kubernetes to enable intelligent, self-healing enterprise systems. The framework is designed to interpret system telemetry, logs, and event streams using semantic reasoning capabilities of GLMs, enabling automated root cause analysis and recovery orchestration without manual intervention.
A key conceptual foundation of this study is derived from self-healing multi-cloud systems and post-incident intelligence mechanisms that leverage artificial intelligence to continuously learn from failure patterns and optimize system resilience. In particular, the integration of LLM-driven reasoning with Kubernetes-based orchestration forms the backbone of automated remediation workflows, enabling dynamic scaling, rollback strategies, and service reconfiguration in real time.
The proposed architecture is further strengthened by insights from digital twin systems, AI-driven industrial transformation, and semantic knowledge fusion models. These complementary paradigms enhance system observability, contextual awareness, and adaptive decision-making in distributed computing environments.
Experimental and conceptual analysis indicates that such integrated frameworks significantly reduce mean time to recovery (MTTR), improve anomaly classification accuracy, and enhance system stability under variable workload conditions. However, challenges persist in ensuring explainability, controlling hallucinated outputs from generative models, and maintaining secure execution boundaries in autonomous systems.
Overall, this study contributes a structured approach toward building next-generation autonomous enterprise infrastructures capable of self-diagnosis, self-repair, and continuous optimization in cross-platform distributed environments.



Keywords

Autonomous recovery, , Generative Language Models, Kubernetes, distributed systems,

References

Dong Yanzhe, Chen Dongshen. The Possibility of achieving digital immortality in the Metaverse from the Perspective of Systems Thinking [J/OL]. Journal of Systems Science, 2025,( 04 ): 76–80.

Li Li. Construction of Immersive Experience Mode for Archive Users in Metaverse Scenarios [J]. Shanxi Archives, 2025, ( 02 ): 146–149.

Liu Changyun, Ren Wendong. The technical path and industrial transformation of Personalized Clothing customization driven by Artificial Intelligence [J]. Western Leather, 2025, 47 ( 05 ): 35–37.

Meng Yuanyuan, Zhao Yujia. Research on the Integration Design of Traditional Clothing in the Static Styling of Models [J]. Footwear Technology & Design, 2025, 5 ( 07 ): 3–5.

Papamichael I, Economou F, Voukkali I, et al. A metaverse framework for sustainable waste management considering circular economy[J]. Chemical Engineering Journal, 2025512622, 83–162283.

Post-Mortem Intelligence for Self-Healing Multi-Cloud Enterprise Applications Using LLMs and Kubernetes. (2026). International Journal of Research and Applied Innovations, 9(1), 13641-13649. https://doi.org/10.15662/IJRAI.2026.0901017

Wang Le, Zhang Xiaolei, Zhang Lei. For digital twin rendering of cloud edges synergy technology [J/OL]. Computer technology and development, and 1–6 [ 2025 - 05 - 18 ].

Wang Yongqi. Characteristics, Risks and Governance of AI Digital Human Applications in Metaverse Space [J/OL]. Journal of Beijing police academy, 1–16 [ 2025 - 05 - 18 ].

Yao Yi, Mao Yue, Meng Ranju, et al. Research and Application of Automatic Extraction of three-dimensional Human Body Feature Points [J]. Silk, 2025, 62 ( 02 ): 75–81.

Zhang Xingwang, Duan Xuechun, Xin Jie. Research on the Construction and Application of Metaverse Knowledge Service System Oriented to Ternary Space Fusion [J] Books intelligence work, 2024 of ( 15 ): 72–80.

Zhao Zhiqiang. Application Design of Intelligent Manufacturing of Clothing Based on Artificial Intelligence Technology [J]. Footwear Technology and Design, 2025, 5 ( 08 ): 195–197.

Article Statistics

Downloads

Download data is not yet available.

Copyright License

Download Citations

How to Cite

Dr. Sione V. Tupou. (2026). Autonomous Recovery Frameworks for Distributed Cross-Platform Business Systems Leveraging Generative Language Models and Container Orchestration. International Journal of Computer Science & Information System, 11(02), 153–164. Retrieved from https://scientiamreearch.org/index.php/ijcsis/article/view/453