Alexander Müller. “Optimizing Large Language Model Inference: Strategies for Latency Reduction, Energy Efficiency, and Cybersecurity Applications”. International Journal of Computer Science & Information System, vol. 10, no. 11, Nov. 2025, pp. 93-97, https://scientiamreearch.org/index.php/ijcsis/article/view/214.