1.
Alexander Müller. Optimizing Large Language Model Inference: Strategies for Latency Reduction, Energy Efficiency, and Cybersecurity Applications. ijcsis [Internet]. 2025 Nov. 30 [cited 2026 Mar. 12];10(11):93-7. Available from: http://scientiamreearch.org/index.php/ijcsis/article/view/214