“Reducing Latency and Enhancing Accuracy in LLM Inference through Firmware-Level Optimization”. International journal of signal processing, embedded systems and VLSI design 5, no. 02 (July 21, 2025): 26–36. Accessed February 11, 2026. https://www.academicpublishers.org/journals/index.php/ijvsli/article/view/5873.