“Reducing Latency and Enhancing Accuracy in LLM Inference through Firmware-Level Optimization.” International Journal of Signal Processing, Embedded Systems and VLSI Design 5, no. 02 (July 21, 2025): 26–36. Accessed October 5, 2025. https://www.academicpublishers.org/journals/index.php/ijvsli/article/view/5873.