“Reducing Latency and Enhancing Accuracy in LLM Inference through Firmware-Level Optimization” (2025) International Journal of Signal Processing, Embedded Systems and VLSI Design, 5(02), pp. 26–36. doi:10.55640/ijvsli-05-02-02.