[1]
2025. Reducing Latency and Enhancing Accuracy in LLM Inference through Firmware-Level Optimization. International Journal of Signal Processing, Embedded Systems and VLSI Design. 5, 02 (Jul. 2025), 26–36. DOI:https://doi.org/10.55640/ijvsli-05-02-02.