ELANA: A Simple Energy and Latency Analyzer for LLMs
🤗 Supports models on Hugging Face, including Transformers, SSMs, and hybrid models
🔧 Supports energy, latency, model size, and KV cache size profiling
🔍 Supports detailed kernel-level latency profiling
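To give a sense of what the model size and KV cache size numbers mean, here is a minimal back-of-the-envelope sketch. It is not ELANA's implementation: the function names, the dtype table, and the example configuration are hypothetical, and the KV cache formula assumes a standard attention-based Transformer (SSM and hybrid models keep a different kind of state).

```python
# Bytes per element for a few common dtypes (illustrative subset, an assumption).
BYTES_PER_DTYPE = {"fp32": 4, "fp16": 2, "bf16": 2, "int8": 1}


def model_size_bytes(num_params: int, dtype: str = "fp16") -> int:
    """Estimate weight memory as one element per parameter."""
    return num_params * BYTES_PER_DTYPE[dtype]


def kv_cache_bytes(
    num_layers: int,
    num_kv_heads: int,
    head_dim: int,
    seq_len: int,
    batch_size: int = 1,
    dtype: str = "fp16",
) -> int:
    """Estimate KV cache memory for a standard attention-based Transformer."""
    # The factor 2 accounts for storing both keys and values in every layer.
    elements = 2 * num_layers * num_kv_heads * head_dim * seq_len * batch_size
    return elements * BYTES_PER_DTYPE[dtype]


if __name__ == "__main__":
    # Hypothetical configuration: 32 layers, 8 KV heads, head dimension 128.
    cache = kv_cache_bytes(32, 8, 128, seq_len=4096)
    print(f"KV cache: {cache / 2**20:.0f} MiB")
    print(f"Weights:  {model_size_bytes(7_000_000_000) / 1e9:.0f} GB")
```

These estimates ignore framework overhead, activation memory, and any padding, so measured values are usually somewhat higher.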
Compare to Zeus profiling framework
Hugging Face interface
Profile model size
Profiling results
Profile models on A6000
Profile models on Jetson series
Profiling kernels
We use the PyTorch profiler (torch.profiler) to generate a JSON trace file, which can then be visualized with Perfetto.
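Besides opening the trace in Perfetto, the JSON file can be summarized with a short script. The sketch below is not part of ELANA. It assumes the trace follows the Chrome trace event format that torch.profiler's export_chrome_trace typically writes: a top-level "traceEvents" list whose complete events ("ph" equal to "X") carry "name", "cat", and a "dur" field in microseconds. The category name "kernel" and the helper names are assumptions and may need adjusting for a given PyTorch version.

```python
import json
from collections import defaultdict


def summarize_kernels(trace: dict, top_k: int = 5):
    """Return the top_k kernels by total duration (microseconds)."""
    totals = defaultdict(float)
    for event in trace.get("traceEvents", []):
        # Only complete events ("X") carry a duration.
        if event.get("ph") != "X":
            continue
        # Assumed category label for GPU kernels; compared case-insensitively.
        if str(event.get("cat", "")).lower() != "kernel":
            continue
        totals[event.get("name", "<unnamed>")] += float(event.get("dur", 0))
    # Sort by accumulated duration, longest first.
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)[:top_k]


def summarize_trace_file(path: str, top_k: int = 5):
    """Load a JSON trace from disk and summarize its kernel events."""
    with open(path) as f:
        return summarize_kernels(json.load(f), top_k)


if __name__ == "__main__":
    # Tiny synthetic trace used only to demonstrate the function.
    demo = {
        "traceEvents": [
            {"ph": "X", "cat": "kernel", "name": "gemm", "dur": 10},
            {"ph": "X", "cat": "kernel", "name": "gemm", "dur": 5},
            {"ph": "X", "cat": "cpu_op", "name": "aten::mm", "dur": 100},
        ]
    }
    for name, total_us in summarize_kernels(demo):
        print(f"{name}: {total_us:.1f} us")
```

Aggregating by kernel name gives a quick ranking of where GPU time goes, and the timeline view in Perfetto remains the better tool for inspecting overlap and idle gaps.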
Citation
@article{chiang2025elana,
title = {ELANA: A Simple Energy and Latency Analyzer for LLMs},
author = {Chiang, Hung-Yueh and Wang, Bokun and Marculescu, Diana},
journal = {arXiv preprint arXiv:2512.09946},
year = {2025},
}
Acknowledgements
This work was supported in part by the ONR Minerva program, NSF CCF Grant No. 2107085, iMAGiNE (the Intelligent Machine Engineering Consortium at UT Austin), and UT Cockrell School of Engineering Doctoral Fellowships.