Per-query energy consumption of LLMs

16 points by avsm


mitsuhiko

To put this into perspective for LLMs at scale: the numbers for Google Gemini are 0.24 Wh of energy and 0.26 mL of water per prompt. Source
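
For a rough sense of what those per-prompt figures add up to, here is a minimal back-of-the-envelope sketch. The two constants are the figures quoted above; the prompt volume is a hypothetical number picked purely for illustration, not a reported statistic, and `scale_usage` is just an illustrative helper name.

```python
# Per-prompt figures as quoted in the comment above.
ENERGY_WH_PER_PROMPT = 0.24
WATER_ML_PER_PROMPT = 0.26


def scale_usage(prompts: int) -> tuple[float, float]:
    """Return (energy in kWh, water in liters) for a given number of prompts."""
    energy_kwh = prompts * ENERGY_WH_PER_PROMPT / 1000  # Wh -> kWh
    water_l = prompts * WATER_ML_PER_PROMPT / 1000      # mL -> L
    return energy_kwh, water_l


# Hypothetical volume, chosen only for illustration (an assumption, not real data).
HYPOTHETICAL_PROMPTS = 1_000_000
energy_kwh, water_l = scale_usage(HYPOTHETICAL_PROMPTS)
print(f"{HYPOTHETICAL_PROMPTS:,} prompts: ~{energy_kwh:.0f} kWh, ~{water_l:.0f} L")
```

So at a hypothetical one million prompts, the quoted figures work out to roughly 240 kWh and 260 L. This is a simple linear scaling and assumes every prompt costs the same as the quoted per-prompt figure.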