this post was submitted on 01 Dec 2024
46 points (88.3% liked)
Futurology
1823 readers
163 users here now
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
A lot of the smaller LLMs don’t require GPU at all - they run just fine on a normal consumer CPU.
yeah but 10x slower, at speeds that just don't work for many use cases. When you compare energy consumption per token, there isn't much difference.
Wouldn't running on a CPU (while possible) make it less energy efficient, though?
It depends. A lot of LLMs are memory-constrained. If you’re constantly thrashing the GPU memory it can be both slower and less efficient.