this post was submitted on 27 Jan 2025
121 points (96.9% liked)

Futurology

2028 readers
48 users here now

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] just_another_person 19 points 1 week ago* (last edited 1 week ago)

It cost so little because all previous open source work was already done, and a lot of the research work had already been knocked out. Building models isn't the time consuming process it used to be, it's the training, testing, retraining loop that's expensive.

If you're just building a model that is focused on specific things-like coding, math, and logic-then you don't need large swathes of content from the internet, you can just train it on already solved, freely available information. If you want to piss away money on an LLM that also knows how many celebrities each celebrity has diddled, well that costs a lot more to make.