this post was submitted on 28 Jan 2025
884 points (94.5% liked)

memes


Office space meme:

"If y'all could stop calling an LLM 'open source' just because they published the weights... that would be great."

[–] thespcicifcocean 2 points 1 week ago (2 children)

It's not just the weights though, is it? You can download the training data they used, and run your own instance of the model completely separate from their servers.

[–] [email protected] 9 points 1 week ago (1 children)

Did "they" publish the training data? And the hyperparameters?

[–] thespcicifcocean -2 points 1 week ago (1 children)

I mean, I downloaded it from the repo.

[–] [email protected] 10 points 1 week ago (1 children)

You downloaded the weights. That's something different.

[–] thespcicifcocean 1 points 1 week ago (1 children)

I may be misunderstanding, but are the weights typically several hundred gigabytes in size?

[–] [email protected] 8 points 1 week ago* (last edited 1 week ago) (1 children)

Yes. The training data is probably a few hundred petabytes.
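For a rough sanity check on the weights side: file size is approximately parameter count times bytes per parameter. Here's a minimal sketch of that arithmetic. The parameter counts and precisions below are hypothetical examples picked for illustration, not figures for any particular model, and `weights_size_gb` is just a helper name made up for this snippet.

```python
def weights_size_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate size of model weights in gigabytes (1 GB = 10**9 bytes).

    Ignores file-format overhead, which is small next to the weights themselves.
    """
    return num_params * bytes_per_param / 1e9


# Hypothetical parameter counts, for illustration only.
example_models = [("7B", 7e9), ("70B", 70e9), ("600B", 600e9)]

# Common storage precisions: 16-bit floats use 2 bytes, 8-bit values use 1 byte.
precisions = [("16-bit", 2), ("8-bit", 1)]

for label, params in example_models:
    for dtype, nbytes in precisions:
        size = weights_size_gb(params, nbytes)
        print(f"{label} parameters at {dtype}: ~{size:,.0f} GB")
```

So a model in the hundreds of billions of parameters lands in the several-hundred-gigabyte range from weights alone, with no training data involved.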

[–] thespcicifcocean 2 points 1 week ago (1 children)
[–] BradleyUffner 3 points 1 week ago

Yeah, some models are trained on pretty much the entire content of the publicly accessible Internet.

[–] BradleyUffner 8 points 1 week ago* (last edited 1 week ago)

You don't download the training data when running an LLM locally. You are downloading the already baked model.