this post was submitted on 14 Oct 2023
63 points (73.0% liked)

Technology

60133 readers
3014 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 2 years ago
MODERATORS
 

Meta made its Llama 2 AI model open-source because 'Zuck has balls,' a former top Facebook engineer says::Meta CEO Mark Zuckerberg took a big risk by making its powerful AI model Llama 2 mostly open source, according to Replit CEO Amjad Masad.

you are viewing a single comment's thread
view the rest of the comments
[–] just_another_person 39 points 1 year ago (1 children)

The model, weights, and pre-trained data sets are. The training tools are not. You could argue that it's not "truly FOSS" without the tools to create that data, but technically, the article is correct.

[–] BeefPiano 22 points 1 year ago (2 children)

The whole point of open-source is to be able to recreate it yourself so you can make changes. This is freeware. Free-as-in-beer, not free-as-in-speech. Hell, with freeware I can use it for commercial purposes, it’s not even as free as that.

[–] just_another_person 10 points 1 year ago (1 children)

In the AI world it's a bit different. You can do whatever you want with the model and weights data which will net you the functional part of the resulting product. Train, retrain, dissect, segment...etc. They're just not giving out the source for the actual engine. The people working with such things really only care about the data, and in most cases, would probably convert it to a different engine anyway.

[–] BeefPiano 5 points 1 year ago (1 children)

Can I remake the model only including Creative Commons sourced training material?

[–] just_another_person 4 points 1 year ago* (last edited 1 year ago) (1 children)

You can reuse the data however you want, yes. You just can't do it with their proprietary model. So, again, the ENGINE is not open source (the thing that drives their released version), but the model and data as it runs as released you can do whatever you want with.

[–] BeefPiano 2 points 1 year ago (1 children)

I thought I was only licensed for non-commercial use

[–] just_another_person 5 points 1 year ago

Nope. Free for educational, research, or commercial. I'm sure their license has some restrictions on what that actually means once you get to be competitive with the original as a product, but otherwise free unless you start a massive enterprise based on it, at which point you probably wouldn't use it anyway. It's just an LLM, it's not doing anything super special like folding proteins for drug development, or curing cancer.

[–] [email protected] 3 points 1 year ago (1 children)

Calling ML models "Open Source" is already confused. Because they are not programs, but rather formats, they don't come 1:1 with the source.

You can obtain a model and train it futher. Similliar how you can get JPEG file with permissive licence, edit it and share it. Having the GIMP/Photoshop project from the image was created from is helpful but not nessesary.

[–] dym_sh 1 points 1 year ago

here's core difference: the nature of ai-models is generative, but all layers in a .PSD file are inherently static.

better analogy would be rendering of a fractal — a limited subset of infinite possibilities, but to explore the rest of them you need both rules and data