this post was submitted on 15 Feb 2024
86 points (84.7% liked)
Technology
59769 readers
3782 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
OpenAI's existential problem is that they'll eat their own lunch and then have nothing left. The reason people make useful content now and give it away for free is because they can get paid for the traffic.
Take that traffic away and all the content goes behind paywalls and login screens where OpenAI can't touch it.
But the content has already been absorbed. I wouldn’t be surprised if they have all of it sucked up (many would argue illegally) and stored as a corpus for them to iterate onto. It’s not like they go out to touch all the web every time they train a new version of their model.
Right, they already have scary amounts of data.
One of the craziest facts about GPT (to me) is that it was trained on 570GB of text data. That’s obviously a lot of text, but it’s bewildering to me that I could theoretically store their entire training dataset on my laptop.
Lol people will literally bring openai with them past paywalls and logins.
Exactly, not getting money from almost all visits is still better than not getting any visits