this post was submitted on 15 Feb 2024
86 points (84.7% liked)

Technology

59756 readers
2557 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 2 years ago
MODERATORS
top 25 comments
sorted by: hot top controversial new old
[–] [email protected] 98 points 9 months ago* (last edited 9 months ago) (2 children)

OpenAI wants to eat everyone's lunch. They don't even respect robots.txt.

I'm gonna be real, I don't think an AI company stealing every bit of data I ever put online to ingest into their AI so they can use it to make money really falls under "fair use."

If it were individuals, who were doing it for shits and giggles, and not to make a profit, I'd have a different view. Because I kind of don't care if you're just some random asshat making an art project and aren't using it to try make beaucoup bucks on the back of the whole planets intellectual labor.

Fair Use should be for humans not corporations. Fuck this "corporations are people too" schtick, they aren't and the fact that they're taking this data and using it to profit really should be viewed no differently than the argument that people made about The Pirate Bay, that through piracy, they were making money, which is what made it wrong.

If we keep allowing the rich to just steal from the public commons because of dumb shit like corporations having the same rights as people, we're barrelling headlong into a Cyberpunk Dystopia, if we're not there already.

Humans who are creating art and culture should have rights to Fair Use, not giant fucking corporations using it to profit wildly at the public expense. Copyright is all fucked up and broken, and this is a great example of why, because we're aiming provisions that were meant to protect the public are now aimed at protecting corporations profits.

[–] QuandaleDingle 19 points 9 months ago* (last edited 9 months ago) (1 children)

This is all exactly what I've been thinking, and better parsed too. If "we" don't guide AI with the law now, we're screwed.

[–] [email protected] 10 points 9 months ago (2 children)

good luck trying to convince our capitalist overlords to stop making money.

we need more than laws yo

[–] [email protected] 5 points 9 months ago (1 children)

It's not even that the lawmakers are capitalists, it's that they are ancient, and have no clue about this. There are sitting congresspeople who proudly say they've never sent an email.

[–] [email protected] 1 points 9 months ago (1 children)

I strongly doubt legislators are innocent old men, just caught off guard by tech, even if some might. They know what it is and are trying to figure out how to control it.

And the lawmakers don't even need to be capitalists themselves (even though they mostly are), they just need to be in their pockets. Its not a coincidence their legislation is almost always beneficial to corporations one way or the other.

Hell, in the US lobbying is legal, its not even a secret.

[–] [email protected] 2 points 9 months ago

I never called anyone innocent and never implied it. I was highlighting the true issue is that our whole legislature is completely outclassed by the progress of tech.

[–] QuandaleDingle 2 points 9 months ago
[–] [email protected] 4 points 9 months ago

I don't like corporations using the data I put online to train their AI either. I'd happily give it to people to train a FOSS AI tho. I've also contributed my voice to Mozillas Common Voice project, which is an open dataset that everyone is allowed to use. If it's something that everyone can use and benefit from equally, I'm happy to help. I'm not happy to help some corporation make even more profits tho. At least not without getting payed.

[–] [email protected] 28 points 9 months ago (3 children)

That doesn't sound very "Open".

[–] [email protected] 10 points 9 months ago

Sort of like if there was a guy named Open Kevin, whose whole thing was walking into people's homes and eating all of their food.

[–] [email protected] 9 points 9 months ago

Oh it's open alright. Open to making money.

[–] [email protected] 3 points 9 months ago

No, you're open.

[–] [email protected] 24 points 9 months ago* (last edited 9 months ago) (1 children)

Yeah, they've been wanting that for a while. Microsoft and Google both already have AI search products, and the only thing we've really learned from those are that AI search products don't work.

Also

Altman is Google’s nightmare that it can’t wake up from.

Holy mother of dickriding. This article mainly just seems like a list of statements for AI tech bros to parrot to each other.

[–] [email protected] 6 points 9 months ago

ai search products don’t work

Idk how you got to that conclusion. I’m trying to change my habit of googling coding problems because phind.com will 9/10 times have an answer faster and more accurate than if I googled it.

[–] hperrin 16 points 9 months ago

I can’t imagine that whatever they build will be any worse than Google Search.

[–] nucleative 14 points 9 months ago (2 children)

OpenAI's existential problem is that they'll eat their own lunch and then have nothing left. The reason people make useful content now and give it away for free is because they can get paid for the traffic.

Take that traffic away and all the content goes behind paywalls and login screens where OpenAI can't touch it.

[–] [email protected] 5 points 9 months ago (1 children)

But the content has already been absorbed. I wouldn’t be surprised if they have all of it sucked up (many would argue illegally) and stored as a corpus for them to iterate onto. It’s not like they go out to touch all the web every time they train a new version of their model.

[–] platypus_plumba 3 points 9 months ago (1 children)

Right, they already have scary amounts of data.

[–] QuaternionsRock 1 points 9 months ago

One of the craziest facts about GPT (to me) is that it was trained on 570GB of text data. That’s obviously a lot of text, but it’s bewildering to me that I could theoretically store their entire training dataset on my laptop.

[–] [email protected] 4 points 9 months ago (1 children)

Lol people will literally bring openai with them past paywalls and logins.

[–] [email protected] 1 points 9 months ago

Exactly, not getting money from almost all visits is still better than not getting any visits

[–] GlitzyArmrest 11 points 9 months ago

If they eventually do have a search engine, it definitely can't be trusted. GPT-4 loves to hallucinate.

[–] [email protected] 4 points 9 months ago

awesome. google ruined the internet, would be great for them to face an existential threat

[–] [email protected] 4 points 9 months ago

This is the best summary I could come up with:


The AI search engine could be a new feature for ChatGPT, or a potentially separate app altogether.

Microsoft Bing would allegedly power the service from Sam Altman, which could be the most serious threat Google Search has ever faced.

The report says this new search product could be faster than ChatGPT, without sacrificing its powerful summarizing abilities.

Now, OpenAI is coming directly for the crown, in an attempt to dethrone the most dominant internet service of all time.

Perplexity, a small AI search startup with backing from Jeff Bezos, is also competing for Google’s crown.

If such a small team can steal 10 million people from Google, imagine what OpenAI and Microsoft can do.


The original article contains 405 words, the summary contains 114 words. Saved 72%. I'm a bot and I'm open source!

[–] dunestorm 4 points 9 months ago

This guy looks like AI.