this post was submitted on 29 Apr 2024

195 points (94.9% liked)

ChatGPT provides false information about people, and OpenAI can’t correct it (noyb.eu)

submitted 7 months ago by [email protected] to c/technology

[–] [email protected] 28 points 7 months ago (3 children)

Uh, I understand the sentiment, but the model doesn't know anything. And it's legit really hard to differentiate between factual things and random bullshit it made up.

[–] [email protected] 18 points 7 months ago (1 children)

Was gonna say, the AI doesn't make up or admit bullshit, it's just a very advanced prediction algorithm. It responds with the combination of words that is most likely the expected answer.

Whether that is accurate or not is part of training it, but you'll never get 100% accuracy for every query.
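
To make "most likely combination of words" concrete, here is a minimal sketch of picking one next token. The scores in `toy_logits` are made up for illustration and do not come from any real model; a real LLM produces scores like these for tens of thousands of tokens at every step.

```python
import math

def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(logits.values())  # subtract the max for numerical stability
    exps = {tok: math.exp(score - peak) for tok, score in logits.items()}
    total = sum(exps.values())
    return {tok: value / total for tok, value in exps.items()}

# Hypothetical scores for the token that follows "The capital of France is"
toy_logits = {"Paris": 5.0, "Lyon": 2.0, "London": 1.0, "banana": -3.0}

probs = softmax(toy_logits)
# Greedy decoding: take the single most probable token.
next_token = max(probs, key=probs.get)
print(next_token, round(probs[next_token], 3))
```

Nothing in this loop checks whether the chosen token is true. It only checks which token scored highest.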

[–] [email protected] 1 points 7 months ago (3 children)

If it can name what the most likely combination is, couldn't it also know how likely that combination of words is?

[–] [email protected] 7 points 7 months ago* (last edited 7 months ago)

It's not actually deciding anything; the "AI thinking" framing is really marketing fluff. But yes, that's called a confidence rating, and it does have one. But at the scale of something like ChatGPT, which uses a snapshot of the entire internet and is non-mutable, there's no way to train it for every possible question. If you ask about a topic that 99% of the internet gets wrong, it'll respond with the wrong thing with 99% confidence.
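
A rough way to see the "99% wrong, 99% confident" point is a toy model that only counts what it saw in training. This is a deliberately simplified stand-in, not how ChatGPT is built, and the prompt and answer labels are placeholders.

```python
from collections import Counter

def train(corpus):
    """Count which answer follows each prompt in the training text."""
    counts = {}
    for prompt, answer in corpus:
        counts.setdefault(prompt, Counter())[answer] += 1
    return counts

def predict(counts, prompt):
    """Return the most frequent answer and its relative frequency."""
    seen = counts[prompt]
    answer, hits = seen.most_common(1)[0]
    return answer, hits / sum(seen.values())

# Placeholder data: 99 of 100 training examples repeat the wrong answer.
prompt = "some commonly misunderstood question"
corpus = [(prompt, "wrong_answer")] * 99 + [(prompt, "right_answer")]

model = train(corpus)
answer, confidence = predict(model, prompt)
print(answer, confidence)
```

The "confidence" here measures how often the answer appeared in the training data, not whether it is correct, which is why a high number is not a reliable signal for "I don't know".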

[–] [email protected] 3 points 7 months ago

If it has been trained using questionable sources, or if its training data includes sarcastic responses (without understanding that context), it isn't hard to imagine how confidently wrong some of the responses could be.

[–] [email protected] 3 points 7 months ago

No, because that requires it to understand the words. It doesn't.

[–] [email protected] 8 points 7 months ago (1 children)

Yeah, no one can make it say "I don't know" because it is not really AI. Business bros decided to call it that and everyone smiled and nodded. LLMs are 1 small component (maybe) of AI. Maybe 1/80th of a true AI or AGI.

Honestly the most impressive part of LLMs is the tokenizer that breaks down the request, not the predictive text button masher that comes up with the response.

[–] [email protected] 10 points 7 months ago

Honestly the most impressive part of LLMs is the tokenizer that breaks down the request, not the predictive text button masher that comes up with the response.

Yes, exactly! Its ability to parse the input is incredible. It's the thing that has that "wow" factor, and it feels downright magical.

Unfortunately, that also makes people intuitively trust its output.

[+] givesomefucks -14 points 7 months ago (2 children)

It "knows" as in it has access to the information and the ability to provide the right info for the right context.

At any part of that process the AI can just "bullshit" and fill in the gaps with random stuff.

Which is what you want when it's "learning". You want it to try so its attempt can be rated, and the relevant info added to its "knowledge".

But when consumers are using it, you want it to say "I can't answer that". But consumers are usually stupid and will buy/use the one that says "I can't answer that" the least.

And it’s legit really hard to differentiate between factual things and random bullshit it made up.

Which is why AI should tell end users "I don't know" more often.

[–] NounsAndWords 10 points 7 months ago

Which is why AI should tell end users “I don’t know” more often.

If you feel this is a simple solution, I strongly suggest you write up exactly how you do this and make yourself a billion dollars.

[–] [email protected] 8 points 7 months ago (1 children)

It “knows” as in it has access to the information and the ability to provide the right info for the right context.

It doesn't, though, any more than you have access to the information in a pile of 10 million shredded documents.

[+] givesomefucks -15 points 7 months ago (3 children)

Right, in this case that we're talking about...

Do you not understand how "answer unavailable" is a better answer than taking a small percent of strips of paper at random and filling in the rest with words that sound relevant?

It's like Mad Libs

[–] [email protected] 7 points 7 months ago

taking a small percent of strips of paper at random and filling in the rest with words that sound relevant?

It's like Mad Libs

Right. They're text generators. That's the technology. It can't do what you're demanding because that's not how it works. LLMs aren't magic answer machines. They don't know when to say "answer not available". They don't know what they're being asked. They don't know anything.

[–] [email protected] 2 points 7 months ago

That is what LLMs do in EVERY conversation. Most of the time you don't notice it, because it fits your expectations.

[–] then_three_more 1 points 7 months ago

You know that "answer unavailable" is better because you have real intelligence. An LLM is just some mathematical functions, so it can't do that. If it could, it would be getting much closer to actually being AI.