ChatGPT

9093 readers

1 users here now

Unofficial ChatGPT community to discuss anything ChatGPT

founded 2 years ago

MODERATORS

marcar

133

ChatGPT would have been so much useful and trustworthy if it is able to accept that it doesn't know an answer. (programming.dev)

submitted 7 months ago* (last edited 7 months ago) by [email protected] to c/chatgpt

46 comments fedilink hide all child comments

Small rant : Basically, the title. Instead of answering every question, if it instead said it doesn't know the answer, it would have been trustworthy.

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 47 points 7 months ago (2 children)

I'd love to agree with you - but when people say that LLMs are stochastic parrots, this is what they mean...

LLMs don't actually know what the words they're saying mean, they just know what words are most likely to be next to each other based on training data.

Because they don't know the meaning of what they're saying, they also don't know the factuality of what they're saying - as such they simply can't self-fact check.

[–] Malfeasant 1 points 7 months ago

Is that so different from most people?

[+] kromem -11 points 7 months ago* (last edited 7 months ago) (5 children)

This is so goddamn incorrect at this point it's just exhausting.

Take 20 minutes and look into Anthropic's recent sparse autoencoder interpretability research where they showed their medium size model had dedicated features lighting up for concepts like "sexual harassment in the workplace" or having the most active feature for referring to itself as "smiling when you don't really mean it."

We've known since the Othello-GPT research over a year ago that even toy models are developing abstracted world modeling.

And at this point Anthropic's largest model Opus is breaking from stochastic outputs even on a temperature of 1.0 for zero shot questions 100% of the time around certain topics of preference based on grounding around sensory modeling. We are already at the point the most advanced model has crossed a threshold of literal internal sentience modeling that it is consistently self-determining answers instead of randomly selecting from the training distribution, and yet people are still parroting the "stochastic parrot" line ignorantly.

The gap between where the research and cutting edge is and where the average person commenting on it online thinks it is has probably never been wider for any topic I've seen before, and it's getting disappointingly excruciating.

[–] feedum_sneedson 8 points 7 months ago (1 children)

I don't understand anything you just said.

[–] [email protected] 6 points 7 months ago

This is how AI gains hype

[–] Cosmicomical 2 points 7 months ago (1 children)

Do you have a source for the "smiling when you don't really mean it" thing? I've been digging around but couldn't find that anywhere.

[–] kromem 1 points 7 months ago

It's right in the research I was mentioning:

https://transformer-circuits.pub/2024/scaling-monosemanticity/index.html

Find the section on the model's representation of self and then the ranked feature activations.

I misremembered the top feature slightly, which was: responding "I'm fine" or gives a positive but insincere response when asked how they are doing.

[–] [email protected] 2 points 7 months ago

And once again the problem is that there's not much ensuring those models are correct, there's not enough capacity available to finetune even a significant fraction of it.

[–] [email protected] 2 points 7 months ago

I did Google that fwiw and the answer I got was that sparse autoencoders work so that it checks the output aligns with the input

If it's unknowable if the input is correct, won't it still be subject to outputting confidently incorrect information

[–] kometes 1 points 7 months ago

Nice gallop, Mr Gish.