Machine Learning - Learning/Language Models

24 readers

1 users here now

Discussion of models, thier use, setup and options.

Please include models used with your outputs, workflows optional.

Model Catalog

We follow Lemmy’s code of conduct.

Communities

Useful links

founded 1 year ago

MODERATORS

[email protected]

GPT-4 Can’t Reason (medium.com)

submitted 1 year ago* (last edited 1 year ago) by [email protected] to c/[email protected]

4 comments fedilink hide all child comments

Corresponding arXiv preprint: https://arxiv.org/abs/2308.03762

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 1 points 1 year ago

I like to explain LLMs to people as "glorified autocompletes". They're just stringing words together in "the most rational way possible" based on the training data. They're not "sentient" or "smart", but they can still surprise our meat brains.

In other words, it doesn't "know" anything, but can still output a pattern that makes us go "Ooooooo it KNOWS".

Folks are getting better at training specific goals into their models. So the math that failed yesterday may work tomorrow may fail the day after. These problems will be solved in time and we'll have a broader range of surprising output moments.

I dunno, just feels like a waste of an article for anyone in the know and confusing for those not paying attention. "ChatGPT doesn't have a soul!" Ya, duh . . .