this post was submitted on 20 Aug 2023

Machine Learning - Learning/Language Models

submitted 1 year ago* (last edited 1 year ago) by [email protected] to c/[email protected]
 

Corresponding arXiv preprint: https://arxiv.org/abs/2308.03762

[–] [email protected] 1 points 1 year ago

I like to explain LLMs to people as "glorified autocompletes". They're just stringing words together in "the most rational way possible" based on the training data. They're not "sentient" or "smart", but they can still surprise our meat brains.

In other words, an LLM doesn't "know" anything, but it can still output a pattern that makes us go "Ooooooo it KNOWS".
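To make the "autocomplete" picture concrete, here's a toy sketch: a bigram counter that always picks the most frequent next word. To be clear, this is not how a real LLM works internally (those use neural networks over tokens, not lookup tables), and the names `train_bigrams`, `autocomplete` and the tiny corpus are made up for illustration. It only shows the "stringing words together based on the training data" idea in its crudest form.

```python
from collections import Counter, defaultdict


def train_bigrams(text):
    """Count, for every word, which words follow it in the training text."""
    follows = defaultdict(Counter)
    words = text.lower().split()
    for current, nxt in zip(words, words[1:]):
        follows[current][nxt] += 1
    return follows


def autocomplete(follows, start, max_words=5):
    """Greedily append the most frequent follower, one word at a time."""
    out = [start]
    for _ in range(max_words):
        candidates = follows.get(out[-1])
        if not candidates:
            break  # word never had a follower in the training text
        out.append(candidates.most_common(1)[0][0])
    return " ".join(out)


# Hypothetical toy "training data", chosen only for illustration.
corpus = "the cat sat on the mat and the cat sat on the rug"
model = train_bigrams(corpus)
print(autocomplete(model, "the", max_words=4))
```

The output reads like a sentence, yet all the program has is a table of word counts. It just continues the pattern it saw.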

Folks are getting better at training specific goals into their models, so the math that failed yesterday may work tomorrow and then fail again the day after. These problems will be solved in time, and we'll have a broader range of surprising output moments.

I dunno, just feels like a waste of an article for anyone in the know and confusing for those not paying attention. "ChatGPT doesn't have a soul!" Ya, duh . . .