this post was submitted on 12 Jul 2023

278 points (97.6% liked)

Technology

63178 readers

3826 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

278

The world's most-powerful AI model suddenly got 'lazier' and 'dumber.' A radical redesign of OpenAI's GPT-4 could be behind the decline in performance. (www.businessinsider.com)

submitted 2 years ago by L4s to c/technology

77 comments fedilink hide all child comments

Users of OpenAI's GPT-4 are complaining that the AI model is performing worse lately. Industry insiders say a redesign of GPT-4 could be to blame.

top 50 comments

sorted by: hot top controversial new old

[+] parpol@programming.dev 97 points 2 years ago (8 children)

[deleted]

[–] conditional_soup@lemm.ee 27 points 2 years ago (1 children)

This isn't sustainable. They're banking that nobody else is going to be able to achieve GPT-4-like quality, and what with us basically being at near the bottom of the vertical bit of the growth curve, I'd say that's a little like betting that nobody's going to be able to build a car that beats the Model T's performance. Meta is trying to tackle very large language models in the same way that they got React to be so good and widely supported: by taking it open source. Google, on the other hand, is currently working on having LLMs running natively on phones and tablets. That's not to speak of the fully open source models. Yeah, running a 1.6 trillion parameter GPT-based LLM is fucking expensive and difficult to replicate, but there are newer, more efficient techniques popping up around LLMs at a dizzying pace. It's only a matter of time before someone comes up with something that's at least as good as GPT 4.

[–] jrs100000 14 points 2 years ago (1 children)

A popular venture capital backed tech project with an unsustainable business model? Now Ive heard everything. ~/s~

[–] conditional_soup@lemm.ee 5 points 2 years ago

Yeah, that's just crazy talk. Next you're going to tell me that they're going to start hand crafting bills and spending millions in advertising to get them passed.

[–] Marsupial@quokk.au 18 points 2 years ago (2 children)

Good, they should be seperate.

You don’t want a medical llm trained on Internet memes or a coding llm trained to write poetry. Specialisation exists for a reason.

[–] brsrklf@compuverse.uk 5 points 2 years ago (3 children)

Honest question, why would you want a medical LLM anyway? Other kinds of AI, sure, like diagnosis help through pattern learning on medical imaging, etc, that I can understand.

How is a language based approach that completely abstracts away actual knowledge, and just tries to sound "good enough" any kind of useful in a medical workflow?

load more comments (3 replies)

load more comments (1 replies)

[–] btaf45 11 points 2 years ago (1 children)

A lot of people in the media are routinely confused about the different between AI and ordinary software. They are started to call all software "AI" now.

[–] beanz 6 points 2 years ago* (last edited 2 years ago)

Can you quantify the difference? Far as I can tell, there's just an imaginary line where software becomes AI just because the logic filtering it depends on to operate is sufficiently complex. The term doesn't really seem to be a useful categorization either, e.g. the fundamentally different approaches of diffusion models and transformer models.

[–] RIotingPacifist 9 points 2 years ago (3 children)

But the only thing it's actually good at is generating languages, if they try and pretend to know stuff in fields, they're quickly exposed as frauds.

[–] HelloHotel 5 points 2 years ago* (last edited 2 years ago) (1 children)

I cant express my diappointment with chatgpt, they let loose a bot that makes content farms shreek in joy but messes up basic things if their is no well treaded answer, wont give you non mainstream answers (you likely already know and watched what it tells you is "really obscure anime") And jenuinely has no tolerance for error, from you or itself

[–] ChaoticEntropy@feddit.uk 7 points 2 years ago* (last edited 2 years ago)

I think the fact that they are sitting on that sweet, sweet first-to-market money consoles them somewhat.

[–] vacuumflower@lemmy.sdf.org 5 points 2 years ago

Ah, yes, when I was a kid, I would try to read big texts I understood nothing of and imitate something similar. I thought it made me smarter.

In some sense it did - probabilities of certain words being connected in a certain way, if you make some connection between them and real entities, are useful.

I mean, it did work at school, just say some water without turning on your brain. I sometimes start talking like this when I panic after a question.

[–] MercuryUprising 4 points 2 years ago (1 children)

It doesn't even "know" language. Every time I see it write a poem it reads like something a 3rd grader would come up with. At the end of the day, language is way to explain your experience. An LLM doesn't have experiences.

load more comments (1 replies)

[–] Lifesupport@lemmy.fmhy.ml 6 points 2 years ago

After the leak of how their system is configured I think this makes the most sense.

load more comments (3 replies)

[–] nbailey@lemmy.ca 87 points 2 years ago* (last edited 2 years ago) (6 children)

The model has become inbred because it’s now impossible to scrape the web without AI content getting ingested, which is full of “hallucinations” and other weird artifacts. The last opportunity to get “uncontaminated” training data was sometime in mid 2022.

Not to say that it’s causing this particular problem, but this issue will emerge eventually. Garbage in = garbage out. Eventually GPT-19 will grow a mighty Habsburg chin.

[–] jantin 28 points 2 years ago* (last edited 2 years ago) (3 children)

Maybe not yet, but...

Spez will turn Reddit into a bot farm and sell this as training data
Musk turns Twitter into a bigoted cesspool and will sell this as training data, which will subsequently be flagged for low quality (also: a botfarm)
Threads is a corporate ad dashboard (and we already know how easy it is to GPT copy) and Zuck will sell this as training data
Facebook is either dead or only good for boomers and Poles
blogs are dead
Fediverse is out there waiting to be scraped but possibly too small to sustain a big model

We'te getting there, hopefully.

[–] cyberpunk007 7 points 2 years ago (1 children)

Scrapped?... Or scraped?

[–] jantin 8 points 2 years ago (1 children)

absolutely scraped, fixed

[–] damnYouSun@sh.itjust.works 10 points 2 years ago

Also We'te, which I believe is a Klingon name.

load more comments (2 replies)

[–] minorninth 4 points 2 years ago

That hasn't happened yet. Most likely they quantized GPT-4 more. It's still based on the same training data.

load more comments (2 replies)

[–] monerobull@monero.town 33 points 2 years ago* (last edited 2 years ago) (1 children)

The lobotomies will continue. Free models will keep getting better.

[–] HelloHotel 11 points 2 years ago* (last edited 2 years ago)

The chatgpt people are really paranoid. Gpt-3 is so good at not halucinating that it often cant, even if it needs to do so to accomplish a task. Fearing the ai will confidently give the wrong answer.

[–] randon31415 29 points 2 years ago (1 children)

Not the first time OpenAI has done this. DALLE2 used to be the best AI art program in the world. Then OpenAI decided that they didn't want to get sued by celebrities, so they made it so that if a face came out that resembled a celebrity, it would be distorted. But every face kind of looks like someone famous. Ta da! Now DALLE2 can't do faces.

Want a crane shot areal image of a teen couple in a corvette driving off into the sunset? Well, you are now banned for life from the DALLE2 service, because DALLE2 produced an image of a 'shot teen' and that violates it's terms of service.

[–] Slacking@sh.itjust.works 4 points 2 years ago (1 children)

Dalle2 was always kind of shit tbh.

[–] randon31415 6 points 2 years ago

Dalle2 was great when it was free and stable diffusion didn't exist. I don't see the logic of: "Someone made a free version. Lets make the program worse and charge money for it!"

[–] hoshikarakitaridia@sh.itjust.works 26 points 2 years ago (1 children)

The only way in mind this dumbing down happens is by fumbling with the model. So that's the one thing we can be sure: the AI is most definitely changed while publicly staying "ChatGPT 4". I assume they are either using clipping or token limitations to split the server load but fucking up the result, or they are purposely dumbing it down to capitalise on it later by introducing other pay models like ppl already mentioned.

Either way they are shooting themselves in the foot because a bunch of ppl will unsubscribe either out of spite for the change or because it's just not worth it anymore for them.

[+] Something_Complex 8 points 2 years ago (1 children)

[removed by mod]

[–] IDatedSuccubi 15 points 2 years ago (2 children)

I remember one time a guy was trying to add SDL (a programming library) to Visual Studio (code editor and IDE), and said that it wouldn't link to a project no matter what he's done. You can google how to do this in five minutes, with video tutorials and everything, it's like a basic thing every programmer does in that IDE. Like 5 question threads later, turns out he was "following all ChatGPT steps" and they were all complete nonsense, just random functions of Visual Studio done with the filenames of SDL.

[–] btaf45 11 points 2 years ago* (last edited 2 years ago) (2 children)

lmfao. Why haven't most people figured out yet that chatgpt lies more often than not?

[–] MercuryUprising 5 points 2 years ago

No, because their sales team and ceo are spending millions so that people don't find that out.

[–] HelloHotel 4 points 2 years ago (1 children)

Its not even suppost to tell the truith, we warped it into doing so. Why AI lies

load more comments (1 replies)

[–] Jerkface@lemmy.ml 6 points 2 years ago

Might be right but in my experience a lack of skill in conversing with AI is a much greater factor in determining it's usefulness. It's almost always going to defer to the user. It's like when someone is dealing with tech support and they tell them to try turning it off and on again. If that really is the solution, and the user insists that it is not, CGPT is going to make something up just to appease the user's request.

Users have to know that CGPT isn't magic. How they behave affects how it behaves. Kind of like talking to actual people, which is what it's essentially trying to simulate.

[–] Donjuanme 25 points 2 years ago

AI taking a running leap at enshittification.

[–] balder1991 20 points 2 years ago (1 children)

Some people have been saying that since the beginning while some haven’t noticed this “decline”. It seems very subjective.

[–] tdawg 17 points 2 years ago

Honestly as a daily user I think it's a combination of it getting worse at understanding vague prompts and people bumbing up against edge cases more. I would suspect the former is due to things like prompt hardening but can only speculate, while the latter isn't hard to imagine just from frequent use.

[–] zikk_transport2 9 points 2 years ago (1 children)

lmao I was write back then :D

https://lemmy.world/post/687651

[–] mexicancartel@lemmy.dbzer0.com 15 points 2 years ago (2 children)

You mean "I was right* or "i wrote*"?

[–] Dicska 13 points 2 years ago

No no, he used to work as a wright. Built ships and shit.

load more comments (1 replies)

[–] ancientweasel@social.fossware.space 4 points 2 years ago

Just like most people after they achieve success.

load more comments