this post was submitted on 08 Dec 2024
457 points (94.5% liked)

Technology

59882 readers
4657 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 2 years ago
MODERATORS
457
The GPT Era Is Already Ending (www.theatlantic.com)
submitted 3 days ago* (last edited 3 days ago) by [email protected] to c/technology
 

If this is the way to superintelligence, it remains a bizarre one. “This is back to a million monkeys typing for a million years generating the works of Shakespeare,” Emily Bender told me. But OpenAI’s technology effectively crunches those years down to seconds. A company blog boasts that an o1 model scored better than most humans on a recent coding test that allowed participants to submit 50 possible solutions to each problem—but only when o1 was allowed 10,000 submissions instead. No human could come up with that many possibilities in a reasonable length of time, which is exactly the point. To OpenAI, unlimited time and resources are an advantage that its hardware-grounded models have over biology. Not even two weeks after the launch of the o1 preview, the start-up presented plans to build data centers that would each require the power generated by approximately five large nuclear reactors, enough for almost 3 million homes.

https://archive.is/xUJMG

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 1 points 13 hours ago (1 children)

I mean isn't it already that is included in the datasets? It's pretty much a mix of everything.

[–] [email protected] 1 points 12 hours ago

Not everything in the dataset is retrievable. It's very lossy. It's also extremely noisy with a lot of training data that's not education-worthy.

I suspect they'd make a purpose-built model trained mainly on what they actually would want to teach especially from good educators.