this post was submitted on 31 Jan 2025
255 points (97.4% liked)

Technology

61300 readers
2771 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

I wonder what his first clue was.

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 29 points 16 hours ago (2 children)

After seeing that the public was willing to call DeepSeek “open source” for releasing 800 lines of Python, an opaque model, and a PDF vaguely describing (or just praising) the proprietary training framework… Yeah, I imagine he feels like he missed an opportunity.

[–] [email protected] 44 points 11 hours ago (1 children)

It's been a few days and a simple search reveals it's already been reproduced by many different bodies using the "vague" pdf. What's this disservice for?

[–] KingRandomGuy 1 points 41 minutes ago

TBH the paper is a bit light on the details, at least compared to the standards of top ML conferences. A lot of DeepSeek's innovations on the engineering front aren't super well documented (at least well enough that I could confidently reproduce them) in their papers.

[–] MITM0 6 points 11 hours ago

At least we have HuggingFace