Technology

65982 readers

7059 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

214

Zotac accidentally lists RTX 5090, RTX 5080, and RTX 5070 family weeks before launch — accidental listing seemingly confirms the RTX 5090 with 32GB of GDDR7 VRAM (www.tomshardware.com)

submitted 2 months ago by [email protected] to c/technology

49 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 1 points 2 months ago (1 children)

What's up with Qwen that makes it better than anything else?

[–] brucethemoose 4 points 2 months ago* (last edited 2 months ago)

It's just smarter with the same number of parameters. Try Qwen QwQ or Qwen coder 32B, see for yourself... it stacks up well against huge models like the 123B Mistral Large, or even GPT-4.

Why? Alibaba trained it well, presumably with better data than OpenAI or whomever else, though specifics are up for debate. Some suggests that bilingual training on English/Chinese (aka the two largest text corpuses in existance) significantly helps the model over mostly english. Some say the government just gave them better data. There's also suggestions that having so few GPUs compared to American AI companies made the Chinese "thrifty," and gave them far more incentive to be innovative rather than brute forcing models (which has diminishing returns).