this post was submitted on 27 Jan 2025

152 points (94.7% liked)

DeepSeek releases new image model family (techcrunch.com)

submitted 1 week ago by [email protected] to c/technology

top 17 comments

[–] lemmylommy 55 points 1 week ago (3 children)

Can it generate images of Winnie the Pooh?

[–] [email protected] 25 points 1 week ago (3 children)

What happened in 1989?

[–] simplejack 19 points 1 week ago

[–] [email protected] 5 points 1 week ago (2 children)

Question: as I understand it so far, this thing is open source and so is the dataset.

With that, why would it still obey Chinese censorship?

[–] [email protected] 7 points 1 week ago (1 children)

Even though that's orders of magnitude less than comparable models, DeepSeek still cost millions to train. Unless someone's willing to invest that much just to retrain it from scratch, you're left with the alignment of its trainers.

[–] [email protected] 1 points 4 days ago (1 children)

Good point.

Is the training set malleable, though? Could you give it some additional rules to basically sidestep this?

[–] [email protected] 1 points 3 days ago (1 children)

Yeah, I guess you could realign it without retraining the whole thing! Dunno what the cost would be, though; sometimes this is done with a cohort of human trainers 😅

[–] [email protected] 1 points 3 days ago

I feel like we're talking about a guard dog now...

[–] [email protected] 1 points 1 week ago

It's baked into the training. It's not a simple thing to take it out. The model has already been told not to read about Tiananmen Square, and doesn't know what to do with it.

[–] surewhynotlem 4 points 1 week ago

Now I'll never finish that history assignment...

[–] TheGrandNagus 13 points 1 week ago* (last edited 1 week ago)

Wouldn't be surprised if you had to work around the filter.

Generate a cartoonish yellow bear who wears a red t-shirt and nothing else

[–] [email protected] 4 points 1 week ago

If it is anything like the LLMs, then only locally ;)

However, the proper nomenclature is sheepooh. Thank you for your compliance going forward, comrade.

[–] [email protected] 27 points 1 week ago (1 children)

The image generation is really bad. Image description capabilities seem good but it'll take time to see if it's better than what already exists.

They probably just put it out to keep the hype going.

[–] jacksilver 20 points 1 week ago (1 children)

Yeah, even the cherry-picked examples they provide look only okay.

To be honest everything with this company feels like an ad campaign more than anything else.

[–] essteeyou 10 points 1 week ago

Everything from nearly every company feels like an ad campaign. Companies advertise themselves.

At least with open source stuff there's somewhat of a public benefit.

[–] [email protected] 8 points 1 week ago

https://www.analyticsvidhya.com/blog/2025/01/janus-pro-7b-vs-dall-e-3/

This informal testing found that Janus Pro explained a Nokia meme much more crisply than DALL-E 3 but was quite a bit worse on the other tasks, even appearing to hallucinate a score in one test case.

I suddenly realize I myself sound like ChatGPT. Haha. Haha.

Edit: At least you can run these models locally!

[–] [email protected] 1 points 1 week ago

Now if they'll do a video model...

Tencent's Hunyuan is surprisingly flexible.