Funny: Home of the Haha

5973 readers

37 users here now

Welcome to /c/funny, a place for all your humorous and amusing content.

Looking for mods! Send an application to Stamets!

Our Rules:

Keep it civil. We're all people here. Be respectful to one another.
No sexism, racism, homophobia, transphobia or any other flavor of bigotry. I should not need to explain this one.
Try not to repost anything posted within the past month. Beyond that, go for it. Not everyone is on every site all the time.

Other Communities:

/c/[email protected] - Star Trek chat, memes and shitposts
/c/[email protected] - General memes

founded 2 years ago

MODERATORS

Anticorp

993

Good effort (startrek.website)

submitted 11 months ago by [email protected] to c/funny

129 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] RememberTheApollo_ 16 points 11 months ago (3 children)

MidJourney has the same problem. “A room that has no elephants in it” is the prompt.

There very much is an elephant present.

[–] nandeEbisu 12 points 11 months ago

Just don't talk about it and you're good.

[–] dipshit -2 points 11 months ago* (last edited 11 months ago) (1 children)

Try saying “a room” and leaving off the elephants. AI cannot understand “no” like you think it does.

[–] RememberTheApollo_ 8 points 11 months ago (1 children)

I think most of us understand that and this exercise is the realization of that issue. These AI do have “negative” prompts, so if you asked it to draw a room and it kept giving you elephants in the room you could “-elephants”, or whatever the “no” format is for the particular AI, and hope that it can overrule whatever reference it is using to generate elephants in the room. It’s not always successful.

[–] fidodo 2 points 11 months ago* (last edited 11 months ago)

I think the main point here is that image generation AI doesn't understand language, it's giving weight to pixels based on tags, and yes you can give negative weights too. It's more evident if you ask it to do anything positional or logical, it's not designed to understand that.

LLMs are though, so you could combine the tools so the LLM can command the image generator and even create a seed image to apply positional logic. I was surprised to find out that asking chat gpt to generate a room without elephants via dalle also failed. I would expect it to convert the user query to tags and not just feed it in raw.