Technology

59985 readers

2566 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

founded 2 years ago

MODERATORS

995

Results of the "Can you tell which images are AI generated?" survey (i.vgy.me)

submitted 1 year ago* (last edited 1 year ago) by [email protected] to c/technology

156 comments fedilink hide all child comments

Previous posts: https://programming.dev/post/3974121 and https://programming.dev/post/3974080

Original survey link: https://forms.gle/7Bu3Tyi5fufmY8Vc8

Thanks for all the answers, here are the results for the survey in case you were wondering how you did!

Edit: People working in CS or a related field have a 9.59 avg score while the people that aren’t have a 9.61 avg.

People that have used AI image generators before got a 9.70 avg, while people that haven’t have a 9.39 avg score.

Edit 2: The data has slightly changed! Over 1,000 people have submitted results since posting this image, check the dataset to see live results. Be aware that many people saw the image and comments before submitting, so they've gotten spoiled on some results, which may be leading to a higher average recently: https://docs.google.com/spreadsheets/d/1MkuZG2MiGj-77PGkuCAM3Btb1_Lb4TFEx8tTZKiOoYI

you are viewing a single comment's thread
view the rest of the comments

[–] SpaceNoodle 40 points 1 year ago (1 children)

Did you not check for a correlation between profession and accuracy of guesses?

[–] [email protected] 57 points 1 year ago (4 children)

I have. Disappointingly there isn't much difference, the people working in CS have a 9.59 avg while the people that aren't have a 9.61 avg.

There is a difference in people that have used AI gen before. People that have got a 9.70 avg, while people that haven't have a 9.39 avg score. I'll update the post to add this.

[–] [email protected] 12 points 1 year ago (1 children)

Can we get the raw data set? / could you make it open? I have academic use for it.

[–] [email protected] 24 points 1 year ago* (last edited 1 year ago) (3 children)

Sure, but keep in mind this is a casual survey. Don't take the results too seriously. Have fun: https://docs.google.com/spreadsheets/d/1MkuZG2MiGj-77PGkuCAM3Btb1_Lb4TFEx8tTZKiOoYI

Do give some credit if you can.

[–] [email protected] 15 points 1 year ago

Of course! I'm going to find a way to integrate this dataset into a class I teach.

[–] [email protected] 10 points 1 year ago* (last edited 1 year ago) (1 children)

If I can be a bother, would you mind adding a tab that details which images were AI and which were not? It would make it more usable, people could recreate the values you have on Sheet1 J1;K20

[–] [email protected] 10 points 1 year ago (1 children)

Done, column B in the second sheet contains the answers (Yes are AI generated, No aren't)

[–] [email protected] 4 points 1 year ago

Awesome! Thanks very much.

[–] [email protected] 1 points 1 year ago (1 children)

I’d be curious to see the results broken down by image generator. For instance, how many of the Midjourney images were correctly flagged as AI generated? How does that compare to DALL-E? Are there any statistically significant differences between the different generators?

[–] [email protected] 1 points 1 year ago

Are there any statistically significant differences between the different generators?

Every image was created by DALL-E 3 except for one. I honestly got lazy so there isn't much data there. I would say DALL-E is much better in creating stylistic art but Midjourney is better at realism.

[–] MooseBoys 10 points 1 year ago

Sampling from Lemmy is going to severely skew the respondent population towards more technical people, even if their official profession is not technical.

[–] MysticKetchup 6 points 1 year ago (1 children)

If you do another one of these, I would like to see artist vs non-artist. If anything I feel like they would have the most experience with regular art, and thus most able to spot incongruency in AI art.

[–] [email protected] 1 points 1 year ago

I don't feel that's true coming from more "traditional" art circles. From my anecdotal experience, most people can't tell AI art from human art, especially digital and the kind the examples are from - meaning, hobbyist/semi-pro/pro deviant art type stuff. The examples seem obviously hand picked from both non-AI and AI-side to eliminate any differences as far as possible. And I feel both, the inability to tell the difference and the reason the dataset is what it is is because, well, they're very similar, mainly because the whole deviant art/art station/whatever scene is a masssssive part of the dataset they use to train these Ai-models, closing the gap even further.

I'm even a bit of a stickler when it comes to using digital tools and prefer to work with pens and paints as far as possible, but I flunked out pretty bad, but then again I can't really stand this deviant art type stuff so I'm not a 100% familiar, a lot of the human made ones look very AI.

I'd be interested in seeing the same, artist vs. non-artist survey, but honestly I feel it's the people more familiar with specifically AI-generated art that can tell them apart the best. They literally specifically have to learn (if you're good at it) to spot the weird little AI-specific details and oopsies to not make it look weird and in the uncanny valley.