this post was submitted on 13 Jul 2024
363 points (97.9% liked)
196
16563 readers
1603 users here now
Be sure to follow the rule before you head out.
Rule: You must post before you leave.
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I mean, yes?
That's very pithy, but the material used as training data was probably produced by artists attempting to create art using tools (ai and otherwise), as well as more mundane data designed and produced by humans with no ai tools and some produced by humans with almost exclusively ai tools.
You probably live in a different world than I do.
Don't chicken/egg this. All of the training data was man-made at some point. Until the first LLMs started outputting based on it.
Secondly, the amount of human-produced content and LLM-produced content that's in the training data is incomparable. And will continue to be so. Otherwise the models break.