this post was submitted on 04 Sep 2024
185 points (96.0% liked)
ChatGPT
8947 readers
1 users here now
Unofficial ChatGPT community to discuss anything ChatGPT
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Are we maybe talking about 57% of newly created content? Because I also have a very hard time believing that LLM generated content already surpassed the entire last few decades of accumulated content on the internet.
I'm too dumb to understand the paper, but it doesn't feel unlikely that this is a misinterpretation.
What I've figured out:
What I can't quite figure out:
The actual quote from the paper is:
And "multi-way parallel" means translated into multiple languages:
But yeah, no idea, what their "translation tuples" actually contain. They seem to do some deduplication of sentences, too. In general, it very much feels like just quoting those 57.1% without any of the context, is just a massive oversimplification.