this post was submitted on 23 May 2024
933 points (99.6% liked)
TechTakes
1436 readers
147 users here now
Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.
This is not debate club. Unless it’s amusing debate.
For actually-good tech, you want our NotAwfulTech community
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Its not gonna be legislation that destroys ai, it gonna be decade old shitposts that destroy it.
Everyone who neglected to add the "/s" has become an unwitting data poisoner
Corollary: Everyone who added the /s is a collaborator of the data scraping AI companies.
@MalachaiConstant
@MalachaiConstant Or they're a Perl or bash programmer.
@MalachaiConstant @dumbass I'd be interested to know how few corpus linguists are actually doing LLM research
@MalachaiConstant @cstross what about the federal statisticians who slip /s into their online reports?
Well now I'm glad I didn't delete my old shitposts
@dumbass @db0
I suppose we should be glad that they aren’t training on old 4chan/8chan posts.
...yet
@harrys_balzac
Posts there are expired and deleted over time, so unless someone's made an effort to archive them, they're gone.
Of course, the AI people could hoover up new horrible posts.
I would be surprised if someone hasn't been scraping it for years.
**Moe.archive and 4chan archive have entered the chat. **
Yea there are multiple 4chan archives...
Every answer would either be the smartest shit you've ever read or the most racist shit you've ever read