this post was submitted on 23 May 2024
931 points (99.6% liked)

TechTakes

1013 readers
237 users here now

Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.

For actually-good tech, you want our NotAwfulTech community

founded 1 year ago
MODERATORS
 

Source

I see Google's deal with Reddit is going just great...

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 78 points 1 month ago (3 children)

Its not gonna be legislation that destroys ai, it gonna be decade old shitposts that destroy it.

[–] MalachaiConstant 48 points 1 month ago (4 children)

Everyone who neglected to add the "/s" has become an unwitting data poisoner

[–] [email protected] 1 points 1 month ago

@MalachaiConstant Or they're a Perl or bash programmer.

[–] [email protected] 1 points 1 month ago

@MalachaiConstant @dumbass I'd be interested to know how few corpus linguists are actually doing LLM research

[–] [email protected] 1 points 1 month ago

@MalachaiConstant @cstross what about the federal statisticians who slip /s into their online reports?

[–] [email protected] 18 points 1 month ago

Well now I'm glad I didn't delete my old shitposts

[–] [email protected] 15 points 1 month ago (2 children)

@dumbass @db0

I suppose we should be glad that they aren’t training on old 4chan/8chan posts.

[–] [email protected] 20 points 1 month ago (1 children)
[–] [email protected] 6 points 1 month ago (2 children)

@harrys_balzac

Posts there are expired and deleted over time, so unless someone's made an effort to archive them, they're gone.

Of course, the AI people could hoover up new horrible posts.

[–] [email protected] 7 points 1 month ago (1 children)

I would be surprised if someone hasn't been scraping it for years.

[–] [email protected] 9 points 1 month ago

**Moe.archive and 4chan archive have entered the chat. **

[–] [email protected] 5 points 1 month ago

Yea there are multiple 4chan archives...

[–] [email protected] 9 points 1 month ago

Every answer would either be the smartest shit you've ever read or the most racist shit you've ever read