this post was submitted on 12 Jun 2023
14 points (100.0% liked)

Experienced Devs

3981 readers
1 users here now

A community for discussion amongst professional software developers.

Posts should be relevant to those well into their careers.

For those looking to break into the industry, are hustling for their first job, or have just started their career and are looking for advice, check out:

founded 2 years ago
MODERATORS
 

cross-posted from: https://lemmy.world/post/76533

One of the arguments made for Reddit's API changes is that they are now the go to place for LLM training data (e.g. for ChatGPT).

https://www.reddit.com/r/reddit/comments/145bram/addressing_the_community_about_changes_to_our_api/jnk9izp/?context=3

I haven't seen a whole lot of discussion around this and would like to hear people's opinions. Are you concerned about your posts being used for LLM training? Do you not care? Do you prefer that your comments are available to train open source LLMs?

(I will post my personal opinion in a comment so it can be up/down voted separately)

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 6 points 1 year ago (6 children)

Scraping open content is OK. Search engines have been doing that, it's their main job.

LLM won't exist without large inputs, hehe, and the internet is a good source for a big volume of language, most of which can even make sense.

I don't feel like Reddit should be against LLMs, ignoring their bogus claims. At least I hope GitHub doesn't share private and licenced repos.

[–] FearTheCron 4 points 1 year ago (5 children)

I was wondering if someone would bring up search engine indexing. Google certainly has the upper hand for LLM training data with Reddit's new API change since they have the comments anyway. This is a big reason I fear these API changes, it is very much concentrating power in the hands of already powerful companies.

[–] [email protected] 4 points 1 year ago (4 children)

Always has been meme.jpg

I really don't think Reddit changed because of the AI, it's just for the IPO, trying to pump and dump it sky high.

It's really sad when you imagine what we could do as a species, if we could work together instead of trying to one-up each other.

It kind of brings me back to decentralized services, which for me is the ultimate freedom model, and I'm loving this alternative to Reddit.

[–] FearTheCron 2 points 1 year ago (1 children)

I am cautiously optimistic about the decentralization and federation. But I think the biggest hurdle is developing the user base right now. ExperiencedDevs is the only subreddit I followed before this all started that directly linked a Lemmy alternative.

[–] [email protected] 1 points 1 year ago (1 children)

I'm not sure which subreddit mentioned Lemmy, but I've never been to r/ExperiencedDevs.

I've been using my Matrix instance for 99% of my private conversations since 2018, so I know it works, but also see that people won't change unless their #1 solution isn't taken away.

So right now it's do or die for any Reddit alternative.

[–] FearTheCron 1 points 1 year ago (1 children)

r/ExperiencedDevs pointed at programming.dev specifically which is why I am giving it a shot. I have used decentralized stuff in the past like Usenet and IRC. I kinda miss the lack of corporate overlords.

https://www.reddit.com/r/ExperiencedDevs/comments/147ebxd/experienceddevs_will_go_dark_until_the_end_of_the/

[–] [email protected] 1 points 1 year ago

I found this instance from the list, checked out any interesting names, saw the content, and this one looked the best for me.

I will do my own instance in the future, but it will take about a month before my schedule clears out.

It's kind of weird though, I always wanted to microblog, but never really found the time, now I'm thinking of using Lemmy for that.

load more comments (2 replies)
load more comments (2 replies)
load more comments (2 replies)