this post was submitted on 14 Jul 2023
1011 points (98.4% liked)
Technology
59671 readers
3936 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
My gut tells me they are not deleted but rather simply no longer publicly available. Can't have these pesky AI bots training for free.
I think it's the opposite. These are private chats that can't be sold to the AI, that's why Reddit thinks they're worthless.
More likely just moved to cold storage to save money. It’s expensive to keep data in an easily accessible database. If you don’t need to access it you can move it to object storage for pennies on the dollar and still keep it accessible for whatever nefarious data brokers you want to sell it to in the future
They’re implementing new chat infrastructure and only replicated 2023/01/01 forward. It’s in the article.
Oh man.
To be able to have a long running project and decide to truncate years worth of data...
Just, drop it like you never need it again.
Apart from working at Reddit, sounds like a dream
How much data are you storing and how much do you think reddit is?
Right but the difference is Reddit is now charging an extortionate amount for data access via the API when compared to other platforms.
What do you think this (if achieved what was hoped) massive new flow of income was supposed to help sustain?
Oh yeah, infrastructure costs.
No need to be quite so condescending is there?
I fully understand how they work.
Seems like this growth they should be trying to achieve is fucking futile when they:
Not to mention, infrastructure costs lead to growth.
If you don’t have the resources to support your current platform, you shouldn’t be actively trying to grow the platform by discarding older content. It makes those accessing and making use of the content (be it individual or institutional) lose trust in the quality of data.
Great growth strategy that.
He means the business model they're going for with the IPO.
Growth means the value of the company. The shares. The stockholders getting dividends. It has fuck all to with user experience, or even user growth. It's about manipulating information and using clever accounting to get investors to give you money.
The company is worth fuck all if their sole “money generation” process is via user data and authentic conversation.
Users ARE Reddit.
No content, no value.
No value? No dividends.
They’re implementing new chat infrastructure and only replicated 2023/01/01 forward. It’s in the article.
Probably want to prevent people from deleting their own messages. Can't delete messages you don't have access to anymore.
100%. The idea that reddit would just permanently delete all those chat messages rather than just archive them away from the public is crazy. Even if they don't directly sell them to advertisers there's a shitload of value for private ML training
Can't have those pesky users deleting spez's retirement fund
Training on inaccessible private chat logs?