this post was submitted on 24 Jul 2024
183 points (96.0% liked)

Technology

59599 readers
3092 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 3 points 4 months ago (1 children)

That's a good point, it's probably way less load and overhead if Reddit and Google just sent info back and forth instead of scraping. Good way for Google to keep their spot as the favoured search engine and beat the competition too, since everything that comes up these days are articles full of SEO nonsense at best, then AI generated nonsense at worst. If nobody else can read the actual human responses, Google has a huge leg up. Also interesting to see that Google's honouring the txt file even when nobody's holding them to it.

I had no idea Twitter's search updated their index immediately after a comment is posted though. That's a lot of updates considering the amount of posts they get daily.

[–] [email protected] 2 points 4 months ago* (last edited 4 months ago)

I had no idea Twitter’s search updated their index immediately after a comment is posted though.

While I never had a Twitter account, it's the major reason that I used the service anonymously. In an unfolding event, like a natural disaster or something, it was absolutely unparalleled in its ability to rapidly comb through enormous amounts of information being plonked in by people around the world. I strongly prefer Reddit-style forum structure most of the time, but for issues for which there is no pre-existing communities and where the common issue is one that will only exist for a short period of time, I think that Twitter's ad-hoc connections between retweets and hashtags works much better than Reddit's association-of-comments-by-subreddit. I understand that Mastodon, unfortunately, doesn't have a full-text search feature, just searching based on exact hashtags. Actually...hmm. I was just talking about Kagi's search lens for the Threadiverse in another comment that I saw. I wonder if Kagi actually indexes Mastodon as well? That'd provide for similar functionality.

investigates

No, it looks like they only do the Reddit-alike Threadiverse (lemmy, kbin, mbin, etc), for which they use the term "Fediverse Forums".

investigates further

It does look like they index in real time, though, or at least quickly -- they probably are one of the institutions out there with an instance slurping up everything out there. I was able to find your comment on that search lens.

That’s a lot of updates considering the amount of posts they get daily.

Yeah, I'm sure that however the Twitter guys built it, they specifically designed it around permitting inexpensive index updates.