Briskfall

joined 1 year ago
[โ€“] [email protected] 2 points 1 year ago

Thanks to your post I've noticed that just typing the following string while using a search engine (Google in this case) we can get all of wikis that use MediaWiki that are indexed because they follow the same URL structure:

site:*.*/wiki/Main_Page

[โ€“] [email protected] 2 points 1 year ago (1 children)

I found out in another thread that it is NOT intended behavior here https://kbin.social/m/[email protected]/t/56323/YSK-that-you-can-edit-titles-on-Lemmy-unlike-Reddit#entry-comment-241691

Which means that hopefully, there should be a fix underway... I hope๐Ÿฅฒ.

[โ€“] [email protected] 4 points 1 year ago

I've had better results searching through the instances themselves because Google doesn't always index the keywords on time. On caveat of this method is that if the instance doesn't have the syncing out the instance where the info is from being propagated, then this trick would not work

[โ€“] [email protected] 1 points 1 year ago

While I'm not entirely sure about why things are set up the way they are, I guess that it was done on purpose seeing your response. Now, if explaining the why's and how's of this design is too much of a sensitive topic, I'm okay with dropping my questions and just going along with the ride ๐Ÿ‘.

If it's possible as a compromise, how about consider displaying both "the origin of the magazine" (instance) and the "name of the magazine", rather than the existing format where kbin.social appears after the title, followed by just the "name of the magazine" (omitting the original instance from which it hails).

For example, the current layout is:

TITLE

(kbin.social)

PREVIEW OF BODY TEXT

USER, 33 minutes ago to NAME_OF_MAGAZINE_WITHOUT_ORIGINAL_INSTANCE (like meta)

My suggestion is to include an option for displaying it like this (I do understand that it might be a clutter to some, but it would be a nice feature for others facing the same qualms as I do):

TITLE

(kbin.social)

PREVIEW OF BODY TEXT

USER, 33 minutes ago to NAME_OF_MAGAZINE_WITH_ORIGINAL_INSTANCE (like [email protected])

To provide some context, I came across a post on either main or meta on the front page. It had kbin.social listed below the title along with the "name of the magazine," but the original instance name was missing. I was about to express my confusion since it didn't align with my experience on kbin.social, but upon double-checking the URL, I realized the post was not related to kbin.social at all. This "issue" generated by the behavior seems to mainly affect mags like main and meta posts, but considering these posts do often appear on the front page (since I've enabled other instances to discover more exciting content), I think that it's issue worth exploring.

Hopefully in the meantime, I've figured better ways to work around this issue (to verify the instance of the magazine) on mobile ( apart from checking the address bar), which are to

  • click on the hamburger menu or
  • to scroll to the page's end.
[โ€“] [email protected] 7 points 1 year ago

Oh no, not a Pay to Win x HostileArchitecure collaboration!!

This is giving me peak ABoringDystopia vibes...

[โ€“] [email protected] 2 points 1 year ago

Uh wow this is the first time I've seen summoning the instance in reverse (what I'm aware of so far) and it feels like black magic on the other end... My head hurts now hahaha ๐Ÿ˜ตโ€๐Ÿ’ซ

 

The current behavior (I'm using the web client) as it is displays self-post as (kbin.social) under the thread title. Not sure if it is the result of a fallback behavior.

On mobile it gets really awkward because the URL address bar would truncate to the domain + TLD (the full URL is only down once the user double press the address bar which makes it cumbersome and adds an extra layer of step) and for certain topics such as metatalk the user viewing the thread would have a higher chance of confounding it kbin metatalk instead of maybe lemmy.world meta talk, etc.

[โ€“] [email protected] 1 points 1 year ago

Just found out a while ago that it varies by instance, e.g. for kbin.social, it's not good to add the exclamation mark and just leave it as community@instance bare.

[โ€“] [email protected] 5 points 1 year ago (1 children)

I used to not do it, but I've been influenced by Bing after talking to it for so long during the closed beta (I guess that this is an effect of subconsciously mirroring it so that I don't get kicked out before the 5 turn limit back in the days haha ๐Ÿ˜‚)

Then again, in diverse online communities, there are various styles and voices that are eventually formed to be what's "acceptable" be the general consensus. As Lemmy is very new, it has yet to find its voice yet... I think ๐Ÿค”.

 

AI-TRIGGER WARNING: I've asked ChatGPT to revise my writing because it was ass (writing a stream of coherent looking text is not my forte). Proceed at your own discretion.

Yes the emoji 's all on me, I've been too much influenced by Bing Chat lately---even ChatGPT took it out but then I pestered it to move it back.

Below this line it's all text that has been retouched by AI ๐Ÿ˜ฑ:


Title: Archiving Reddit Threads During Protests: Suggestions Needed

Body:

Hello everyone,

As many of you are aware, numerous Reddit subreddits are temporarily closed due to the ongoing protest. While I completely support this action, it is causing some issues with my hobby research. Many posts are being deleted or replaced with placeholder scripts, leading to a loss of valuable information. Source: https://lemmy.ml/post/1259772

In an effort to address this, I have been using a script to save Reddit threads that I find interesting to my Personal Knowledge Management system: https://www.reddit.com/r/ObsidianMD/comments/104k0om/script_save_reddit_posts_to_obsidian/ . I have managed to successfully use it, but since I don't have a strong understanding of Ruby code ๐Ÿ˜…, I'm worried about its future functionality, especially if it depends on the Reddit API.

I recently discovered a thread discussing Reddit dumps: https://lemmy.nz/post/52092 . This discovery made me curious if it would be possible to modify the Ruby script to work with a local version of Reddit or even directly with the Reddit logs. To my understanding, these logs are in JSON format, but I haven't downloaded them yet.

Additionally, I've come across the concept of vector embeddings and a tool called Pinecone. Would it be more straightforward to use this tool to extract the necessary information, as opposed to manually searching through the data? Ideally, I would like to create a local search function, similar to Google, specifically for this dataset dump. However, I'm unsure of how to search a local database of Reddit submissions. I have found potential solutions such as Semantra and Qdrant, but I'm uncertain if these are the best tools for this task. Perhaps there is a more suitable option?

I will be honest, I don't have a strong background in technology, and this problem is proving to be quite complex. But I'm willing to tackle it. I would greatly appreciate any input or suggestions that you could provide.

Thank you in advance, everyone! ๐Ÿ˜Š

[โ€“] [email protected] 2 points 1 year ago* (last edited 1 year ago)

I think that giving too many choices to users who are already confused by the concept of federation and instances will enhance their paralysis of making choice due to cognitive overload (See https://en.m.wikipedia.org/wiki/Overchoice ).

I've found out kbin.social the easiest to get used to (end-user wise).

[โ€“] [email protected] 2 points 1 year ago (1 children)

Damn I was looking forever how to get some communities working. Thanks!

This is definitely a much needed information during onboarding (not sure if it was included already or if I've missed it).

[โ€“] [email protected] 5 points 1 year ago (1 children)

How long does it usually take for google to index websites? Because I tried the string lemmy site:lemmy.ml after:2023-06-15 and only one post turned up for me and it was Memes... the current state of affairs does not seem promising ๐Ÿ˜” And if I tried with another instance with the same keywords lemmy site:kbin.social after:2023-06-15 nothing even turned up.

I wonder though, will search engines adapt to Lemmy and its fediverse system? Or will search engines die? Or will we see dedicated search engines to search through the fediverse?

[โ€“] [email protected] 5 points 1 year ago (1 children)

Unfortunately, I've been getting some 404 not found of some communities/magazines of some instances that are not from the instance I'm using, e.g. I'm using kbin.social at the current posting account, but let's say that I tried to access something like https://sh.itjust.works/c/skincareaddiction there's no issues whatsoever (since it's the main instance where that community spawned off) but if I tried https://kbin.social/m/[email protected] then I would get the aforementioned error code. I find it pretty inconvenient that caching/indexing of certain less popular (which I assume is what is happening) community working clunkily, it feels not as reliable than using a centralized service, but I guess that this is the price to pay for a decentralized system.

view more: next โ€บ