this post was submitted on 09 Jan 2025
466 points (99.2% liked)

Opensource

[–] [email protected] 53 points 1 day ago (5 children)

Accessibility is honestly the first good use of AI. I hope they can find a way to make the subtitles better than YouTube's automatic captions, though.

[–] [email protected] 6 points 23 hours ago

While LLMs are truly impressive feats of engineering, it's really annoying to witness the tech hype train once again.

[–] [email protected] 10 points 1 day ago

The app Be My Eyes pivoted from crowdsourced assistance for the blind to using AI, and it's just fantastic. AI is truly helping lots of people in certain applications.

[–] [email protected] 13 points 1 day ago

There are other good uses of AI. Medicine. Genetics. Research, even into humanities like history.

The problem always was the grifters who insist on calling any program more complicated than adding two numbers AI in the first place, trying to shove random technologies into random products just to further their cancerous sales shell game.

The problem is mostly CEOs and salespeople thinking they are software engineers and scientists.

[–] [email protected] 9 points 1 day ago

I know Jeff Geerling on YouTube uses OpenAI's Whisper to generate captions for his videos instead of relying on YouTube's. Apparently they are much better than YouTube's, being nearly flawless. I would guess that Google wants to minimize the compute it uses when processing videos to save money.

[–] [email protected] -4 points 1 day ago (1 children)
[–] [email protected] 5 points 1 day ago* (last edited 1 day ago) (1 children)

Spoiler, they will! I use FUTO Keyboard on Android; its speech-to-text uses an AI model, and it is amazing how well it works. The model it uses is absolutely tiny compared to what a PC could run, so VLC's implementation will likely be even better.

[–] Landless2029 3 points 1 day ago

I also use FUTO and it's great. But subtitles in a video are quite different from you clearly speaking into a microphone. Even just loud music will mess with a good speech-to-text engine, let alone [Explosions] and [Fighting Noises]. At the least, I hope it does pick up speech well.