this post was submitted on 27 Jun 2023
47 points (100.0% liked)

Privacy

32173 readers
463 users here now

A place to discuss privacy and freedom in the digital world.

Privacy has become a very important issue in modern society, with companies and governments constantly abusing their power, more and more people are waking up to the importance of digital privacy.

In this community everyone is welcome to post links and discuss topics related to privacy.

Some Rules

Related communities

much thanks to @gary_host_laptop for the logo design :)

founded 5 years ago
MODERATORS
 

You think there will be some privacy implications for this?

top 9 comments
sorted by: hot top controversial new old
[–] [email protected] 30 points 2 years ago (1 children)
[–] Rooki 14 points 2 years ago

Easiest solution of my life xd

[–] [email protected] 17 points 2 years ago* (last edited 2 years ago) (2 children)

I guess the pdfs will be processed by google on their servers. I honestly dont want that. Is there a source?

[–] styraco 1 points 2 years ago

I doubt that. There is no technical reason for that. OCR isn't that hard computationally. And from a privacy/GDPR perspective this seems like a legal mess not even google would take on.

[–] [email protected] 0 points 2 years ago* (last edited 2 years ago) (2 children)

The article states:

Chrome already had its reading mode on ChromeOS, and now Google shares that the feature is expanding to the browser on all computers

The author of the article only speculates:

Maybe we'll one day see a version of it PDFs?

Setting aside the question of privacy, this is a very nice feature. I do worry how Firefox will compete with many of these small comforts and if it may eventually fall out of favor as a viable mainstream alternative and there goes our chance at having a privacy respecting browser. I guess (hope?) there will always be a niche alternative for privacy-minded folks?

But as far as privacy I'm not sure how the scanning of PDFs will affect it. I mean everything on the internet is basically already scanned and cataloged and sharing information over public internet through PDF rather that HTML shouldn't make a difference? Unless the article means Chrome browser would be scanning private files opened through the user's computer.

[–] [email protected] 3 points 2 years ago

The big companies at least will make using their products awful experience if you value your time, sanity, privacy or just about anything at all. Its unfortunate though how many people seem to be content in watching almost nothing but ads and maybe some other content occasionally. Those types of users will probably keep using whatever was presented to them first.

[–] [email protected] 2 points 2 years ago

Yeah, if I run my own business and have my own email servers behind a VPN, no way in hell PDFs on that server are being indexed.

But now, if an employee opens a PDF in an email attachment and the file is sent out to Google's severs for OCR processing?? That's a huge breach of security.

We should be able to expect that what we open in our web browsers stays local.

[–] [email protected] -1 points 2 years ago (1 children)

Tesseract isn't too heavy, maybe it runs locally? 👉👈

[–] [email protected] -1 points 2 years ago
load more comments
view more: next ›