Privacy

35371 readers

618 users here now

A place to discuss privacy and freedom in the digital world.

Privacy has become a very important issue in modern society, with companies and governments constantly abusing their power, more and more people are waking up to the importance of digital privacy.

In this community everyone is welcome to post links and discuss topics related to privacy.

Some Rules

Posting a link to a website containing tracking isn't great, if contents of the website are behind a paywall maybe copy them into the post
Don't promote proprietary software
Try to keep things on topic
If you have a question, please try searching for previous discussions, maybe it has already been answered
Reposts are fine, but should have at least a couple of weeks in between so that the post can reach a new audience
Be nice :)

Related communities

much thanks to @gary_host_laptop for the logo design :)

founded 5 years ago

MODERATORS

[email protected]

Chrome will soon automatically OCR PDFs (www.androidpolice.com)

submitted 2 years ago by [email protected] to c/[email protected]

12 comments fedilink hide all child comments

You think there will be some privacy implications for this?

top 9 comments

sorted by: hot top controversial new old

[–] [email protected] 30 points 2 years ago (1 children)

Solution: use Firefox

[–] Rooki 14 points 2 years ago

Easiest solution of my life xd

[–] [email protected] 17 points 2 years ago* (last edited 2 years ago) (2 children)

I guess the pdfs will be processed by google on their servers. I honestly dont want that. Is there a source?

[–] styraco 1 points 2 years ago

I doubt that. There is no technical reason for that. OCR isn't that hard computationally. And from a privacy/GDPR perspective this seems like a legal mess not even google would take on.

[–] [email protected] 0 points 2 years ago* (last edited 2 years ago) (2 children)

The article states:

Chrome already had its reading mode on ChromeOS, and now Google shares that the feature is expanding to the browser on all computers

The author of the article only speculates:

Maybe we'll one day see a version of it PDFs?

Setting aside the question of privacy, this is a very nice feature. I do worry how Firefox will compete with many of these small comforts and if it may eventually fall out of favor as a viable mainstream alternative and there goes our chance at having a privacy respecting browser. I guess (hope?) there will always be a niche alternative for privacy-minded folks?

But as far as privacy I'm not sure how the scanning of PDFs will affect it. I mean everything on the internet is basically already scanned and cataloged and sharing information over public internet through PDF rather that HTML shouldn't make a difference? Unless the article means Chrome browser would be scanning private files opened through the user's computer.

[–] [email protected] 3 points 2 years ago

The big companies at least will make using their products awful experience if you value your time, sanity, privacy or just about anything at all. Its unfortunate though how many people seem to be content in watching almost nothing but ads and maybe some other content occasionally. Those types of users will probably keep using whatever was presented to them first.

[–] [email protected] 2 points 2 years ago

Yeah, if I run my own business and have my own email servers behind a VPN, no way in hell PDFs on that server are being indexed.

But now, if an employee opens a PDF in an email attachment and the file is sent out to Google's severs for OCR processing?? That's a huge breach of security.

We should be able to expect that what we open in our web browsers stays local.

[–] [email protected] -1 points 2 years ago (1 children)

Tesseract isn't too heavy, maybe it runs locally? 👉👈

[–] [email protected] -1 points 2 years ago

Its likely

load more comments