this post was submitted on 25 Sep 2024
374 points (98.4% liked)

Technology

59381 readers
3468 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 25 points 1 month ago* (last edited 1 month ago) (1 children)

Haven't read details, but the classic way is to have a system visit: site.com/badimage.gif?data=abcd

Note: That s is also how things like email open rates are tracked, and how marketers grab info using JavaScript to craft image URLs.

[–] jaybone 18 points 1 month ago (1 children)

This is why every single email client for the past 2+ decades blocks external images? This didn’t occur to the AI geniuses?

[–] eager_eagle 11 points 1 month ago (1 children)

IME they usually proxy and/or prefetch images for caching instead of blocking them. Only spam content is blocked by default.

[–] [email protected] 6 points 1 month ago (1 children)

This wouldn't help, would it? How would you prefetch and cache:

site.com/base64u-to-niceware-word-array/image.gif

? It would look like a normal image URL in any article, but actually represent data.

Note: "niceware" is a way to convert binary or text data into a set of words like "cow-heart-running-something-etc".

[–] [email protected] 4 points 1 month ago (3 children)

If it’s prefetched, it doesn’t matter that you reveal that it’s been “opened,” as that doesn’t reveal anything about the recipient’s behavior, other than that the email was processed by the email server.

[–] [email protected] 3 points 1 month ago (1 children)

Personally speaking, I’ve never been a fan of this method because to the hosting web server it was still fetched. That might confirm that an email address exists or (mistakenly) confirm that the user did in fact follow the link (or load the resource).

I have ad and tracking blocked like crazy (using DNS) so I can’t follow most links in emails anyway. External assets aren’t loaded either, but this method basically circumvents that (which I hate).

[–] eager_eagle 2 points 1 month ago* (last edited 1 month ago)

an email for a receiver that doesn't exist, more often than not, goes back to the sender after e.g. 72h. That's by design.

[–] [email protected] 2 points 1 month ago (1 children)

If by prefetch you mean the server grabs the images ahead of time vs the client, this does not happen, at least on amy major modern platform that I know of. They will cache once a client has opened, but unique URLs per recipient are how they track the open rates.

[–] [email protected] 2 points 1 month ago

Apple’s Mail Privacy Protection does this. See https://www.reddit.com/r/privacy/comments/pt9ycv/apples_mail_privacy_protection/ for a post from three years ago talking about it.

I don’t know if any other major providers take this approach but Apple / iCloud is definitely one of them.

[–] [email protected] 1 points 1 month ago (1 children)

But the path changes with every new data element. It's never the same, so every "prefetch" is a whole new image in the system's eyes.

[–] [email protected] 2 points 1 month ago (1 children)

Even with a unique link, if the behavior is that as soon as the email server receives it, it’s prefetched, what does that reveal about the user?

[–] [email protected] 1 points 1 month ago* (last edited 1 month ago) (1 children)

Server or client, every supposed prefetch would be unique. If I trick an LLM client into grabbing:

site.com/random-words-of-data/image.gif

Then:

site.com/more-random-data/image.gif

Those are two separate images to the cache engine. As the data refreshes, the URL changes, forcing a new grab each time.

For email, marketers do this by using a unique image URL for every recipient.

[–] [email protected] 1 points 1 month ago (1 children)

Cool, all of your images are getting fetched by the server as it receives and processes the emails. You have 100% open rate on all emails to that domain within 3 minutes of send.

What do you know about the user and their behavior? Nothing. The prefetch is not tied to their actions, therefore you cannot learn anything about their actions.

[–] [email protected] 1 points 1 month ago (1 children)

This post isn't about email open rates, it's about data exfiltration. But for email speficially, show me major providers that prefetch by default.

[–] [email protected] 1 points 1 month ago

For data exfiltration, you’re right - this doesn’t help.