this post was submitted on 08 Sep 2023
165 points (99.4% liked)

Explain Like I'm Five

14096 readers
28 users here now

Simplifying Complexity, One Answer at a Time!

Rules

  1. Be respectful and inclusive.
  2. No harassment, hate speech, or trolling.
  3. Engage in constructive discussions.
  4. Share relevant content.
  5. Follow guidelines and moderators' instructions.
  6. Use appropriate language and tone.
  7. Report violations.
  8. Foster a continuous learning environment.

founded 1 year ago
MODERATORS
 

Do they get some kind of real-time feed that tells them "hey this URL popped up in the web today, but it is a tracker, so block it", or is this exercise is mostly helped by the crowd ?

all 30 comments
sorted by: hot top controversial new old
[–] fylkenny 152 points 1 year ago (1 children)
[–] DeadNinja 35 points 1 year ago
[–] slazer2au 75 points 1 year ago (4 children)

Regular expression magic.

A lot of ad networks have a pattern to the name or the window the advert appears in.

Using regular expression you can find just the adwindow and ignore the actual content.

Now what is regular expression? A wizard language.
ask any programmer about RE after 4 beers and watch the hate wash over their face.

[–] [email protected] 37 points 1 year ago (3 children)

I love regex, I'm not even gonna lie. To be fair, my expressions haven't been 50+ characters long, but still.

[–] [email protected] 19 points 1 year ago (1 children)

Regex absolutely has many great uses. The issue is people trying to use for things they shouldn't. Then it suddenly becomes a nightmare.

[–] [email protected] 10 points 1 year ago (1 children)

Can I parse [X]HTML with regex?

[–] [email protected] 8 points 1 year ago (1 children)

You can parse any plaintext with regex, but I would recommend using XPath for that use case, instead.

[–] [email protected] 11 points 1 year ago

Same, I've never understood the hate. But then again I memorize based off of patterns and regex in my brain is just a pattern.

[–] ShunkW 8 points 1 year ago (1 children)

I used to play regex games online lol. I once wrote a pattern for work that was ~200 characters. I loved doing it lol.

[–] [email protected] 2 points 1 year ago

Seems like an interesting way to learn. Do you remember any of them?

[–] [email protected] 20 points 1 year ago* (last edited 1 year ago) (1 children)

The first language I learned was Perl, so regex are very close to my heart. I'm also quite excitable when I drink (I'm a happy drunk), so ask me and I'll give you a very enthusiastic explanation while not noticing that you aren't interested in my detailed explanation and examples. Do it. I dare ya.

[–] slazer2au 6 points 1 year ago
[–] Mbourgon 11 points 1 year ago

Zawinski’s second law - “Sometimes a person looks at a problem and says ‘I know what I’ll do, I’ll use regular expressions’. And now they have two problems.”

[–] [email protected] 3 points 1 year ago (1 children)

Decade of c# game design under my belt, but never dealt with web scripting. Am I missing out?

[–] slazer2au 5 points 1 year ago

You are missing out on headaches.

[–] [email protected] 46 points 1 year ago

Nerds. Hundreds and hundreds of them.

[–] [email protected] 14 points 1 year ago

As someone who runs a popular blocklist collection, I've come to find that most of the MASSIVE lists are people who collate a whole bunch of lists together and then promote their "one size fits all" solution alongside their donation link. There are very few original high quality ad-blocking lists maintained (where originality is defined as a sizeable amount of unique entries not shared by other lists) and almost all don't appear to openly discuss the magic sauce behind their lists, outside of the obvious case of user submissions.

[–] [email protected] 8 points 1 year ago

The easy way would be to make a website sign up for all the ads and see what happens. Subtract your website from the data and there's the ads.

[–] [email protected] -5 points 1 year ago

The real answer will probably end up being that they ARE the ad companies in disguise.