this post was submitted on 17 Apr 2024
1350 points (99.6% liked)

Programmer Humor

19918 readers
2709 users here now

Welcome to Programmer Humor!

This is a place where you can post jokes, memes, humor, etc. related to programming!

For sharing awful code theres also Programming Horror.

Rules

founded 2 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 121 points 8 months ago (5 children)

These errors were much more common before Unicode encodings were in broad use. Unicode pretty much solved this.

[–] davidgro 52 points 8 months ago (1 children)

Still happens for new emoji on old OSs, or just missing characters in the font being used.

[–] [email protected] 6 points 8 months ago

exacgly today these errors is always because some old emojis

[–] [email protected] 19 points 8 months ago (1 children)

Only if it's enabled by default, or the dev knows to enable it.

I had a lot of weird problems processing some info with names in Powershell until I found out that Powershell doesn't default to unicode format when shoving output into files. You can easily specify the encoding, but if you don't it replaces any non-ascii characters with "?" by default, so it's not even immediately obvious that there's an incorrect character, as it just silently substitutes a valid one.

[–] [email protected] 2 points 8 months ago

it uses big-endian utf-16 with BOM by default unless you upgrade to PowerShell 7

[–] [email protected] 5 points 8 months ago

I like your enthusiasm. I remember when I believed the same. The last 16 years have clearly shown this is not the case.

[–] [email protected] 4 points 8 months ago (1 children)

No it hasn't. It has just pushed them out of sight for English natives.

[–] [email protected] 28 points 8 months ago

Can't confirm that. In the 90s encodings were a nightmare. ISO-8859-1, ISO-8859-15, CP1252, IBM850, ... If you tried to build a website with an upload form, you'd get the most bizarre encodings and there was no way to reliably distinguish them. I'm not an English native, my world is full of umlauts and s-z ligatures. Things got A LOT better in the last years, thanks to Unicode encodings.

[–] [email protected] 2 points 8 months ago

Still needs to be widely used. It took me about an hour to figure out that my encoding issues were because of Vim being in latin1, another to figure out how to change that, and a third to realize that screen also wasn't in UTF-8 mode.