this post was submitted on 06 Sep 2024
1391 points (99.6% liked)
Programmer Humor
32480 readers
404 users here now
Post funny things about programming here! (Or just rant about your favourite programming language.)
Rules:
- Posts must be relevant to programming, programmers, or computer science.
- No NSFW content.
- Jokes must be in good taste. No hate speech, bigotry, etc.
founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Why is it that whenever something is spitting out junk data, those specific characters are involved?
Everything is 0s and 1s to a computer. What a pattern of 0s and 1s encodes is decided by people--often arbitrarily. Over the years there have been attempts to standardize encodings but, for legacy reasons, older encodings are still valid.
The 0s and 1s that encode ' in UTF-8 (a standardized encoding) are the same 0s and 1s that encode ’ in CP-1252 (a legacy encoding).
The � symbol is shown when the 0s and 1s don't encode anything of meaning.
’
=e2 80 99
(3 bytes)’
=e2
80
99
(3 separate bytes)Good to see it "spelt out" like that