this post was submitted on 08 Sep 2024
1222 points (98.3% liked)

Programmer Humor

32156 readers
45 users here now

Post funny things about programming here! (Or just rant about your favourite programming language.)

Rules:

founded 5 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] SzethFriendOfNimi@lemmy.world 9 points 3 weeks ago* (last edited 3 weeks ago) (4 children)

Sounds like it’s actually using XSLT or some kind of content validation. Which to be honest sounds like a good practice.

[–] clb92@feddit.dk 8 points 3 weeks ago* (last edited 3 weeks ago) (3 children)

Here's an example of a text object taken from the XML, if you're curious: https://clips.clb92.xyz/2024-09-08_22-27-04_gfxTWDQt13RMnTIS.png

EDIT: And with more complicated strings (like numbers or symbols - just regular-ass ASCII symbols) there will be tens of , because apparently numbers and letters don't even work the same. Even line breaks have their own . And if the number of these and their charLen don't match what's actually in pt:data, it won't open the file.

[–] SzethFriendOfNimi@lemmy.world 1 points 3 weeks ago* (last edited 3 weeks ago) (2 children)

Is it because of the lower case Latin æ since it’s technically one character even if two bytes?

[–] clb92@feddit.dk 3 points 3 weeks ago (1 children)
[–] SzethFriendOfNimi@lemmy.world 1 points 3 weeks ago

What a mess… sounds like the devs got burned by various Unicode edge cases RTL, etc