renzev@lemmy.world to Programmer Humor@lemmy.mlEnglish · 11 days agoAI's take on XMLlemmy.worldimagemessage-square133fedilinkarrow-up11.23Karrow-down122
arrow-up11.21Karrow-down1imageAI's take on XMLlemmy.worldrenzev@lemmy.world to Programmer Humor@lemmy.mlEnglish · 11 days agomessage-square133fedilink
minus-squareSzethFriendOfNimi@lemmy.worldlinkfedilinkarrow-up10·edit-211 days agoSounds like it’s actually using XSLT or some kind of content validation. Which to be honest sounds like a good practice.
minus-squareclb92@feddit.dklinkfedilinkEnglisharrow-up8·11 days agoHere’s an example of a text object taken from the XML, if you’re curious: https://clips.clb92.xyz/2024-09-08_22-27-04_gfxTWDQt13RMnTIS.png
minus-squareSzethFriendOfNimi@lemmy.worldlinkfedilinkarrow-up1·edit-211 days agoIs it because of the lower case Latin æ since it’s technically one character even if two bytes?
minus-squareSzethFriendOfNimi@lemmy.worldlinkfedilinkarrow-up1·11 days agoWhat a mess… sounds like the devs got burned by various Unicode edge cases RTL, etc
Sounds like it’s actually using XSLT or some kind of content validation. Which to be honest sounds like a good practice.
Here’s an example of a text object taken from the XML, if you’re curious: https://clips.clb92.xyz/2024-09-08_22-27-04_gfxTWDQt13RMnTIS.png
Is it because of the lower case Latin æ since it’s technically one character even if two bytes?
Nope, doesn’t seem like it.
What a mess… sounds like the devs got burned by various Unicode edge cases RTL, etc