The Wiert Corner – irregular stream of stuff

Jeroen W. Pluimers on .NET, C#, Delphi, databases, and personal interests

  • My badges

  • Twitter Updates

  • My Flickr Stream

  • Pages

  • All categories

  • Enter your email address to subscribe to this blog and receive notifications of new posts by email.

    Join 1,839 other subscribers

When you bump into Mojibake in your development, don’t use table-based solutions to solve it

Posted by jpluimers on 2026/05/14

A while ago I bumped into [Wayback/Archive] Unicode weirdness – VCL – Delphi-PRAXiS [en].

This sketched a mojibake problem where PDF to text converted files had odd looking character sequences.

The solution – replacing these sequences with more correctly looking text – worked at first, but then failed because the underlying source code got “corrected” from containing the Mojibake character sequences into the correct Unicode text.

A better solution is to figure out what series of encoding/decoding steps will give the correct text.

This is where – again – [Wayback/Archive] Home – ftfy: fixes text for you comes up: a still indispensable tool.

–jeroen

Leave a comment

This site uses Akismet to reduce spam. Learn how your comment data is processed.