ext3 – How to tell the language encoding of a filename on Linux? – Server Fault
Posted by jpluimers on 2017/05/08
From ext3 – How to tell the language encoding of a filename on Linux? – Server Fault [WayBack] I learned a few things:
- filename encoding on Linux is undetermined: the file system just stores each name as an array of bytes and records no encoding for it
- FTP and SFTP suffer from this as well (SFTP is based on SSH which now prefers UTF-8 [WayBack])
A good default is UTF-8, but it’s never guaranteed.
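Since a name is just bytes, a first check is whether those bytes are valid UTF-8 at all. Below is a minimal Python sketch of that check. The helper names `is_valid_utf8` and `non_utf8_names` are my own, made up for illustration. Passing a bytes path to `os.listdir` makes it return the raw names instead of decoded strings.

```python
import os
from typing import List


def is_valid_utf8(raw_name: bytes) -> bool:
    """Return True when the raw filename bytes decode cleanly as UTF-8."""
    try:
        raw_name.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def non_utf8_names(directory: str = ".") -> List[bytes]:
    """List the raw names in `directory` that are not valid UTF-8."""
    # A bytes path makes os.listdir return bytes names, so nothing is
    # decoded (or mangled) behind our back.
    raw_dir = os.fsencode(directory)
    return [name for name in os.listdir(raw_dir) if not is_valid_utf8(name)]


if __name__ == "__main__":
    for raw in non_utf8_names("."):
        print(repr(raw))
```

A name that passes is not proven to be UTF-8 (pure ASCII passes too), but a name that fails is certainly something else.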
Two tools can help to determine the encoding of a filename:
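- convmv [WayBack] converts filenames from one encoding to another; by default it only prints what it would rename, and it renames for real only with --notest
- chardet (Python), the Universal Character Encoding Detector, guesses the encoding of a sequence of bytes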
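For a single suspicious name, a rough sketch of the same idea in Python: decode the raw bytes under a few candidate encodings and see which result looks like a sensible name. The candidate list and the helper names `candidate_decodings` and `chardet_guess` are assumptions of mine for illustration. `chardet.detect` is the library's detection call; the import is guarded because chardet is a third-party package. Expect weak guesses on short inputs such as filenames.

```python
from typing import Dict, Iterable, Optional

try:
    import chardet  # third-party package: pip install chardet
except ImportError:
    chardet = None


def candidate_decodings(
    raw_name: bytes,
    encodings: Iterable[str] = ("utf-8", "cp1252", "iso-8859-15", "koi8-r"),
) -> Dict[str, str]:
    """Map each encoding that decodes raw_name without error to its text."""
    results = {}
    for encoding in encodings:
        try:
            results[encoding] = raw_name.decode(encoding)
        except UnicodeDecodeError:
            # These bytes are not valid in this encoding; skip it.
            continue
    return results


def chardet_guess(raw_name: bytes) -> Optional[str]:
    """Return chardet's best guess, or None if chardet is unavailable."""
    if chardet is None:
        return None
    return chardet.detect(raw_name)["encoding"]


if __name__ == "__main__":
    raw = b"caf\xe9.txt"  # "café.txt" as written by a Latin-1 system
    for encoding, text in candidate_decodings(raw).items():
        print(encoding, "->", text)
    print("chardet guess:", chardet_guess(raw))
```

Several single-byte encodings will usually decode without error, so a human still has to pick the one that reads right.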
–jeroen