$ iconv --help
Usage: iconv [OPTION...] [FILE...]
Convert encoding of given files from one encoding to another.

 Input/Output format specification:
  -f, --from-code=NAME       encoding of original text
  -t, --to-code=NAME         encoding for output

 Information:
  -l, --list                 list all known coded character sets

 Output control:
  -c                         omit invalid characters from output
  -o, --output=FILE          output file
  -s, --silent               suppress warnings
      --verbose              print progress information

  -?, --help                 Give this help list
      --usage                Give a short usage message
  -V, --version              Print program version

Mandatory or optional arguments to long options are also mandatory or optional
for any corresponding short options.

For bug reporting instructions, please see:
<https://bugs.launchpad.net/ubuntu/+source/glibc/+bugs>.
At a minimum we have to supply -f (--from-code=NAME) and -t (--to-code=NAME). The obvious choice for -t is UTF-8. But what about -f?
uchardet
uchardet is the "Universal Charset Detector". It guesses the encoding of a text file and prints its name:
uchardet <file.txt>
Using it on the example above:
$ uchardet test.txt
SHIFT_JIS
This tells us that the content of test.txt is encoded in SHIFT_JIS, a common encoding for Japanese text files and websites.
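With the source encoding known, the two tools can be chained. The following is only a minimal sketch: `to_utf8` is a hypothetical helper name (not part of iconv or uchardet), and it assumes that whatever name uchardet prints is one that the local iconv accepts for -f, which is usually but not always the case.

```shell
#!/bin/sh
# to_utf8 SRC DST [ENCODING]
# Hypothetical helper: convert SRC to UTF-8 and write the result to DST.
# If ENCODING is omitted, the source encoding is guessed with uchardet.
to_utf8() {
    src=$1
    dst=$2
    # Use the third argument when given; otherwise ask uchardet.
    # Fails here if uchardet is missing or exits with an error.
    from=${3:-$(uchardet "$src")} || return 1
    # Assumption: the detected name is valid for iconv -f.
    iconv -f "$from" -t UTF-8 "$src" > "$dst"
}
```

For the file above, `to_utf8 test.txt test.utf8.txt` would run the detection and then the conversion. Detection is a guess, so when the encoding is already known it is safer to pass it explicitly, for example `to_utf8 test.txt test.utf8.txt SHIFT_JIS`.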