Troubleshooting generic file encodings

Some notes on troubleshooting different encodings for sent/received files.

Table

Encoding	å	ä	ö	Å	Ä	Ö
UTF-8	\c3\a5	\c3\a4	\c3\b6	\c3\85	\c3\84	\c3\96
latin1	\e5	\e4	\f6	\c5	\c4	\d6
CP437	\86	\84	\94	\8f	\8e	\99
CP850	\86	\84	\94	\8f	\8e	\99

Notes

WINDOWS-1252 / CP-1252 is a superset of latin1
All encodings are supersets of ASCII, so \20=SPACE and \0a=NEWLINE
Sometimes you will encounter the “ANSI”-encoding. That name is a lie, and means that the creator is stupid in the head. There is no way of knowing what the creator intends, so you will have to try one encoding at a time and hope for the best.

Vim commands

If you hover over a character and type ga, you will see the VIM interpretation of the encoding. Note that this will NOT necessarily be the FILE-encoding, but instead the VIM-encoding.

# BASH
vim -b FILENAME   # Open vim and see binary (doesn't always work)

# VIM
:e ++enc=1252 filename.txt # Reinterpret file as WINDOWS-1252
%!hexdump -C               # Read whole file as hex (if desperate)

Table#

Notes#

Vim commands#

Table

Notes

Vim commands