Understanding a typical 8bit character problem (such as w/ European language accented chars)
If a single accented European character is incorrectly displayed as two seemingly random characters, then the issue is that at some point utf-8 bytes were incorrectly interpreted as ANSI bytes. For example, consider the character “é”. In the utf-8 encoding, this character is represented in two bytes: 0xC3 0xA9 In the typical ANSI encoding (such as Windows-1252 or iso-8859-1) it […]