Conversion of mixed UTF8 and ISO8859-1/CP1252

15 December 2008

Download

free exe download

Usage

This is a simple, specialised UTF8 to Latin1 converter.

Unlike general purpose converters like uniconv, it can reliably convert textfiles in languages like German, French Spanish etc. in which part of the text is in ISO-8859-1 (Latin1) and part in UTF8. It does this by detecting what is probably UTF8 and what is certainly not.

Optionally (-w), it also supports Windows code page 1252, instead of plain ISO8859-1, so things like curly quotes, n-dashes (–), French oe-ligatures (œ, Œ), Slavic s or z with haček (Š, š, Ž, ž) are also supported. See also CP1252, on Roman Czyborra’s site.

Usage: utf8mixd [-w] infile outfile