Conversion of mixed UTF8 and ISO8859-1/CP1252

15 December 2008; update Win32 and publication of sources: 6 May 2016

Download

Free exe download (Win32 format), old 16-bits MSDOS version.

Usage

This is a simple, specialised UTF8 to Latin1 converter.

Unlike general purpose converters like uniconv, it can reliably convert textfiles in languages like German, French Spanish etc. in which part of the text is in ISO-8859-1 (Latin1) and part in UTF8. It does this by detecting what is probably UTF8 and what is certainly not.

Optionally (-w), it also supports Windows code page 1252, instead of plain ISO8859-1, so things like curly quotes, n-dashes (–), French oe-ligatures (œ, Œ), Slavic s or z with haček (Š, š, Ž, ž) are also supported. See also CP1252, on Roman Czyborra’s site.

Usage: utf8mixd [-w] infile outfile

Sources

The source files are here.



Link

For a similar, but more versatile tool, see this perl script by Helmut Richter.