[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
UTF-8 BOM parse error
From: |
David Necas (Yeti) |
Subject: |
UTF-8 BOM parse error |
Date: |
Sat, 11 Sep 2004 12:24:05 +0200 |
User-agent: |
Mutt/1.4.1i |
Gettext version: 0.14.1
Problem: msgfmt (and probably other gettext tools) print an
unhelpful error
somefile.po:1:2: parse error
when a PO file starts with UTF-8 BOM (0xef 0xbb 0xbf).
What makes it worse is that any UTF-8-capable text editor or
viewer does not show the BOM (or at least should not show),
so one gazes at the file wondering what could be wrong with
the comment on its first line...
I'm aware it cannot know the encoding when reading the first
bytes, so it would have to special-case UTF-8 (and other
UTF's) to cope with it. However, I still think it worths
fixing.
Yeti
--
Do not use tab characters. Their effect is not predictable.
- UTF-8 BOM parse error,
David Necas (Yeti) <=