[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: chinese encoded in UTF-8 and XML
From: |
Micah Cowan |
Subject: |
Re: chinese encoded in UTF-8 and XML |
Date: |
25 Sep 2003 19:52:32 -0700 |
User-agent: |
Gnus/5.09 (Gnus v5.9.0) Emacs/21.1 |
Knackeback <knackeback@randspringer.de> writes:
> Hi, I wrote a XML file with GNU emacs 21.2.2 and with
> chinese character content encoded in UTF-8.
> I wrote something like:
>
> <?xml version="1.0" encoding="UTF-8"?>
> <test>
> <chinese>撒</chinese>
> <chinese>鰓</chinese>
> </test>
>
> and then I used "C-x RET f" and then I choosed utf-8.
> Then I typed "C-x C-s" to save my file.
> I hope this is the right way in emacs to store the content
> as UTF-8 encoded text ?!
> Now I tried to parse the file with xmllint. xmllint is a
> small xml-parser program which comes with libxml2.
> The parser complains that the second "chinese line" is not proper
> UTF-8.
>
> ==>
FWICT, Emacs doesn't have a chinese input method which supports
unicode output... :-( ...I've had similar troubles with
Japanese. I've also noted that, e.g. for greek, there are input
methods which explicitly support unicode, and others which do
not.
-Micah