[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Character sets and encodings confusion
From: |
Jason Rumney |
Subject: |
Re: Character sets and encodings confusion |
Date: |
Fri, 11 Jan 2008 08:28:34 -0800 (PST) |
User-agent: |
G2/1.0 |
On 11 Jan, 14:26, "Otto Maddox" <ottomad...@fastmail.fm> wrote:
> When I type `C-u C-x =' on the character `£', ...
> Why is the code point #x23? Should it not be #xA3 in Latin Alphabet 1?
The clue is in the following:
> charset: latin-iso8859-1
> (Right-Hand Part of Latin Alphabet 1 (ISO/IEC 8859-1):
> ISO-IR-100.)
Note that the latin-iso8859-1 charset only includes the Right-Hand
part (0x80-0xff).
> Because when you click on the #x23, the character list you get shows
> the code point as being #xA3, which is confusing.
It is confusing, but the table displayed is listed as the *coded*
charset, so it has the +0x80 transformation applied.
> Also, what are the first three numbers in parenthesis on the
> `character:' line?
They are the code-point in the internal encoding (emacs-mule in the
current version) in decimal, octal and hexadecimal.