help-gnu-emacs

Re: Character sets and encodings confusion


From: Jason Rumney
Subject: Re: Character sets and encodings confusion
Date: Fri, 11 Jan 2008 08:28:34 -0800 (PST)
User-agent: G2/1.0

On 11 Jan, 14:26, "Otto Maddox" <ottomad...@fastmail.fm> wrote:
> When I type `C-u C-x =' on the character `£', ...

> Why is the code point #x23?  Should it not be #xA3 in Latin Alphabet 1?

The clue is in the following:

>     charset: latin-iso8859-1
>              (Right-Hand Part of Latin Alphabet 1 (ISO/IEC 8859-1): 
> ISO-IR-100.)

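Note that the latin-iso8859-1 charset only includes the Right-Hand
part (0x80-0xff), and code points are numbered within that part. So
`£' is #x23 in the charset, even though its 8-bit code is #xA3.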

> Because when you click on the #x23, the character list you get shows
> the code point as being #xA3, which is confusing.

It is confusing, but the table displayed lists the *coded* charset,
so every code point in it has the +0x80 transformation applied
(#x23 + #x80 = #xA3).
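
To make the arithmetic concrete, here is a small sketch in Python
(not Emacs code). The helper name charset_code_to_coded is made up
for illustration, and it assumes the only transformation is the fixed
+0x80 offset described above.

```python
OFFSET = 0x80  # offset between the charset code point and the coded form


def charset_code_to_coded(code_point: int) -> int:
    """Map a code point within the right-hand charset to its 8-bit coded form.

    Hypothetical helper: assumes the transformation is a plain +0x80 offset.
    """
    return code_point + OFFSET


# The 8-bit ISO 8859-1 code for the pound sign (U+00A3).
pound_8bit = "\u00a3".encode("latin-1")[0]

# Removing the offset gives the code point within the charset.
pound_in_charset = pound_8bit - OFFSET

print(hex(pound_8bit))                               # 0xa3
print(hex(pound_in_charset))                         # 0x23
print(hex(charset_code_to_coded(pound_in_charset)))  # 0xa3
```

The #x23 shown by `C-u C-x =' and the #xA3 shown in the table are the
same character seen before and after the offset.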

> Also, what are the first three numbers in parenthesis on the
> `character:' line?

They are the character's code point in the internal encoding
(emacs-mule in the current version), given in decimal, octal and
hexadecimal.
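
As a rough illustration of that notation, the Python sketch below
prints one integer in the three bases. The function name three_bases
is made up, and the value 0xA3 is only a stand-in; it is not the
actual emacs-mule code point that Emacs would report for `£'.

```python
def three_bases(n: int) -> str:
    """Format one integer as decimal, octal and hexadecimal.

    Hypothetical helper mimicking the (N, #oN, #xN) notation.
    """
    return f"({n}, #o{n:o}, #x{n:x})"


# 0xA3 is an illustrative value, not the real internal code point.
print(three_bases(0xA3))  # (163, #o243, #xa3)
```

All three numbers are the same value, so any one of them identifies
the character in the internal encoding.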

