help-gnu-emacs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Character sets and encodings confusion


From: Eli Zaretskii
Subject: Re: Character sets and encodings confusion
Date: Fri, 11 Jan 2008 18:34:56 +0200

> From: "Otto Maddox" <ottomaddox@fastmail.fm>
> Date: Fri, 11 Jan 2008 14:26:29 +0000
> 
> When I type `C-u C-x =' on the character `£', I get
> something like this:
> 
>   character: £ (2211, #o4243, #x8a3, U+00A3)
>     charset: latin-iso8859-1
>              (Right-Hand Part of Latin Alphabet 1 (ISO/IEC 8859-1): 
> ISO-IR-100.)
>  code point: #x23
>      syntax: w        which means: word
>    category: l:Latin
> buffer code: #x81 #xA3
>   file code: #xA3 (encoded by coding system iso-latin-1)
>     display: by this font (glyph code)
>      -apple-monaco-medium-r-normal--13-130-72-72-m-130-iso10646-1 (#xA3)
> 
> Why is the code point #x23?

This is the code point of `£' in the latin-iso8859-1 charset.

> Should it not be #xA3 in Latin Alphabet 1?

No.  The latin-iso8859-1 charset does not include ASCII, so it starts
from what you are used to call "codepoint 160".

> Also, what are the first three numbers in parenthesis on the
> `character:' line?  Are they code points of some charset?

They are internal Emacs representation of this character, in decimal,
octal, and hex.

This is all explained in the Emacs manual, btw; see the node "Position
Info" there (I got to that node by typing "i C-x =" in Info).




reply via email to

[Prev in Thread] Current Thread [Next in Thread]