[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Character sets and encodings confusion
From: |
Eli Zaretskii |
Subject: |
Re: Character sets and encodings confusion |
Date: |
Fri, 11 Jan 2008 18:34:56 +0200 |
> From: "Otto Maddox" <ottomaddox@fastmail.fm>
> Date: Fri, 11 Jan 2008 14:26:29 +0000
>
> When I type `C-u C-x =' on the character `£', I get
> something like this:
>
> character: £ (2211, #o4243, #x8a3, U+00A3)
> charset: latin-iso8859-1
> (Right-Hand Part of Latin Alphabet 1 (ISO/IEC 8859-1):
> ISO-IR-100.)
> code point: #x23
> syntax: w which means: word
> category: l:Latin
> buffer code: #x81 #xA3
> file code: #xA3 (encoded by coding system iso-latin-1)
> display: by this font (glyph code)
> -apple-monaco-medium-r-normal--13-130-72-72-m-130-iso10646-1 (#xA3)
>
> Why is the code point #x23?
This is the code point of `£' in the latin-iso8859-1 charset.
> Should it not be #xA3 in Latin Alphabet 1?
No. The latin-iso8859-1 charset does not include ASCII, so it starts
from what you are used to call "codepoint 160".
> Also, what are the first three numbers in parenthesis on the
> `character:' line? Are they code points of some charset?
They are internal Emacs representation of this character, in decimal,
octal, and hex.
This is all explained in the Emacs manual, btw; see the node "Position
Info" there (I got to that node by typing "i C-x =" in Info).