[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Character sets and encodings confusion
From: |
Otto Maddox |
Subject: |
Character sets and encodings confusion |
Date: |
Fri, 11 Jan 2008 14:26:29 +0000 |
When I type `C-u C-x =' on the character `£', I get
something like this:
character: £ (2211, #o4243, #x8a3, U+00A3)
charset: latin-iso8859-1
(Right-Hand Part of Latin Alphabet 1 (ISO/IEC 8859-1): ISO-IR-100.)
code point: #x23
syntax: w which means: word
category: l:Latin
buffer code: #x81 #xA3
file code: #xA3 (encoded by coding system iso-latin-1)
display: by this font (glyph code)
-apple-monaco-medium-r-normal--13-130-72-72-m-130-iso10646-1 (#xA3)
Why is the code point #x23? Should it not be #xA3 in Latin Alphabet 1?
Because when you click on the #x23, the character list you get shows
the code point as being #xA3, which is confusing.
Also, what are the first three numbers in parenthesis on the
`character:' line? Are they code points of some charset? (I
understand that the fourth number is a Unicode code point.)
--
Otto Maddox
ottomaddox@fastmail.fm
--
http://www.fastmail.fm - Same, same, but different
- Character sets and encodings confusion,
Otto Maddox <=