help-gnu-emacs

charset problems in CVS emacs


From: Dmitri Minaev
Subject: charset problems in CVS emacs
Date: Wed, 13 Feb 2008 16:46:26 +0400

Until recently I used emacs-unicode2 (23.0.0), which was the first
version where Cyrillic letters could be pasted correctly from the X
selection. Now, lured by multi-tty, I have installed the CVS version ("GNU
Emacs 23.0.60.6 (i686-pc-linux-gnu, X toolkit, Xaw3d scroll bars) of
2008-02-13 on cera"). The first problem I ran into was the incorrect
display of Cyrillic characters. A little investigation showed
that the root of the problem is the incorrect charset preferred when
rendering all (?) non-ASCII characters, including Cyrillic, Greek,
German, etc. For example, describe-char reports:

character: й (1081, #o2071, #x439)
preferred charset: gb18030 (GB18030)
code point: 0xA7DB
syntax: w       which means: word
category: Y:Cyrillic characters of 2-byte character sets c:Chinese
h:Korean j:Japanese y:Cyrillic
to input: type "q" with cyrillic-jcuken
buffer code: #xD0 #xB9
file code: #xD0 #xB9 (encoded by coding system utf-8-unix)
display: by this font (glyph code)
arial black:pixelsize=16:foundry=monotype:weight=medium:slant=r:width=normal
(#x217)

With some fonts, this erroneous charset preference causes non-Latin
characters to be displayed in a different font, even though the
current font can represent the character in question.

In emacs-unicode2 describe-char gives the following output for the
same character:

character: й (1081, #o2071, #x439)
preferred charset: iso-8859-5 (ISO/IEC 8859/5)
code point: 0xD9
syntax: w       which means: word
category: Y:Cyrillic characters of 2-byte character sets c:Chinese
h:Korean j:Japanese y:Cyrillic
buffer code: #xD0 #xB9
file code: #xD0 #xB9 (encoded by coding system utf-8-unix)
display: by this font (glyph code)
-ETL-fixed-medium-r-normal--16-160-72-72-C-80-ISO8859-5 (#xD9)

Both examples were taken with the current fontset
-etl-*-medium-r-normal-*-16-*-*-*-*-*-fontset-16. However, the new
Emacs uses a completely different font to render the same letter.
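
For reference, the code points in the two outputs above can be checked
outside Emacs. The following is only a rough Python sketch, not anything
taken from either Emacs build; the codec names are Python's own, and I am
assuming they correspond to the Emacs charsets of the same name. It
encodes the same letter in UTF-8, ISO 8859-5 and GB18030, which suggests
that both "preferred charset" values are valid for this character and
that the two builds differ only in which charset they rank first.

```python
# Sketch: encode CYRILLIC SMALL LETTER SHORT I (U+0439) in the three
# encodings that appear in the describe-char outputs.
# Assumption: Python's codec names map onto the Emacs charsets
# utf-8, iso-8859-5 and gb18030.
CHAR = "\u0439"

def code_point(char, codec):
    """Return the bytes of `char` in `codec` as an uppercase hex string."""
    return char.encode(codec).hex().upper()

codes = {codec: code_point(CHAR, codec)
         for codec in ("utf-8", "iso8859_5", "gb18030")}

for codec, hexcode in codes.items():
    print(f"{codec:10} 0x{hexcode}")
```

The values printed can be compared with the "buffer code" and "code
point" lines in the two describe-char outputs.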

Is there a workaround to fix this behaviour before the developers find
the proper solution?

-- 
With best regards,
Dmitri Minaev

Russian history blog: http://minaev.blogspot.com
