help-gnu-emacs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: umlauts (8bit characters) input


From: Peter Dyballa
Subject: Re: umlauts (8bit characters) input
Date: Wed, 2 Feb 2005 20:37:54 +0100


Am 02.02.2005 um 16:19 schrieb Stefan Monnier:


real umlauts! But in shell they're gone ... a bunch of control codes? (I am using tcsh and see a\314?o\314?u\314?\303?U\314?O\314?A\314?\342?\254
instead of äöü?ÜÖÄÛ, shell has only 'uu-')

Could you try posting in a more widely available charset than mac-roman? The last char of "uu-" is meaningless in a shell buffer (it's the encoding
of the file associated with the buffer, and since shell buffers have no
associated file, ...).

Sorry, this is no mac-roman, it's exactly what I see in xterm or in Terminal in the shell buffer -- although I should see something different: äöüßÜÖÄ€. That's the file's name. (Mail.app claims that it is using a 7bit encoding.)


The X11 client is even worse: copy and paste in this same Emacs does not work (a simple ä is converted to a whole book volume of glyphs that are
hard to describe),

Huh?  You mean you can't copy&paste from Emacs to itself correctly?
Or do you mean you can't correctly copy from Emacs to something else?
Or you can't correctly paste from something else to Emacs?

My .emacs file is only 8bit, so I chose ISO Latin-15. In its header it is explicitly written

;;; -*- mode: Emacs-Lisp; coding: iso-8859-15; -*-

When I copy a string with characters from this set via double-clicking the selection and paste it either in the same or in the (Unicode) scratch buffer the simple accented chars become each a series of this character and mostly \216 -- me and others have seen similiar things in GNOME under Linux when copying from outside GNU Emacs. Pressing umlauts etc. on my keyboard enter exactly these umlauts into the buffers. C-x = explains in hex that they're taken from the usual code positions (for example Ä is 0x8c4, ä is 0x8e4).

I can send you a snapshot privately ...


and displays 'a<box>o<box>u...Û', and shell uses no mode '--:**...' as in
Terminal and shows the name as in Terminal.

The <box> just means that it couldn't find a font to display the char.
Go to that char and hit C-u C-x = to see which char it is. Maybe you just
need to help Emacs find the right font.

There is no char. So there should not be a box. The correct string is äöüß... and as it's a file name, it's UTF-8 encoded. (Since the displayed glyphs are the chars stripped off their diaeresis the box could be ... but it's : "(01211310, 332488, 0x512c8, file ...). A bit astronomical high. æ and Æ and € are represented by themselves.

But of course, first we need to know which Emacs version you're running. If it's Emacs-CVS, please move this discussion to emacs-devel@gnu.org or
emacs-pretest-bug@gnu.org.

Since the Fink team decided to distribute in the unstable section an Emacs from CVS ... (but there was not much difference to 21.3.50, it might be exactly 0 in this case)

--
Greetings

  Pete




reply via email to

[Prev in Thread] Current Thread [Next in Thread]