lynx-dev
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Lynx-dev] Dumps Unicode file in broken encoding.


From: Thorsten Glaser
Subject: Re: [Lynx-dev] Dumps Unicode file in broken encoding.
Date: Mon, 29 Sep 2008 11:15:02 +0000 (UTC)

Atsuhito Kohda dixit:

>> aqwa『~』$ lynx.cur --dump test.html 
>>      * é
>>      * 

This is very interesting:

address@hidden:~ $ lynx -dump test.htm
     * é
     * �

address@hidden:~ $ lynx -version
Lynx Version 2.8.7dev.9 (27 Apr 2008)
libwww-FM 2.14, SSL-MM 1.4.1, ncurses 5.6.20080830(wide)
Built on linux-gnu Sep  2 2008 02:14:03

────

address@hidden:~ $ lynx -dump test.htm                                          
                                       
     * é
     * à

address@hidden:~ $ lynx -version                                                
                                       
Lynx Version 2.8.7dev.8-MirOS (17 Feb 2008)
libwww-FM 2.14, SSL-MM 1.4.1, OpenSSL 0.9.7m, ncurses 5.5.20051010(wide)
No compilation information available.

────

But this is also the key to the solution, because MirBSD always
has the UTF-8 locale (also for assumed charset and display charset),
so, back on Debian, I get this:

address@hidden:~ $ lynx -assume_charset=utf-8 -display_charset=utf-8 -dump 
test.htm
     * é
     * à

This, in the end, means that the stock lynx-cur configuration is broken,
and this is not an upstream problem.

bye,
//mirabilos
-- 
Sometimes they [people] care too much: pretty printers [and syntax highligh-
ting, d.A.] mechanically produce pretty output that accentuates irrelevant
detail in the program, which is as sensible as putting all the prepositions
in English text in bold font.   -- Rob Pike in "Notes on Programming in C"




reply via email to

[Prev in Thread] Current Thread [Next in Thread]