lynx-dev

Re: lynx-dev Re: [dev.15] fixup patch: edit TEXTAREA


From: Leonid Pauzner
Subject: Re: lynx-dev Re: [dev.15] fixup patch: edit TEXTAREA
Date: Tue, 2 Feb 1999 10:09:57 +0300 (MSK)

1-Feb-99 16:41 Jacob Poon wrote:
> On Mon, 1 Feb 1999, Leonid Pauzner wrote:

>> 1-Feb-99 12:08 Jacob Poon wrote:
>> > Currently, that will mean Windows-based Unicode web pages will be in
>> > trouble, because Win32 applications use UCS-2 encoding by default.
>>
>> There are two different subjects: a web page in Unicode
>> (actually encoded as UTF-8, not UCS-2)
>> and Lynx's internal representation of the loaded page
>> (actually in the "display character set").
>> Now you call an external editor: it is assumed that the editor
>> works properly with plain text in the "display character set".
>> It is a reasonable assumption that your editor is compatible
>> with your current font (try another editor if it is not).
>>
>> What kind of trouble do you mean,
>> and what do you mean by "Windows-based Unicode web pages"?

> The problem is that most Windows programs store messages in UCS-2
> encoding, so if I capture that text and publish it in web pages
> (e.g. quoting from error logs), I may be unknowingly copying UCS-2 encoded
> messages instead of UTF-7/8 data, simply because the effect is invisible.
> When I save those pages from the editors and publish them, Lynx will be
> unable to display them.

> In other words, because UCS-2 is handled transparently under Win32, users
> may forget to convert such documents into UTF-7/8 encoding, making such
> pages unviewable under Lynx.

Well, Lynx for Win32 is a console application, not a GUI one.
It uses fonts from the 8-bit DOS codepages, so the editor is expected
to be compatible with 8-bit DOS text. Yes, you should take care of that.
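
The conversion step discussed in the quoted text (turning a UCS-2 file
into UTF-8 before publishing it) can be sketched as follows. This is
only an illustration, not anything taken from Lynx: the function name
ucs2_to_utf8 is hypothetical, and the code assumes little-endian byte
order when the data carries no byte order mark, which is the usual
case on Win32 but is an assumption here.

```python
import codecs


def ucs2_to_utf8(raw: bytes) -> bytes:
    """Re-encode UCS-2 (treated as UTF-16) bytes as UTF-8.

    Hypothetical helper for illustration. A leading byte order mark
    selects the byte order and is dropped from the output.
    """
    if raw.startswith(codecs.BOM_UTF16_LE):
        text = raw[2:].decode("utf-16-le")
    elif raw.startswith(codecs.BOM_UTF16_BE):
        text = raw[2:].decode("utf-16-be")
    else:
        # Assumption: no BOM means little-endian, as on Win32.
        text = raw.decode("utf-16-le")
    return text.encode("utf-8")


if __name__ == "__main__":
    sample = codecs.BOM_UTF16_LE + "Lynx".encode("utf-16-le")
    print(ucs2_to_utf8(sample))
```

For pure ASCII input the result is plain 8-bit text, so it is also
readable in a DOS codepage editor.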

> I have not found a web page using UCS-2 encoding (yet), but I found a text
> file encoded with UCS-2 encoding that is not compatible with Lynx
> (2.7.2)'s UNICODE UTF-8 Display Character Set.  Under Lynx, it pads each
> English letter with the annoying 'U00' word.  It is located at:
> http://www.geocities.com/TimesSquare/Battlefield/3605/srwffaq.txt

> However, the text is viewed correctly under Netscape 4.x, except the
> annoying double line feed.
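
The padding described above is consistent with how UCS-2 stores ASCII
text: every English letter takes two bytes, one of which is zero. A
minimal sketch of a check for such files follows. The function name
looks_like_ucs2 and the 0.5 threshold are hypothetical choices for
illustration, and the check is a heuristic that only works well for
mostly-ASCII text.

```python
def looks_like_ucs2(raw: bytes) -> bool:
    """Heuristically guess whether raw bytes are UCS-2 encoded text.

    Hypothetical helper: returns True if the data starts with a
    UTF-16 byte order mark, or if more than half of the bytes at
    even or at odd offsets are zero.
    """
    if raw[:2] in (b"\xff\xfe", b"\xfe\xff"):
        return True
    if len(raw) < 2:
        return False
    even = raw[0::2]  # low bytes for big-endian, high for little-endian
    odd = raw[1::2]
    zero_even = even.count(0) / len(even)
    zero_odd = odd.count(0) / len(odd)
    return max(zero_even, zero_odd) > 0.5


if __name__ == "__main__":
    # Each letter is followed by a zero byte in little-endian UCS-2.
    print("Lynx".encode("utf-16-le"))
    print(looks_like_ucs2("Lynx".encode("utf-16-le")))
    print(looks_like_ucs2(b"Lynx"))
```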



