lynx-dev
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: LYNX-DEV cp1252 (shudder)


From: Foteos Macrides
Subject: Re: LYNX-DEV cp1252 (shudder)
Date: Fri, 21 Nov 1997 11:00:23 -0500 (EST)

Foteos Macrides <address@hidden> wrote:
>"Alan J. Flavell" <address@hidden> wrote:
>>[...]
>>Of course.  Nevertheless, MS software creates these illegal and
>>meaningless representations, and it would be obtuse to claim that we don't
>>know what they intend by it.  I wasn't asking for an explanation of what
>>they do or don't mean in HTML, but making a suggestion for a practical
>>way of dealing with them, either as 8-bit characters - which would be
>>perfectly legal if charset=cp1252, in fact; or as NCRefs - which is, we
>>agree, invalid HTML, but we still may discuss how to deal with it, may
>>we? 
>[...]
>just go ahead and do it in the lynx271f code set.  I wasn't
>sure whether to convert &#1; to the white or black smiling face Unicode
>character, so I made it the white, though with most Display Character
>Sets it will end up as ASCII art.  :-)

        It occurred to me that the list of conversions may be of general
interest, for people who might wish to make them directly in documents
generated by FrontPage.  So here they are:


        Conversions of invalid numeric (MicroSoft codepage)
        character references to valid Unicode numeric or named
        character reference (names as in HTML 4.0 PR).

INVALID     Numeric   Named             Character
-------     -------- -------   -----------------------------------------
&#1;    ->  &#x263a; (none)    WHITE SMILING FACE
&#130;  ->  &#x201a; &sbquo;   SINGLE LOW-9 QUOTATION MARK
&#132;  ->  &#x201e; &bdquo;   DOUBLE LOW-9 QUOTATION MARK
&#133;  ->  &#x2026; &hellip;  HORIZONTAL ELLIPSIS
&#134;  ->  &#x2020; &dagger;  DAGGER
&#135;  ->  &#x2021; &Dagger;  DOUBLE DAGGER
&#137;  ->  &#x2030; &permil;  PER MILLE SIGN
&#139;  ->  &#x2039; &lsaquo;  SINGLE LEFT-POINTING ANGLE QUOTATION MARK
&#144;  ->  &#x2018; &lsquo;   LEFT SINGLE QUOTATION MARK
&#145;  ->  &#x2019; &rsquo;   RIGHT SINGLE QUOTATION MARK
&#146;  ->  &#x201c; &ldquo;   LEFT DOUBLE QUOTATION MARK
&#147;  ->  &#x201d; &rdquo;   RIGHT DOUBLE QUOTATION MARK
&#148;  ->  &#x2022; &bull;    BULLET
&#149;  ->  &#x2013; &ndash;   EN DASH
&#150;  ->  &#x2014; &mdash;   EM DASH
&#151;  ->  &#x02dc; &tilde;   SMALL TILDE
&#155;  ->  &#x203a; &rsaquo;  SINGLE RIGHT-POINTING ANGLE QUOTATION MARK


                                Fote

=========================================================================
 Foteos Macrides            Worcester Foundation for Biomedical Research
 address@hidden         222 Maple Avenue, Shrewsbury, MA 01545
=========================================================================
;
; To UNSUBSCRIBE:  Send a mail message to address@hidden
;                  with "unsubscribe lynx-dev" (without the
;                  quotation marks) on a line by itself.
;

reply via email to

[Prev in Thread] Current Thread [Next in Thread]