[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: LYNX-DEV Charset names in ac105
From: |
Leonid Pauzner |
Subject: |
Re: LYNX-DEV Charset names in ac105 |
Date: |
Sat, 31 Jan 1998 21:05:46 +0300 (MSK) |
> > that's about "Other ISO Latin" aka x-iso-8859-other ?
> >
> > Seems it should gone since we have all 1-9 ISO 8859 entity sets
> > (and even #10, unknown for ftp.unicode.org),
> > more tables may be included later (if required and registered).
>
> I guess (a wild guess) this was meant as a rather "generic" table that
> contains characters common to ASCII and all ISO-8859-x charsets, no?
>
> Hynek
No, it works *exactly* as "7 bit approximation".
[the only difference - it calls "nbsp" and "soft-hyphen" directly,
but "7 bit approximation" do translate it properly anyway.]
Anno Domini, it come from "before unicode" scheme.
First ans last note - between 2-5 and 2-6:
> 06-14-96
> * Updated the help files, lynx.man and lynx.hlp concerning the character
> set handling. - FM
> * Changed the -jpn switch to a generic -raw switch, and the LYK_JPN_TOGGLE
> ('@') to a generic LYK_RAW_TOGGLE. The toggling also can be done via
> the 'o'ptions menu, in conjunction with selecting character sets. See
> the comments about CHARACTER_SET in userdefs.h and lynx.cfg for more
> information. The toggle determines whether raw 8-bit characters in the
> document are assumed to be ISO-8859-1, and therefore translated for the
> non-"ISO Latin 1" character sets based on their LYCharSets.c arrays, or
> are assumed to match the character set (e.g, 8-bit ISO-8859-2 when the
> "ISO Latin 2" set is selected, or Kanji when a CJK set is selected) and
> therefore processed raw. Use raw or CJK mode when you know the charset
> of the document is a match to your selected character set, but no header
> or suffix has indicated that the charset is not ISO-8859-1. Be careful
> about this when "Other ISO Latin" is selected, since the document could
> be, e.g., ISO-8859-3, but you are using, e.g., ISO-8859-5. - FM
> * Added "Other ISO Latin", "Chinese", "Japanese (EUC)", "Japanese (SJIS)",
> "Korean", and "Taipei (Big5)", character sets, which presently translate
> 8-bit ISO-8859-1 named and numeric entities to 7-bit approximations, and
> added a default "CHARACTER_SET" definition in userdefs.h, complementary
> to that in lynx.cfg. - FM
>
- Re: LYNX-DEV Charset names in ac105, (continued)
- Re: LYNX-DEV Charset names in ac105, Foteos Macrides, 1998/01/12
- Re: LYNX-DEV Charset names in ac105, Foteos Macrides, 1998/01/13
- Re: LYNX-DEV Charset names in ac105, Leonid Pauzner, 1998/01/14
- Re: LYNX-DEV Charset names in ac105, Foteos Macrides, 1998/01/20
- Re: LYNX-DEV Charset names in ac105, Leonid Pauzner, 1998/01/25
- Re: LYNX-DEV Charset names in ac105, Leonid Pauzner, 1998/01/29
- Re: LYNX-DEV Charset names in ac105,
Leonid Pauzner <=