lynx-dev
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

LYNX-DEV Re: new Lynx SGML.c parser


From: Klaus Weide
Subject: LYNX-DEV Re: new Lynx SGML.c parser
Date: Wed, 23 Apr 1997 14:43:57 -0500 (CDT)

On Wed, 23 Apr 1997, Christopher R. Maden wrote:

> [Klaus Weide reports on a more SGML-ish HTML parser.]
> 
> Klaus, and anyone else:
> 
> One of the reasons I'm so excited about XML is that it's a chance for
> the Web community to make a fresh start and recover from all the crap
> out there that calls itself HTML.  Lynx (and other browsers) can
> continue to do what they do now for HTML, but do things the Right Way
> for XML.

Well I was thinking of you when I started this "new parser"[*] project.
I remember you made the claim that a structured parser with error recovery
heuristics could improve handling of invalid markup (or similar wording;
I hope I didn't get your meaning too wrong).  So there is now some way
to test that claim...  This of course is not doing real SGML parsing,
just trying to resemble it a bit better.  (Not that I really understand
all the things a real SGML parser is supposed to do...)

[*] It is also not really a "new" parser, just the old one, with some
exceptions taken out, and some (crude) heuristics and some more
per-element information added in.  All changes only refer to the
content models and nesting aspects.  The added "DTD" information is
hardwired and looks like this:

#define T_ABBREV        0x0002,0x8B04F,0x8FFFF,0xA778F,0xF7FBF,0x00003,0x00000

which is rather unreadable but fits the info on one line per element :)
and there's still some unused bits left in that...

> I would be *very* interested in beginning work on a new internal MIME
> type for XML handling in Lynx.  I need some assistance, though, with
> the HTTP side of things.  Would anyone who's interested in beginning
> this project please drop me a line off-list?

Why not keep it on the list?  At least I would be interested to hear
what you have in mind.

  Klaus


;
; To UNSUBSCRIBE:  Send a mail message to address@hidden
;                  with "unsubscribe lynx-dev" (without the
;                  quotation marks) on a line by itself.
;

reply via email to

[Prev in Thread] Current Thread [Next in Thread]