Parsing input from a stream... [Was: Re: Parsing input from a string...]

help-flex



From:	Dave Trombley
Subject:	Parsing input from a stream... [Was: Re: Parsing input from a string...]
Date:	Thu, 31 Jan 2002 14:47:30 -0500
User-agent:	Mozilla/5.0 (X11; U; Linux i686; en-US; rv:0.9.2) Gecko/20010628

John W. Millaway wrote:

In the next release, you will be able to change the initial buffer size.
Currently, people do this with sed/perl by redefining YY_BUF_SIZE, which is by
default 16k.

I've downloaded the developer's pre-release from your website, and I've been playing with that.

What do you want to know about the buffering? By default, flex requests as many
bytes "up front" as it can get, or as much as it needs to match something.
Obviously, you can change this behavior by returning a different # of bytes
from YY_INPUT than flex requested.
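
To make the point about returning fewer bytes concrete, here is a minimal sketch in C, assuming a POSIX system where the input arrives on a file descriptor. The helper stream_fill and the variable stream_fd are hypothetical names invented for illustration; only the YY_INPUT(buf, result, max_size) macro itself comes from flex. Since read() returns as soon as any data is available, the lexer simply gets what is there, and blocks only when nothing is.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <unistd.h>

/* Hypothetical helper: fetch whatever is available on fd, up to max_size
 * bytes. read() blocks until at least one byte arrives or the writer
 * closes the stream, so the return value may be smaller than max_size.
 * Returns 0 at end of stream (and also on an unrecoverable error). */
static int stream_fill(int fd, char *buf, size_t max_size)
{
    ssize_t n;

    assert(buf != NULL && max_size > 0);
    do {
        n = read(fd, buf, max_size);
    } while (n < 0 && errno == EINTR);   /* retry if a signal interrupted us */

    return n > 0 ? (int)n : 0;
}

/* Hypothetical descriptor for the input stream; the caller would set it
 * before invoking the lexer. Defaults to standard input. */
static int stream_fd = 0;

/* In a scanner this definition would sit in the %{ ... %} section.
 * A result of 0 tells flex that the input is exhausted. */
#define YY_INPUT(buf, result, max_size) \
    ((result) = stream_fill(stream_fd, (buf), (size_t)(max_size)))
```

With this in place flex may still ask for a full buffer's worth of bytes, but each call hands back only what the stream currently holds.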

I suppose what I'm interested in doing is to understand how the yy_*buffer* functions work. Correct me if I'm mistaken, but they seem to assume to a large degree that files will be the underlying data source for the buffers (although there are functions specifically for copying strings into a new buffer), and more broadly, that all of the data will be available by the time the lexer entry point is reached.

What I'd really like to be able to do is to have a parser/lexer pair which is fully reentrant, and have the lexer drain a stream until either the parse terminates or the stream is empty. In the latter case, I'd like the parser/lexer to block on stream input until more is available. It seems I could implement this in 2.5.6, especially given the fact that you can pass extra data along in a reentrant lexer, but I'm having trouble because I don't know the exact contract for the buffer functions. (For example, should I assume that YY_INPUT will only ever be called from a single place? How can I access the extra data from that place? Is that data placed into a flex buffer? Is there a more low-level way of getting input to the lexer, since my MT buffers will be around anyway?)

Are there any plans/thoughts about making the buffer system extensible and abstract? Do you think it would be possible/desirable for me to attempt this, and could it be done without sacrificing performance?
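
As a rough illustration of what an "extensible and abstract" input layer could look like, the following C sketch routes all reads through a callback plus an opaque context pointer, which a reentrant lexer could carry as its extra data. Everything here (struct input_source, struct chunked_string, source_read) is a hypothetical name made up for this sketch and not part of flex; whether yyextra is reachable at the point where YY_INPUT is expanded in a given flex release is likewise an assumption.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical abstract input source: the lexer sees only a callback
 * and an opaque context pointer. */
struct input_source {
    /* Copies up to max bytes into buf; returns the count, 0 at end of
     * input. A stream-backed implementation would block in here until
     * data arrives. */
    int (*read)(void *ctx, char *buf, size_t max);
    void *ctx;
};

/* One concrete backend: a string handed out in small chunks, standing in
 * for a stream that delivers its data piecemeal. */
struct chunked_string {
    const char *data;
    size_t len;
    size_t pos;
    size_t chunk;   /* maximum number of bytes handed out per call */
};

static int chunked_string_read(void *ctx, char *buf, size_t max)
{
    struct chunked_string *s = ctx;
    size_t n = s->len - s->pos;

    if (n > s->chunk) n = s->chunk;
    if (n > max) n = max;
    memcpy(buf, s->data + s->pos, n);
    s->pos += n;
    return (int)n;
}

/* The single entry point a YY_INPUT definition would call. */
static int source_read(struct input_source *src, char *buf, size_t max)
{
    assert(src != NULL && src->read != NULL);
    return src->read(src->ctx, buf, max);
}

/* In a reentrant scanner the hookup might look like this (assumption:
 * yyextra is visible where YY_INPUT is expanded):
 *
 *   #define YY_EXTRA_TYPE struct input_source *
 *   #define YY_INPUT(buf, result, max_size) \
 *       ((result) = source_read(yyextra, (buf), (max_size)))
 */
```

The cost of the abstraction is one indirect call per buffer refill rather than per character, so it seems unlikely to matter much for performance, though that would need measuring.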


   Cheers,

   -dj


