On Fri, 5 Aug 2005 address@hidden wrote:
I have a need that's not satisfied with split, but which split could do.
At present, I'm doing it in Perl (and that's probably adequate for what
I need to do).
Contemplate a document such as one returned by this command:
wget -O /tmp/houses.html \
"http://www.realestate.com.au/cgi-bin/rsearch?a=qfp&cat=House&p=200&s=wa&o=p&t=res&id=6285}"
That will produce a list of homes available for purchase in a region
become famous for Fine Wines.
The document has a heap of junk top and bottom, and some identifiable
material between the descriptions of individual homes.
What I suggest is that split be enhanced to provide a means of splitting
based on content.
Have you looked at csplit?