bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] Combining --output-document with --recursive


From: Giuseppe Scrivano
Subject: Re: [Bug-wget] Combining --output-document with --recursive
Date: Sat, 26 May 2012 02:26:20 +0200
User-agent: Gnus/5.13 (Gnus v5.13) Emacs/23.4 (gnu/linux)

Gijs van Tulder <address@hidden> writes:

> Hi,
>
> There's a problem if you combine --output-document with --recursive or
> --page-requisites. --output-document breaks the recursion.
>
> First you get a warning:
>
>   WARNING: combining -O with -r or -p will mean that all downloaded
>   content will be placed in the single file you specified.
>
> That is what you'd expect, no problem there.
>
> However, there is a problem with the recursion. Because Wget *appends*
> all downloaded content in the same file, the HTML and CSS parsers get
> confused. The same content is parsed over and over again, each time
> with a different URL context.

a solution would be probably to keep the file opened and continue
reading from the last position.  It seems a bit tricky though, maybe the
wise solution is to don't allow at all -r and -O together.

Giuseppe



reply via email to

[Prev in Thread] Current Thread [Next in Thread]