Re: [Bug-wget] Any limits on -X? Last item being ignored

From: Henry C.
Subject: Re: [Bug-wget] Any limits on -X? Last item being ignored
Date: Wed, 15 Dec 2010 12:42:38 +0200
User-agent: SquirrelMail/1.5.2 [SVN]

On Mon, December 13, 2010 19:25, Micah Cowan wrote:
> Virtually none of the ones you listed ought to work, as you seem to be
> expecting -X to match against the entire URL, which it doesn't. It matches
> against the "directory" paths, which wouldn't include
> "foo.php?name=value&bar=baz" type components.
> -X rejects for directories; -R rejects for filenames; but at this time,
> there isn't anything that matches against anything that comes after the
> question mark ("query strings").
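
To make the distinction in the quoted reply concrete, here is a minimal Python sketch that splits a URL into the three pieces mentioned above: the directory path (what -X is said to look at), the filename (what -R is said to look at), and the query string (which, per the reply, nothing matches against at this time). This is only an illustration, not wget's actual code; the helper name url_parts and the example.com URL are made up for the example.

```python
from urllib.parse import urlsplit
import posixpath


def url_parts(url):
    """Split a URL into directory, filename and query string.

    Hypothetical helper for illustration; wget's own matching logic
    is not reproduced here.
    """
    parts = urlsplit(url)
    # The path component never includes the "?name=value" part.
    directory, filename = posixpath.split(parts.path)
    return {
        "directory": directory,  # the part -X is described as matching
        "filename": filename,    # the part -R is described as matching
        "query": parts.query,    # the part neither option is described as matching
    }


parts = url_parts("http://www.example.com/site/foo.php?name=value&bar=baz")
for name, value in parts.items():
    print(f"{name}: {value}")
```

Under that reading, a pattern written against "name=value" would never be compared with the directory or filename pieces at all.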

Typo - sorry, I meant -R, not -X.

Allow me to rephrase my question: it looks like -R *first* downloads the page
(even if it matches the pattern), then removes it since it's a match:

Removing www..../index.php?option=com_jevents&... since it should be rejected.

I don't suppose there's a way to get wget to *not* download the page at all if
it matches one of the items in -R?  -R and -X seem like a great way to save
bandwidth and not download pages you don't want; instead they are downloaded
anyway, then deleted.

