[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Bug-wget] [PATCH] Patch to change behavior with redirects under --r
From: |
Dale R. Worley |
Subject: |
Re: [Bug-wget] [PATCH] Patch to change behavior with redirects under --recurse. |
Date: |
Fri, 07 Oct 2016 15:40:55 -0400 |
Tim Ruehsen <address@hidden> writes:
> the changes in recur.c are not acceptable. They circumvent too many checks
> like host-spanning, excludes and even --https-only.
I suppose it depends on what you consider the semantics to be.
Generally, I look at it if I've specified to download http://x/y/z and
http://x/y/z redirects to http://a/b/c, if http://x/y/z passes the tests
I've specified, then the page should be downloaded; the fact that it's
redirected to http://a/b/c is incidental. Most checks *should* be
circumvented.
I guess I'd make exceptions for --https-only, which is presumably
placing a requirement on *how* the pages should be fetched, and probably
the robots check, as that's a policy statement by the server.
Dale