bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] Enqueue logic problems


From: Micah Cowan
Subject: Re: [Bug-wget] Enqueue logic problems
Date: Thu, 2 May 2013 08:49:56 -0700

> On Thu, May 2, 2013 at 9:00 PM, Tim Ruehsen <address@hidden> wrote:
>>
>> Darshit, I guess you are talking about redirection.
>>
>> That is 'wget -r gnu.org' is being redirected to www.gnu.org (via Location
>> header). Wget now follows the redirection, but only downloads index.html
>> since
>> all included URLs in index.html refer to www.gnu.org. But we requested
>> stuff
>> from gnu.org.

Ah, yeah that's a decent point. I like it, but then, we run into
name-trusting problems along the lines of why --trust-server-names was
introduced, if we just happily translate a hostname to its redirection
(and in particular, begin accepting more pages with that unasked-for
hostname). Obviously, using --trust-server-names solves the issue, but
I'm not sure requiring that is any better, from a user experience,
than adding -D -H.

On Thu, May 2, 2013 at 8:41 AM, Darshit Shah <address@hidden> wrote:
> However, my question was much more specific, if the server redirects to a
> domain which matches www.<old domain name> then shouldn't wget just accept
> it and refresh the parent domain name that it holds?

I'd be leery of treating any hostname, even "www", as special, but am
not 100% opposed to it.

-mjc



reply via email to

[Prev in Thread] Current Thread [Next in Thread]