bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] "Transparent proxy URL" ariation on "-E -k" options ?


From: Ángel González
Subject: Re: [Bug-wget] "Transparent proxy URL" ariation on "-E -k" options ?
Date: Sun, 19 Oct 2014 21:08:16 +0200
User-agent: Thunderbird

Gabriel Somlo wrote:
If I try to add --convert-links into the mix, the referencing link
does get rewritten, but ends up looking like

"../site.com/article.cgi?25.html"

which is designed for offline viewing via "file://", and is unsuitable
for actually hosting both the referencing and referenced sites as
virtual servers in a web server within the sandbox.
Are you using --span-hosts ? Otherwise wget won't be crawling pages to
a different host and thus won't produce a relative url down to the hostname.

I think that not using --span-hosts will suit your use case.



If not, assuming I can come up with a patch, would there be any
interest in upstreaming this type of additional functionality ?
However, you may provide a patch for making links relative to the hostname in
such case (you would need to add a parameter for --convert-links to enable
that alternative conversion).




Bug notice: just listing the domains to span on --domains doesn't seem to work
Deciding whether to enqueue "http://www.example.org/script.js";.
This is not the same hostname as the parent's (www.example.org and www.example.com).
Decided NOT to load it.





reply via email to

[Prev in Thread] Current Thread [Next in Thread]