Re: [Bug-wget] "Transparent proxy URL" variation on "-E -k" options ?
From: Ángel González
Date: Sun, 19 Oct 2014 21:08:16 +0200
User-agent: Thunderbird
Gabriel Somlo wrote:
> If I try to add --convert-links into the mix, the referencing link
> does get rewritten, but ends up looking like
> "../site.com/article.cgi?25.html"
> which is designed for offline viewing via "file://", and is unsuitable
> for actually hosting both the referencing and referenced sites as
> virtual servers in a web server within the sandbox.
Are you using --span-hosts? Without it, wget won't crawl pages on a
different host, and thus won't produce a relative URL that goes down to
the hostname. I think that not using --span-hosts will suit your use case.
> If not, assuming I can come up with a patch, would there be any
> interest in upstreaming this type of additional functionality ?
However, you could provide a patch that makes links relative to the
hostname in that case (you would need to add a parameter to
--convert-links to enable that alternative conversion).
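As a rough illustration of what such an alternative conversion might
produce, here is a small Python sketch. It is not wget code: the function
name host_relative_link, the "index.html" referrer, and the assumption
that the mirror stores each site under a top-level <hostname>/ directory
are all mine, for illustration only.

```python
import posixpath


def host_relative_link(referrer_path, link, scheme="http"):
    """Map a file-relative link to a URL rooted at the target hostname.

    referrer_path: path of the referring file inside the mirror root,
                   assumed to start with the host directory.
    link:          the relative link as written by --convert-links.
    """
    # Resolve the link against the directory holding the referring file.
    base_dir = posixpath.dirname(referrer_path)
    resolved = posixpath.normpath(posixpath.join(base_dir, link))
    # Assumption: the first path component is the per-host directory.
    host, _, rest = resolved.partition("/")
    return f"{scheme}://{host}/{rest}"


print(host_relative_link("www.example.com/index.html",
                         "../site.com/article.cgi?25.html"))
```

The sketch only handles links that stay inside the mirror root; anything
that escapes it with extra "../" components would need separate handling.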
Bug notice: just listing the domains to span with --domains doesn't seem
to work:
Deciding whether to enqueue "http://www.example.org/script.js".
This is not the same hostname as the parent's (www.example.org and www.example.com).
Decided NOT to load it.
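
One way to read the output above is that the domain list can only reject
hosts, while the same-hostname test is applied independently of it. The
Python sketch below models that reading. It is a simplified guess, not
wget's source: should_enqueue, span_hosts and domains are hypothetical
names, and the parent URL path is made up.

```python
from urllib.parse import urlsplit


def should_enqueue(url, parent_url, span_hosts=False, domains=()):
    """Return (decision, reason) for a candidate URL."""
    host = urlsplit(url).hostname or ""
    parent_host = urlsplit(parent_url).hostname or ""
    # A domain list, if given, filters hosts by exact or suffix match.
    if domains and not any(host == d or host.endswith("." + d)
                           for d in domains):
        return False, "domain not in accepted list"
    # Without host spanning, a different hostname is rejected even when
    # the domain list would have accepted it.
    if not span_hosts and host != parent_host:
        return False, "not the same hostname as the parent's"
    return True, "ok"


child = "http://www.example.org/script.js"
parent = "http://www.example.com/index.html"
both = ("example.org", "example.com")
print(should_enqueue(child, parent, domains=both))
print(should_enqueue(child, parent, span_hosts=True, domains=both))
```

Under this model the first call is rejected by the hostname test and the
second is accepted, which matches the behaviour reported above.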