bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Bug-wget] Infinite loop, and bad 'adjust extension' on pdf


From: Lluís Batlle i Rossell
Subject: [Bug-wget] Infinite loop, and bad 'adjust extension' on pdf
Date: Sat, 24 Nov 2012 11:33:05 +0100
User-agent: Mutt/1.5.21 (2010-09-15)

Hello,

I was downloading recursively. Specifically:
wget 
--domains="data.inh.cat,data.jordibilbeny.com,www.inh.cat,www.jordibilbeny.com" 
\
    -H --adjust-extension -k -r -c -l 3 http://www.jordibilbeny.com/

And:

1) It went in an infinite loop while downloading http://www.inh.cat/robots.txt ,
it returning HTTP 416, and retrying again and again. I had to remove '-c' to
make wget go.

2) All links to '.pdf' files had its target changed to '.pdf.html' (that is, -k
and --adjust-extension I guess). But the pdf files downloaded didn't have the
".html" name addition. So the local links failed.

I used a "sed -i" in the files of my interest, to rewrite the anchor targets.

I'm running 1.13.4.

Thank you,
Lluís.



reply via email to

[Prev in Thread] Current Thread [Next in Thread]