[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Bug-wget] Unable to mirror site with %code links wit wget 1.11.4
From: |
fernando cassia |
Subject: |
[Bug-wget] Unable to mirror site with %code links wit wget 1.11.4 |
Date: |
Thu, 05 May 2011 17:08:24 -0300 |
User-agent: |
Mozilla/5.0 (X11; Linux i686; rv:2.0.1) Gecko/20100101 Firefox/4.0.1 |
I´m trying to mirror a site using wget v1.11.4.
The parameters I´m passing it are simple:
wget -m -np -k -c (url)
I have tried adding --user-agent="Firefox/linux real user agent here"
along with --referer="[parent dir here]" but it doesn´t make any
difference. It outputs "Error 404 not found", even while I can see the
site just fine (and download files) using a web browser....
I´ve noticed that whatever web server it´s using, it outputs links
without paths, ie
<a
href="%D0%A0%D0%B0%D0%B1%D0%BE%D1%82%D0%B0%20%D1%81%20USB%20Flash%20%D0%BA%D0%B0%D0%BA%20%D1%81%20%D0%B6%D1%91%D1%81%D1%82%D0%BA%D0%B8%D0%BC%20%D0%B4%D0%B8%D1%81%D0%BA%D0%BE%D0%BC.rtf
<view-source:http://exit.ktnet.kg/Distr2/_Release/Flash%20controller/%D0%A0%D0%B0%D0%B1%D0%BE%D1%82%D0%B0%20%D1%81%20USB%20Flash%20%D0%BA%D0%B0%D0%BA%20%D1%81%20%D0%B6%D1%91%D1%81%D1%82%D0%BA%D0%B8%D0%BC%20%D0%B4%D0%B8%D1%81%D0%BA%D0%BE%D0%BC.rtf>">
FWIW, the full url is
wget -m -np -k -c --user-agent="Mozilla/5.0 (X11; Linux i686; rv:2.0.1)
Gecko/20100101 Firefox/4.0.1" --no-http-keep-alive
--referer="http://exit.ktnet.kg/Distr2/_Release/"
--keep-session-cookies
"http://exit.ktnet.kg/Distr2/_Release/Flash%20controller/"
Thoughts?
FC
- [Bug-wget] Unable to mirror site with %code links wit wget 1.11.4,
fernando cassia <=