
[Bug-wget] Unable to mirror site with %code links with wget 1.11.4


From: fernando cassia
Subject: [Bug-wget] Unable to mirror site with %code links with wget 1.11.4
Date: Thu, 05 May 2011 17:08:24 -0300
User-agent: Mozilla/5.0 (X11; Linux i686; rv:2.0.1) Gecko/20100101 Firefox/4.0.1

I'm trying to mirror a site using wget v1.11.4.

The parameters I'm passing it are simple:

wget -m -np -k -c (url)

I have tried adding --user-agent="Firefox/linux real user agent here" along with --referer="[parent dir here]", but it doesn't make any difference. wget reports "Error 404 not found", even though I can view the site (and download files) just fine using a web browser.

I've noticed that whatever web server the site is using, it outputs links without paths, i.e.

<a href="%D0%A0%D0%B0%D0%B1%D0%BE%D1%82%D0%B0%20%D1%81%20USB%20Flash%20%D0%BA%D0%B0%D0%BA%20%D1%81%20%D0%B6%D1%91%D1%81%D1%82%D0%BA%D0%B8%D0%BC%20%D0%B4%D0%B8%D1%81%D0%BA%D0%BE%D0%BC.rtf">

(in the browser's view-source window, this link points to
http://exit.ktnet.kg/Distr2/_Release/Flash%20controller/%D0%A0%D0%B0%D0%B1%D0%BE%D1%82%D0%B0%20%D1%81%20USB%20Flash%20%D0%BA%D0%B0%D0%BA%20%D1%81%20%D0%B6%D1%91%D1%81%D1%82%D0%BA%D0%B8%D0%BC%20%D0%B4%D0%B8%D1%81%D0%BA%D0%BE%D0%BC.rtf)
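
To illustrate what such a link means, here is a minimal Python sketch of how a path-less href like the one above resolves against the page URL, and what file name it decodes to. It assumes the escapes are UTF-8 and uses only the standard library's urllib.parse. The variable names are illustrative, and this shows standard relative-URL resolution, not necessarily what wget 1.11.4 does internally.

```python
from urllib.parse import unquote, urljoin

# Page the link was found on (the directory URL quoted in this post).
base = "http://exit.ktnet.kg/Distr2/_Release/Flash%20controller/"

# The href exactly as the server emits it: no path, percent-encoded bytes.
href = (
    "%D0%A0%D0%B0%D0%B1%D0%BE%D1%82%D0%B0%20%D1%81%20USB%20Flash%20"
    "%D0%BA%D0%B0%D0%BA%20%D1%81%20%D0%B6%D1%91%D1%81%D1%82%D0%BA%D0%B8%D0%BC%20"
    "%D0%B4%D0%B8%D1%81%D0%BA%D0%BE%D0%BC.rtf"
)

# A relative reference with no path is resolved against the base directory.
absolute = urljoin(base, href)

# Decode the escapes (assumed UTF-8) to see the human-readable file name.
filename = unquote(href)

print(absolute)
print(filename)
```

If the resolved URL matches what the browser requests, the link itself is well formed, and the 404 would have to come from how the request is built or sent rather than from the page's HTML.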


FWIW, the full command is

wget -m -np -k -c --user-agent="Mozilla/5.0 (X11; Linux i686; rv:2.0.1) Gecko/20100101 Firefox/4.0.1" --no-http-keep-alive --referer="http://exit.ktnet.kg/Distr2/_Release/" --keep-session-cookies "http://exit.ktnet.kg/Distr2/_Release/Flash%20controller/"

Thoughts?

FC


