[Bug-wget] Problem mirroring site with two domain names


From: Chris Dorsey
Subject: [Bug-wget] Problem mirroring site with two domain names
Date: Thu, 19 May 2011 10:30:50 +1200

I am trying to mirror a web site that has two domain names; let's call them 
www.abc.com and www.abcdef.com. Both names serve the same site. When I browse 
the site in IE, some hyperlinks point to http://www.abc.com/... and others 
point to http://www.abcdef.com/...

I am using this command line:

wget.exe -r -l inf -w 10 --random-wait -E -k -K -N -H -D abcdef.com,abc.com -o wgetlog.txt http://abc.com/

What I get is two directories named www.abc.com/ and www.abcdef.com/ with 
almost identical contents. The content has effectively been downloaded twice.

What I want to do is make a single mirror copy of www.abc.com, with all the 
references to www.abcdef.com treated as references to www.abc.com when the 
links are converted in the local copy (-k).
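
To make the mapping I have in mind concrete, here is a minimal Python sketch 
of it, written as a post-processing step over text that has already been 
downloaded. It is only an illustration of the desired behaviour, not something 
wget itself does. The host names are the placeholders used above; the function 
names and the choice to also treat the bare abcdef.com as an alias are 
assumptions made for the example.

```python
import re
from urllib.parse import urlsplit, urlunsplit

# Placeholder hosts from the message; the alias set is an assumption.
CANONICAL_HOST = "www.abc.com"
ALIAS_HOSTS = {"www.abcdef.com", "abcdef.com"}


def canonicalize_url(url: str) -> str:
    """Return url with an alias host swapped for the canonical host.

    URLs whose host is not an alias (including relative URLs) are
    returned unchanged. Any user:password part is dropped on rewrite.
    """
    parts = urlsplit(url)
    host = parts.hostname  # lower-cased by urlsplit, None if absent
    if host is None or host not in ALIAS_HOSTS:
        return url
    netloc = CANONICAL_HOST
    if parts.port is not None:
        netloc += f":{parts.port}"  # keep an explicit port if present
    return urlunsplit(parts._replace(netloc=netloc))


# Matches quoted href/src attribute values. A regex is a rough tool for
# HTML; it is used here only to keep the sketch short.
LINK_RE = re.compile(
    r"""(?P<attr>\b(?:href|src)\s*=\s*)(?P<q>["'])(?P<url>.*?)(?P=q)""",
    re.IGNORECASE,
)


def rewrite_links(html: str) -> str:
    """Rewrite alias-host links in an HTML string to the canonical host."""
    def repl(m: re.Match) -> str:
        q = m.group("q")
        return f"{m.group('attr')}{q}{canonicalize_url(m.group('url'))}{q}"
    return LINK_RE.sub(repl, html)


if __name__ == "__main__":
    sample = '<a href="http://www.abcdef.com/news/index.html">News</a>'
    print(rewrite_links(sample))
```

Only the host part of each link is changed; the path, query and fragment are 
left as they are, which is what I would expect the converted local links to be 
built from.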

Any ideas?


Chris Dorsey


