[Bug-wget] Problem mirroring site with two domain names
From: Chris Dorsey
Subject: [Bug-wget] Problem mirroring site with two domain names
Date: Thu, 19 May 2011 10:30:50 +1200
I am trying to mirror a web site that has two domain names; let's call them
www.abc.com and www.abcdef.com. Both names lead to the same site. If I browse the
site in IE I can see that some hyperlinks point to http://www.abc.com/... and some
point to http://www.abcdef.com/...
I am using this command line:
wget.exe -r -l inf -w 10 --random-wait -E -k -K -N -H -D abcdef.com,abc.com -o wgetlog.txt http://abc.com/
What I get is two directories named www.abc.com/ and www.abcdef.com/ with
almost identical contents. The content has effectively been downloaded twice.
What I want to do is make a single mirror copy of www.abc.com, with all the
references to www.abcdef.com treated as references to www.abc.com when the
links are converted in the local copy (-k).
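To make the rewriting I have in mind concrete, here is a minimal Python sketch. It is not a wget feature, only an illustration of the mapping I would like applied to each link. The function name canonicalize and the ALIASES table are made up for this example, and the host names are the placeholders used above.

```python
from urllib.parse import urlsplit, urlunsplit

# Hypothetical alias table: links to the alias host should be treated
# as links to the canonical host.
ALIASES = {"www.abcdef.com": "www.abc.com"}


def canonicalize(url, aliases=ALIASES):
    """Return url with an alias host replaced by its canonical host.

    Relative URLs and URLs on other hosts are returned unchanged.
    Any user:password part of the URL is dropped in this sketch.
    """
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()  # hostname is None for relative URLs
    target = aliases.get(host)
    if target is None:
        return url
    # Keep an explicit port if the original URL had one.
    netloc = target if parts.port is None else f"{target}:{parts.port}"
    return urlunsplit(
        (parts.scheme, netloc, parts.path, parts.query, parts.fragment)
    )


if __name__ == "__main__":
    print(canonicalize("http://www.abcdef.com/news/index.html"))
    print(canonicalize("http://www.abc.com/news/index.html"))
```

With this mapping both example links would end up pointing into www.abc.com, so only one directory would be needed in the local copy.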
Any ideas?
Chris Dorsey