Re: [Bug-wget] wget mirror site failing due to file / directory name cla

bug-wget

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] wget mirror site failing due to file / directory name cla

From:	Micah Cowan
Subject:	Re: [Bug-wget] wget mirror site failing due to file / directory name clashes
Date:	Fri, 12 Oct 2012 13:50:03 -0700
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:15.0) Gecko/20120827 Thunderbird/15.0

On 10/12/2012 06:38 AM, Paul Beckett (ITCS) wrote:
> I am attempting to use wget to create a mirrored copy of a CMS (Liferay) 
> website. I want to be able to failover to this static copy in case the 
> application server goes offline. I therefore need the URL's to remain 
> absolutely identical. The problem I have is that I cannot figure out how I 
> can configure wget in a way that will cope with:
> http://www.example.com/about
> http://www.example.com/about/something
> 
> In this case either the file or directory 'about' already exists at prevents 
> the second being created.

Further discussion/info about this problem:

http://savannah.gnu.org/bugs/?func=detailitem&item_id=23756
http://savannah.gnu.org/bugs/?func=detailitem&item_id=29647

> 
> Initially I though the most obvious solution, was to rely on Apache's 
> DirectoryIndex, and save the files as:
> /about/index.html
> /about/something/index.html
> 
> But, currently I can't figure out how I can do this in a way that doesn't 
> break either the relative path to other pages or create links to the 
> index.html rather than the original location. I need the links (a href etc.) 
> to still go to /about and not explicitly call /index.html - as this will mean 
> people may bookmark things that won't exist when the CMS came back.

Why not use links like /about/, rather than /about? Then it should
hopefully work for both cases.

-mjc

[Prev in Thread]

Current Thread

[Next in Thread]

[Bug-wget] wget mirror site failing due to file / directory name clashes, Paul Beckett (ITCS), 2012/10/12
- Re: [Bug-wget] wget mirror site failing due to file / directory name clashes, Micah Cowan <=
- Re: [Bug-wget] wget mirror site failing due to file / directory name clashes, Ángel González, 2012/10/13
  - Re: [Bug-wget] wget mirror site failing due to file / directory name clashes, Paul Beckett (ITCS), 2012/10/15
    - Re: [Bug-wget] wget mirror site failing due to file / directory name clashes, Ángel González, 2012/10/25
- Re: [Bug-wget] wget mirror site failing due to file / directory name clashes, Tim Ruehsen, 2012/10/16
  - Re: [Bug-wget] wget mirror site failing due to file / directory name clashes, Paul Beckett (ITCS), 2012/10/16

Prev by Date: [Bug-wget] wget mirror site failing due to file / directory name clashes
Next by Date: Re: [Bug-wget] [PATCH] Change growable string object to null terminate
Previous by thread: [Bug-wget] wget mirror site failing due to file / directory name clashes
Next by thread: Re: [Bug-wget] wget mirror site failing due to file / directory name clashes
Index(es):
- Date
- Thread