bug-wget
[Bug-wget] How do I tell wget not to follow links in a file?


From: David Skalinder
Subject: [Bug-wget] How do I tell wget not to follow links in a file?
Date: Sat, 2 Apr 2011 08:23:57 +0100
User-agent: SquirrelMail/1.4.21

Hello,

I'm trying to use wget to do something that seems very simple, but I
haven't been able to find a solution anywhere and I'm hoping someone here
could point me in the right direction.

I want to mirror part of a website that contains two links pages, each of
which contains links to many root-level directories and also to the other
links page.  I want to download recursively all the links from one links
page, but not from the other: that is, I want to tell wget "download
links1 and follow all of its links, but do not download or follow links
from links2".

I've put a demo of this problem up at http://fangjaw.com/wgettest -- there
is a diagram there that might state the problem more clearly.

This functionality seems so basic that I assume I must be overlooking
something.  Clearly wget has been designed to give users control over
which files they download; but all I can find is that -X controls both
saving and link-following at the directory level, while -R controls saving
at the file level but still follows links from unsaved files.
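
To make the distinction concrete, here is a rough toy model in Python of the two behaviours described above. It is only a sketch of the semantics, not how wget is implemented: the page names, the `SITE` link graph, and the `crawl` function with its `follow_rejected` switch are all made up for illustration and do not match the real demo layout.

```python
from collections import deque

# Hypothetical toy site: page -> pages it links to (illustrative only).
SITE = {
    "links1": ["links2", "dirA/index", "dirB/index"],
    "links2": ["links1", "dirC/index", "dirD/index"],
    "dirA/index": [],
    "dirB/index": [],
    "dirC/index": [],
    "dirD/index": [],
}

def crawl(site, start, rejected, follow_rejected):
    """Return the set of pages saved by a breadth-first crawl from `start`.

    Pages in `rejected` are never saved. If `follow_rejected` is True,
    their links are still followed (the -R behaviour described above);
    if False, they are skipped entirely (the behaviour being asked for).
    """
    saved = set()
    seen = {start}
    queue = deque([start])
    while queue:
        page = queue.popleft()
        if page not in rejected:
            saved.add(page)
        elif not follow_rejected:
            continue  # neither save the page nor follow its links
        for target in site.get(page, []):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return saved

# Rejected page is not saved, but its links are still followed.
like_R = crawl(SITE, "links1", {"links2"}, follow_rejected=True)
# Rejected page is neither saved nor followed.
wanted = crawl(SITE, "links1", {"links2"}, follow_rejected=False)
```

In this model `like_R` still ends up containing the directories reachable only through links2, while `wanted` contains just links1 and the directories it links to directly.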

Is there an obvious solution I'm missing?  Or a section of the manual I
haven't found?

Thanks in advance,

Fang

(PS: I'm using wget 1.12.)