bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] downloading all files on page (with identical filenames)


From: Micah Cowan
Subject: Re: [Bug-wget] downloading all files on page (with identical filenames)
Date: Sun, 25 Jul 2010 02:10:12 -0700
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1.9) Gecko/20100423 Thunderbird/3.0.4

On 07/24/2010 11:15 AM, Vinh Nguyen wrote:
> Dear list,
> 
> I'm using wget 1.12 on ubuntu 10.04.  I don't know if this is a bug or
> not.  I'm using
> 
> wget -U firefox -r -l1 -nd -e robots=off -A.pdf http://example.com
> 
> to download pdf's off a page.  The dilemma is that a lot of the pdf
> links on the page has the same name (example.pdf).  Wget is supposed
> to append .1, .2, etc, to those files.  However, with the above
> command, only .1 is appended, and hence, only one file with .1 is
> seen.  If I set "-A.pdf,.pdf.1", then .1 and .2 gets appended, but .2
> gets repeated and only one .2 file is available at the end.
> 
> Are some of my arguments conflicting?

Looks like that blasted delete-after logic again: it's because after the
rename, the files no longer match -A.pdf, so they get deleted (not sure
how you still have a .pdf.1 at all at the end, unless you're
interrupting wget before it gets a chance to delete it). As a
workaround, you should be able to use something like -A '*.pdf,*.pdf.*'

-- 
Micah J. Cowan
http://micah.cowan.name/



reply via email to

[Prev in Thread] Current Thread [Next in Thread]