bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] downloading all files on page (with identical filenames)


From: Vinh Nguyen
Subject: Re: [Bug-wget] downloading all files on page (with identical filenames)
Date: Sun, 25 Jul 2010 11:59:08 -0700

On Sun, Jul 25, 2010 at 2:10 AM, Micah Cowan <address@hidden> wrote:
> On 07/24/2010 11:15 AM, Vinh Nguyen wrote:
>> Dear list,
>>
>> I'm using wget 1.12 on ubuntu 10.04.  I don't know if this is a bug or
>> not.  I'm using
>>
>> wget -U firefox -r -l1 -nd -e robots=off -A.pdf http://example.com
>>
>> to download pdf's off a page.  The dilemma is that a lot of the pdf
>> links on the page has the same name (example.pdf).  Wget is supposed
>> to append .1, .2, etc, to those files.  However, with the above
>> command, only .1 is appended, and hence, only one file with .1 is
>> seen.  If I set "-A.pdf,.pdf.1", then .1 and .2 gets appended, but .2
>> gets repeated and only one .2 file is available at the end.
>>
>> Are some of my arguments conflicting?
>
> Looks like that blasted delete-after logic again: it's because after the
> rename, the files no longer match -A.pdf, so they get deleted (not sure
> how you still have a .pdf.1 at all at the end, unless you're
> interrupting wget before it gets a chance to delete it). As a
> workaround, you should be able to use something like -A '*.pdf,*.pdf.*'
>

Thanks Micah, this works.

> --
> Micah J. Cowan
> http://micah.cowan.name/
>



reply via email to

[Prev in Thread] Current Thread [Next in Thread]