bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] How remove all after ? or @ (REAL PROBLEM)


From: Dale R. Worley
Subject: Re: [Bug-wget] How remove all after ? or @ (REAL PROBLEM)
Date: Tue, 23 Aug 2016 11:07:19 -0400

address@hidden (Dale R. Worley) writes:
> Павел Серегов <address@hidden> writes:
>> file on server http://site.com/style.css?v1000
>> downloaded file address@hidden
>>
>> How remove @v1000
>> I want result: style.css (without @v1000)
>
> The easiest way is to specify the output file name you want with
> "--output-file=style.css".

Correction, that should be "--output-document=style.css" (for a single file).

Павел Серегов <address@hidden> writes:
> The file is not one. I'm downloading the whole site.
> wget -m -E -o wget_log.txt http://www.store-discount.ru/

The problem is that the URL contains a query part ("?v1000"), and wget
needs to record that in the file name.  In general, it is possible that
wget will download both http://site.com/style.css?v1000 and
http://site.com/style.css?v2000, and it needs separate file names for
both.

wget does not provide a facility to adjust the names of the downloaded
files, except in a few particular ways.  You will probably have to first
download all the files and then run a program to rename the downloaded
files to the names you want.

> =============================
> Next problem "duplicate". if use "-E"
> 
> without:
> address@hidden
> 
> with "-E"
> address@hidden
> =============================

That is the behavior that is prescribed for -E.  Similarly, you will
probably have to run a separate program to rename the files.

Dale



reply via email to

[Prev in Thread] Current Thread [Next in Thread]