Re: [Bug-wget] --spider output interpretation

From: Ángel González
Subject: Re: [Bug-wget] --spider output interpretation
Date: Sat, 22 Jun 2013 00:32:14 +0200
User-agent: Thunderbird

On 21/06/13 17:47, andreas wpv wrote:
Can you help me understand what this means? I am just pulling the URLs to
build a sitemap, but there is much more in the output, and I cannot find
any documentation of the details:

wget --spider --recursive --no-verbose --no-parent -t 3
--output-file=wgetlog.txt $URL

2013-06-17 18:56:54 URL:http:/domain/file/19[149408/149408] ->
"domain/file/19.1" [1]

Bunch of questions:
1. what do the numbers in brackets mean?
That's the page length: wget downloaded 149408 out of 149408 bytes (if the
server hadn't provided a Content-Length, you would only see one number there).

2. why is there a second URL?
It isn't a URL. domain/file/19.1 is the local file where wget would store
the web page (actually, --spider deletes the files after downloading, so
you will have a "domain" folder sitting there).

3. why is the second URL modified with a '.1' at the end?
There was already a file called "domain/file/19", so the page was downloaded
to "domain/file/19.1" instead.
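
To illustrate the naming rule, here is a minimal sketch in Python. It is not wget's actual code; the function name unique_name and the exists parameter are made up for this example. It only mimics the behaviour described above: keep the name if it is free, otherwise append the first free numeric suffix.

```python
import os


def unique_name(path, exists=os.path.exists):
    """Return path if it is unused, else the first free "path.N" (N = 1, 2, ...).

    Hypothetical helper that imitates the observed naming, not wget's code.
    `exists` is injectable so the rule can be tried without touching the disk.
    """
    if not exists(path):
        return path
    n = 1
    while exists(f"{path}.{n}"):
        n += 1
    return f"{path}.{n}"


# Pretend "domain/file/19" is already present in the download directory.
taken = {"domain/file/19"}
print(unique_name("domain/file/19", exists=taken.__contains__))
```

With "domain/file/19" already taken, the sketch picks "domain/file/19.1", which matches the name in your log line.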

4. what does the '[1]' at the end mean? Error code is kind of unlikely,
That's the try count. The page was downloaded on the first attempt.
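
Since you only need the URLs for a sitemap, something along these lines could pull them out of wgetlog.txt. This is a rough sketch, assuming every entry sits on a single line in the log file and follows the shape of your sample; the regular expression and the names parse_line and urls_from_log are mine, not part of wget.

```python
import re

# Shape assumed from the sample line:
#   DATE TIME URL:<url> [<got>/<expected>] -> "<local file>" [<tries>]
# The "/<expected>" part is optional (absent when there is no Content-Length).
LINE_RE = re.compile(
    r'^(?P<date>\d{4}-\d{2}-\d{2}) (?P<time>\d{2}:\d{2}:\d{2}) '
    r'URL:\s*(?P<url>\S+?)\s*'
    r'\[(?P<got>\d+)(?:/(?P<expected>\d+))?\]\s*->\s*'
    r'"(?P<file>[^"]*)"\s*\[(?P<tries>\d+)\]\s*$'
)


def parse_line(line):
    """Return the fields of one log entry as a dict, or None if it does not match."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    rec = m.groupdict()
    rec["got"] = int(rec["got"])
    rec["expected"] = int(rec["expected"]) if rec["expected"] is not None else None
    rec["tries"] = int(rec["tries"])
    return rec


def urls_from_log(lines):
    """Collect the URLs of all matching entries, in order, without duplicates."""
    seen = []
    for line in lines:
        rec = parse_line(line)
        if rec is not None and rec["url"] not in seen:
            seen.append(rec["url"])
    return seen


sample = ('2013-06-17 18:56:54 URL:http:/domain/file/19[149408/149408] -> '
          '"domain/file/19.1" [1]')
print(parse_line(sample))
```

To run it on the real log, something like `urls_from_log(open("wgetlog.txt"))` should do; lines that do not match the pattern are skipped.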

