Re: [Bug-wget] wget confused by URL


From: Yousong Zhou
Subject: Re: [Bug-wget] wget confused by URL
Date: Fri, 21 Feb 2014 16:12:47 +0800 (China Standard Time)
User-agent: Alpine 2.11 (WNT 23 2013-08-11)

Hi

`address@hidden' should be CC-ed so the list can see this conversation.

On Fri, 21 Feb 2014, James Macomber wrote:

> It looks like the commands are processing the URL in the input file as the 
> output target based on the "Saving to:" line.  I am wondering
> if the "-1" values are throwing the command processor off.  I am not sure how 
> or if that is even possible though.
> 

It's expected, since you specified the `-r' option and wget tried to 
construct a hierarchy of directories as the URL indicates.  Without it, 
wget will try to save the file under the name `-1'.  You can also disable 
this behaviour with `--no-directories'.
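
As a rough illustration of the naming behaviour described above, here is a
minimal Python sketch.  It is not wget's actual code; the function name
`local_path' and its parameters are hypothetical and only mimic the effect
of `-r' and `--no-directories' on the output name.

```python
import posixpath
from urllib.parse import urlsplit


def local_path(url, recursive=False, no_directories=False):
    """Approximate the local file name chosen for a URL (illustrative only)."""
    parts = urlsplit(url)
    path = parts.path.lstrip("/")
    if recursive and not no_directories:
        # With -r, the host name and every path component become directories.
        return posixpath.join(parts.netloc, path)
    # Otherwise only the last path component is used as the file name.
    return posixpath.basename(path) or "index.html"


url = ("http://u.eq2wire.com/soe/item_search_link/Arcane/95/"
       "-1/-1/-1/-1/-1/-1/Armor/Fury/-1/-1/-1/-1/-1/-1/-1/-1/-1")
print(local_path(url, recursive=True))
print(local_path(url))
```

The first call yields the long path shown in the "Saving to:" line, while
the second yields just `-1'.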

If you are talking about the size of the downloaded file being zero, that's 
because the response body of that URL is actually empty.  Your browser 
was redirected to another URL, as indicated by the `Refresh' header, but 
wget does not seem to be aware of this header currently.
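
One possible workaround is to read the `Refresh' header yourself and fetch
the URL it names.  The Python sketch below only parses the header value; the
helper name `refresh_target' is hypothetical, and the parsing covers the
simple `delay;url=...' form seen in the log, not every variant browsers
accept.

```python
from urllib.parse import urljoin


def refresh_target(header_value, base_url):
    """Return (delay_seconds, target_url) for a Refresh header, or None."""
    delay_part, _, rest = header_value.partition(";")
    try:
        delay = float(delay_part.strip())
    except ValueError:
        return None  # Not in the "delay;url=..." form handled here.
    rest = rest.strip()
    if rest.lower().startswith("url="):
        rest = rest[4:].strip().strip("'\"")
    if not rest:
        # No URL given: the header asks for a reload of the same page.
        return delay, base_url
    # Resolve relative targets against the URL that was requested.
    return delay, urljoin(base_url, rest)


requested = ("http://u.eq2wire.com/soe/item_search_link/Arcane/95/"
             "-1/-1/-1/-1/-1/-1/Armor/Fury/-1/-1/-1/-1/-1/-1/-1/-1/-1")
header = "0;url=http://u.eq2wire.com/soe/item_search_results"
print(refresh_target(header, requested))
```

The resulting URL could then be requested separately, most likely together
with the session cookie from the first response, since the results page
appears to depend on it.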


                yousong

> 
> On Fri, Feb 21, 2014 at 2:21 AM, Yousong Zhou <address@hidden> wrote:
>       Hi,
> 
>       On Thu, 20 Feb 2014, James Macomber wrote:
> 
>       > Hi,
>       >
>       > May be my n00bness, but I can't seem to get the syntax right for this
>       > command or the command is getting confused by my values.
>       >
>       > I am using the win86_64 version 1.11.4.
>       >
>       > I am calling wget -r -i C:\Users\macombej\Desktop\wgeturl.txt -S -o
>       > C:\Users\macombej\Desktop\wgetresponse.txt
>       >
>       > wgeturl.txt looks like this:
>       >
>       > http://u.eq2wire.com/soe/item_search_link/Arcane/95/-1/-1/-1/-1/-1/-1/Armor/Fury/-1/-1/-1/-1/-1/-1/-1/-1/-1
>       >
>       > I have tried it with username/password in the proper syntax for the
>       > above URL, but this doesn't seem to matter either.
>       >
>       > and wgetresponse.txt shows this:
>       >
>       > --2014-02-20 22:16:23--  http://u.eq2wire.com/soe/item_search_link/Arcane/95/-1/-1/-1/-1/-1/-1/Armor/Fury/-1/-1/-1/-1/-1/-1/-1/-1/-1
>       > Resolving u.eq2wire.com... 67.23.252.182
>       > Connecting to u.eq2wire.com|67.23.252.182|:80... connected.
>       > HTTP request sent, awaiting response...
>       >   HTTP/1.1 200 OK
>       >   Date: Fri, 21 Feb 2014 03:16:45 GMT
>       >   Server: Apache
>       >   X-Powered-By: PHP/5.4.23
>       >   Refresh: 0;url=http://u.eq2wire.com/soe/item_search_results
> 
>       Looks like wget didn't understand this header very well?
> 
> 
>                       yousong
> 
>       >   Set-Cookie: ci_session=a%3A4%3A%7Bs%3A10%3A%22session_id%22%3Bs%3A32%3A%22225c5f724a6c93947f470361ed6c37e8%22%3Bs%3A10%3A%22ip_address%22%3Bs%3A14%3A%22108.48.199.124%22%3Bs%3A10%3A%22user_agent%22%3Bs%3A11%3A%22Wget%2F1.11.4%22%3Bs%3A13%3A%22last_activity%22%3Bi%3A1392952605%3B%7D2fdb4ad7da33521f95643c3980fe9922; expires=Sat, 22-Feb-2014 03:16:45 GMT; path=/
>       >   Set-Cookie: ci_session=a%3A4%3A%7Bs%3A10%3A%22session_id%22%3Bs%3A32%3A%22225c5f724a6c93947f470361ed6c37e8%22%3Bs%3A10%3A%22ip_address%22%3Bs%3A14%3A%22108.48.199.124%22%3Bs%3A10%3A%22user_agent%22%3Bs%3A11%3A%22Wget%2F1.11.4%22%3Bs%3A13%3A%22last_activity%22%3Bi%3A1392952605%3B%7D2fdb4ad7da33521f95643c3980fe9922; expires=Sat, 22-Feb-2014 03:16:45 GMT; path=/
>       >   Vary: Accept-Encoding
>       >   Content-Length: 0
>       >   Connection: close
>       >   Content-Type: text/html
>       > Length: 0 [text/html]
>       > Saving to: `u.eq2wire.com/soe/item_search_link/Arcane/95/-1/-1/-1/-1/-1/-1/Armor/Fury/-1/-1/-1/-1/-1/-1/-1/-1/-1'
>       >
>       >      0K                                                        0.00 =0s
>       >
>       > 2014-02-20 22:16:23 (0.00 B/s) - `u.eq2wire.com/soe/item_search_link/Arcane/95/-1/-1/-1/-1/-1/-1/Armor/Fury/-1/-1/-1/-1/-1/-1/-1/-1/-1' saved [0/0]
>       >
>       > I have compared this to wireshark captures and these are the first two
>       > cookies that get pulled, but all the rest of the html code values are
>       > not getting pulled.
>       >
>       > Any idea what I am missing or why this may not pull the page values I
>       > get with the same URL in a browser?
>       >
> 
> 
> 
> 