bug-wget

Re: [Bug-wget] wget and srcset tag


From: Tim Rühsen
Subject: Re: [Bug-wget] wget and srcset tag
Date: Mon, 12 Jun 2017 16:35:48 +0200
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:45.0) Gecko/20100101 Thunderbird/45.8.0

On 06/12/2017 10:27 AM, chris wrote:
> Hi Tim,
> 
> Thanks for your reply, I notice the following in the debug logs:
> 
> """
> will convert url
> http://www.anfractuosity.com/wp-content/uploads/2014/02/fsk.png to local
> site_output/fsk.png
> will convert url
> https://www.anfractuosity.com/wp-content/uploads/2014/02/fsk.png to local
> site_output/fsk.png.html
> """
> 
> The difference between those URLs seems to be one is https and one isn't.
> When I wget those URLs though, both seem to return a .png, with 'Length:
> 51068 (50K) [image/png]'.
> 
> So I'm a bit confused why I get the fsk.png.html URL.

Which version of wget are you using? (I have 1.19.1 here.)

I tried some combinations of srcset entries (with https and http)
together with your original options. I suspect an issue with redirection,
because a redirect response comes with a text/html Content-Type.
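
As a rough illustration of why a text/html response would produce the
fsk.png.html name seen in the debug log, here is a minimal Python sketch of
the renaming rule that the wget manual describes for --adjust-extension. The
function name adjusted_name is made up for this example. It is a simplified
model and not wget's actual code; it ignores other cases such as CSS files.

```python
import re

# Content types that the wget manual lists as triggering the ".html" suffix.
HTML_TYPES = {"text/html", "application/xhtml+xml"}

# Names that already end in .htm or .html (any case) are left alone.
HTML_SUFFIX = re.compile(r"\.[Hh][Tt][Mm][Ll]?$")


def adjusted_name(local_name, content_type):
    """Return the local file name a simplified -E rule would choose.

    Hypothetical helper for illustration only.
    """
    # Drop parameters such as "; charset=UTF-8" before comparing.
    mime = content_type.split(";", 1)[0].strip().lower()
    if mime in HTML_TYPES and not HTML_SUFFIX.search(local_name):
        return local_name + ".html"
    return local_name


if __name__ == "__main__":
    print(adjusted_name("fsk.png", "image/png"))
    print(adjusted_name("fsk.png", "text/html; charset=UTF-8"))
```

Under this model the same fsk.png keeps its name when it arrives as image/png
and only becomes fsk.png.html when the response for that URL is labelled
text/html, which is what a redirect or error page would typically look like.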

Could you create a small reproducer page? For example:
<html><body>
<img
srcset="https://www.anfractuosity.com/wp-content/uploads/2014/02/fsk.png 533w,
http://www.anfractuosity.com/wp-content/uploads/2014/02/fsk-266x300.png 266w">
</body></html>

Use whatever paths you are using for the .png files.
I don't want to download tons of files (limited bandwidth here).
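
To make it easier to see which URLs a srcset value carries, and so which ones
to fetch individually when narrowing this down, here is a small Python sketch
that splits a srcset string into (URL, descriptor) pairs. The name
parse_srcset is hypothetical, and the splitting is deliberately naive: it
assumes no URL contains a comma, which the full HTML parsing rules do allow.

```python
def parse_srcset(value):
    """Split a srcset attribute value into (url, descriptor) pairs.

    Simplified illustration: candidates are separated on commas, and the
    first whitespace-separated field of each candidate is taken as the URL.
    """
    candidates = []
    for part in value.split(","):
        fields = part.split()
        if not fields:
            # Skip empty candidates, e.g. after a trailing comma.
            continue
        url = fields[0]
        descriptor = fields[1] if len(fields) > 1 else ""
        candidates.append((url, descriptor))
    return candidates


if __name__ == "__main__":
    srcset = (
        "https://www.anfractuosity.com/wp-content/uploads/2014/02/fsk.png 533w,\n"
        "http://www.anfractuosity.com/wp-content/uploads/2014/02/fsk-266x300.png 266w"
    )
    for url, descriptor in parse_srcset(srcset):
        print(descriptor, url)
```

Each URL printed this way can then be requested on its own to compare the
Content-Type the server reports for the https and http variants.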

> cheers
> Chris
> 
> On Mon, Jun 12, 2017 at 9:08 AM, Tim Rühsen <address@hidden> wrote:
> 
>> Hi Chris,
>>
>>
>> On 06/11/2017 05:24 PM, chris wrote:
>>> Hi,
>>>
>>> I'm just wondering if I've possibly found a bug, unless I'm just doing
>>> something incorrectly (which I assume is more likely).
>>>
>>> I grab my webpage using 'wget -T1 -t1 -E -k -H -nd -N -p -P site_output
>>> https://www.anfractuosity.com/projects/ultrasound-networking/ > note1 2>
>>> note2'
>>>
>>> But I notice the srcset tags in the resulting downloaded files produce
>>> 'srcset="fsk.png.html 533w, fsk-266x300.png 266w" sizes="(max-width:
>>> 533px) 100vw, 533px" /></a></p>' in the output index.html.
>>>
>>> On the actual webpage it looks like "srcset="
>>> https://www.anfractuosity.com/wp-content/uploads/2014/02/fft.png
>>> 762w,...." with no .html extension on the .png.
>>
>> You requested -E (--adjust-extension) and -k (--convert-links).
>> That would change the file name when the server tags the file as
>> content-type 'text/html'. You could see that in the debug output
>> (options -d or --debug).
>>
>>>
>>> Cheers
>>> Chris
>>>
>>
>> With Best Regards, Tim
