bug-wget

Re: [Bug-wget] Website with broken img tags; browsers can handle it but


From: Keisial
Subject: Re: [Bug-wget] Website with broken img tags; browsers can handle it but wget can not.
Date: Sat, 29 May 2010 23:06:33 +0200
User-agent: Thunderbird

Alexander Lane wrote:
> I've encountered a website that does not put the ">" at the end of
> some of its img tags. Wget skips downloading those images as a result,
> but I checked several web browsers & they were all able to cope with
> it.
>
> I don't know whether this was done in an attempt to break automated
> downloading or if it's just bad HTML.
>
> Here's what they look like:
>
> <p><img src="something/something1.jpg" border="1" width="1060" 
> height="1592"</p>
>
> Is there any way I can make wget recognize & follow these malformed img tags?
>
> Thanks,
> Alex
>   

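I think SGML allowed this kind of (mis)behavior: with its SHORTTAG
feature, a start tag could be left unclosed when another tag followed
it immediately. I don't see the appropriate rule in the HTML5 spec, though.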
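
As a possible workaround until wget copes with such markup, one could
pull the image URLs out of the page with a small script and hand the
list to wget. The following is only a rough sketch under my own
assumptions: the function name extract_img_urls and the regular
expression are made up for illustration, they are not part of wget, and
a regex like this will miss unusual cases (for example a ">" inside an
earlier attribute value).

```python
import re
from urllib.parse import urljoin

# Hypothetical pattern: match "<img ... src=VALUE" without requiring the
# closing ">" of the tag. VALUE may be double-quoted, single-quoted or bare.
IMG_SRC = re.compile(
    r'<img\b[^<>]*?\bsrc\s*=\s*(?:"([^"]*)"|\'([^\']*)\'|([^\s"\'<>]+))',
    re.IGNORECASE,
)


def extract_img_urls(html, base_url):
    """Return absolute URLs for every img src found in html.

    base_url is the URL the page was fetched from; relative src values
    are resolved against it.
    """
    urls = []
    for match in IMG_SRC.finditer(html):
        # Exactly one of the three groups is set, depending on quoting.
        src = match.group(1) or match.group(2) or match.group(3)
        if src:
            urls.append(urljoin(base_url, src))
    return urls
```

Writing the returned URLs one per line to a file (say urls.txt, a
placeholder name) and running "wget -i urls.txt" would then fetch the
images that the recursive download skipped.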




