[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Bug-wget] Website with broken img tags; browsers can handle it but
From: |
Keisial |
Subject: |
Re: [Bug-wget] Website with broken img tags; browsers can handle it but wget can not. |
Date: |
Sat, 29 May 2010 23:06:33 +0200 |
User-agent: |
Thunderbird |
Alexander Lane wrote:
> I've encountered a website that does not put the ">" at the end of
> some of its img tags. Wget skips downloading those images as a result,
> but I checked several web browsers & they were all able to cope with
> it.
>
> I don't know whether this was done in an attempt to break automated
> downloading or if it's just bad HTML.
>
> Here's what they look like:
>
> <p><img src="something/something1.jpg" border="1" width="1060"
> height="1592"</p>
>
> Is there any way I can make wget recognize & follow these malformed img tags?
>
> Thanks,
> Alex
>
I think that SGML allowed such kind of (mis)behavior.
I don't see the appropiate rule in HTML5 spec, though.