bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Bug-wget] [bug #55603] Naive comment type in use by webdevs not recogni


From: anonymous
Subject: [Bug-wget] [bug #55603] Naive comment type in use by webdevs not recognized
Date: Tue, 29 Jan 2019 10:56:14 -0500 (EST)
User-agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/71.0.3578.98 Safari/537.36

URL:
  <https://savannah.gnu.org/bugs/?55603>

                 Summary: Naive comment type in use by webdevs not recognized
                 Project: GNU Wget
            Submitted by: None
            Submitted on: Tue 29 Jan 2019 03:56:13 PM UTC
                Category: Program Logic
                Severity: 3 - Normal
                Priority: 5 - Normal
                  Status: None
                 Privacy: Public
             Assigned to: None
         Originator Name: 
        Originator Email: 
             Open/Closed: Open
         Discussion Lock: Any
                 Release: 1.20
        Operating System: Microsoft Windows
         Reproducibility: Every Time
           Fixed Release: None
         Planned Release: None
              Regression: None
           Work Required: None
          Patch Included: No

    _______________________________________________________

Details:

Some webdevs do comments like this:

<!--comment comment--!>

It displays correctly in web browsers (chrome and firefox checked). Wget does
not recognize the termination.

I invoke Wget with:

start wget -nd -r -l 0 --reject-regex sort=nam -I /neopets
http://lost.quiggle.org/

Using "--strict-comments" fixes it for this particular website. I don't know
how it sees the comments in this mode, is it 

"<!--comment>"?

The only occurrence of a "-->" is towards the end of the webpage and that is
what is throwing naive comment parsing off. Maybe a strict html comment just
can't be pieced together on the website.

I also suspect that naive comments mode parses this block differently from
chrome, not sure though.

<script type="text/javascript"><!--
google_ad_client = "ca-pub-2676160449557179";
/* indexbase */
google_ad_slot = "6483197392";
google_ad_width = 300;
google_ad_height = 250;
//-->
</script>
 
chrome does not see a html comment there

log.txt attached



    _______________________________________________________

File Attachments:


-------------------------------------------------------
Date: Tue 29 Jan 2019 03:56:13 PM UTC  Name: log.txt  Size: 49KiB   By: None

<http://savannah.gnu.org/bugs/download.php?file_id=46125>

    _______________________________________________________

Reply to this item at:

  <https://savannah.gnu.org/bugs/?55603>

_______________________________________________
  Message sent via Savannah
  https://savannah.gnu.org/




reply via email to

[Prev in Thread] Current Thread [Next in Thread]