bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Bug-wget] [bug #47689] Support parsing of UTF-16 HTML encoding


From: HB
Subject: [Bug-wget] [bug #47689] Support parsing of UTF-16 HTML encoding
Date: Fri, 8 Jul 2016 06:46:06 +0000 (UTC)
User-agent: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Ubuntu Chromium/51.0.2704.79 Chrome/51.0.2704.79 Safari/537.36

Follow-up Comment #4, bug #47689 (project wget):

>The following site has UTF-16 encoding: 
>http://www.free-energy-info.co.uk/ 
>W3C claim it's UTF-16LE, but it's not relevant. 

I have just spent the last two hours trying to wget this same site. When I
finally figured out that it wasn't working because of UTF-16 I googled how to
get wget to support UTF-16 and found this bug.

I want to mirror this site but wget finds no urls to follow in the index.html


I was able to convert the index.html to UTF-8 but no way that I know of to
easily feed that back to wget for mirroring.

Pls advise. Or fix. Thanks

    _______________________________________________________

Reply to this item at:

  <http://savannah.gnu.org/bugs/?47689>

_______________________________________________
  Message sent via/by Savannah
  http://savannah.gnu.org/




reply via email to

[Prev in Thread] Current Thread [Next in Thread]