[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Bug-wget] [bug #47689] Support for UTF-16 encoding.

From: kenorb
Subject: [Bug-wget] [bug #47689] Support for UTF-16 encoding.
Date: Wed, 13 Apr 2016 18:42:53 +0000
User-agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_11_2) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/49.0.2623.87 Safari/537.36


                 Summary: Support for UTF-16 encoding.
                 Project: GNU Wget
            Submitted by: kenorb
            Submitted on: Wed 13 Apr 2016 06:42:52 PM GMT
                Category: Localization
                Severity: 3 - Normal
                Priority: 5 - Normal
                  Status: None
                 Privacy: Public
             Assigned to: None
         Originator Name: 
        Originator Email: 
             Open/Closed: Open
         Discussion Lock: Any
                 Release: 1.16.3
        Operating System: Mac OS
         Reproducibility: Every Time
           Fixed Release: None
         Planned Release: None
              Regression: None
           Work Required: None
          Patch Included: None



The following site has UTF-16 encoding:
W3C claim it's UTF-16LE, but it's not relevant.

By default wget doesn't recognise the source of it, because it's not following
any links when using with -m or -r.

When specifying remote-encoding, it doesn't work either:

$ wget --remote-encoding=UTF-16 http://www.free-energy-info.co.uk/
This version does not have support for IRIs

The same for any format, including when specifying `--no-iri`.

What should be the fix in order that encoding of that site can be parsed by

Related: http://stackoverflow.com/q/36605946/55075


Reply to this item at:


  Message sent via/by Savannah

reply via email to

[Prev in Thread] Current Thread [Next in Thread]