bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] Tilde issue with recursive download when IRI is enabled a


From: William Prescott
Subject: Re: [Bug-wget] Tilde issue with recursive download when IRI is enabled and a page uses Shift JIS
Date: Fri, 17 Feb 2017 06:13:07 -0500

Yes, it still does the same even when explicitly setting --local-encoding.
It actually does make it to the linked file eventually, but it saves it to the
wrong folder (this can be seen in the output below).

----------------------------------------
On server:
~foo/index.html
~foo/baz.html (empty)

~foo/index.html contains
<meta http-equiv="Content-Type" content="text/html;charset=Shift_JIS">
<a href="baz.html">Baz</a>

Output:
$ wget -np -r --local-encoding=utf-8 'http://127.0.0.1/~foo/'
--2017-02-17 06:04:02--  http://127.0.0.1/~foo/
Connecting to 127.0.0.1:80... connected.
HTTP request sent, awaiting response... 200 OK
Length: 98 [text/html]
Saving to: ‘127.0.0.1/~foo/index.html’

127.0.0.1/~foo/index.html

2017-02-17 06:04:02 (3.76 MB/s) - ‘127.0.0.1/~foo/index.html’ saved [98/98]

Loading robots.txt; please ignore errors.
--2017-02-17 06:04:02--  http://127.0.0.1/robots.txt
Reusing existing connection to 127.0.0.1:80.
HTTP request sent, awaiting response... 404 Not Found
2017-02-17 06:04:02 ERROR 404: Not Found.

--2017-02-17 06:04:02--  http://127.0.0.1/%E2%80%BEfoo/baz.html
Reusing existing connection to 127.0.0.1:80.
HTTP request sent, awaiting response... 404 Not Found
2017-02-17 06:04:02 ERROR 404: Not Found.

--2017-02-17 06:04:02--  http://127.0.0.1/~foo/baz.html
Reusing existing connection to 127.0.0.1:80.
HTTP request sent, awaiting response... 200 OK
Length: 0 [text/html]
Saving to: ‘127.0.0.1/‾foo/baz.html’

127.0.0.1/‾foo/baz.html

2017-02-17 06:04:02 (0.00 B/s) - ‘127.0.0.1/‾foo/baz.html’ saved [0/0]

FINISHED --2017-02-17 06:04:02--
Total wall clock time: 0.005s
Downloaded: 2 files, 98 in 0s (3.76 MB/s)



reply via email to

[Prev in Thread] Current Thread [Next in Thread]