bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Bug-wget] Save 3 byte utf8 url


From: bes
Subject: [Bug-wget] Save 3 byte utf8 url
Date: Thu, 7 Feb 2013 17:06:47 +0300

Hi,

i found some bug in wget with interpreting and save percent-encoding 3 byte
utf8 url

example:
1. Create url with "—". This is U+2014 (EM DASH). Percent-encoding UTF-8 is
"%E2%80%94"
2. Try wget it: wget "http://example.com/abc—d"; or wget "
http://example.com/abc%E2%80%94d"; directly
3. Wget save this URL to file "abc\342%80%94d". Expected is
"abc%E2%80%94d". This is a bug.

$ wget -V
----------------------------------------
GNU Wget 1.14 built on linux-gnu.

+digest +https -ipv6 -iri +large-file -nls -ntlm +opie +ssl/openssl

Wgetrc:
/etc/wgetrc (system)
Compile: x86_64-pc-linux-gnu-gcc -DHAVE_CONFIG_H
-DSYSTEM_WGETRC="/etc/wgetrc"
-DLOCALEDIR="/usr/share/locale" -I. -I../lib -I../lib -DHAVE_LIBSSL
-O2 -march=native -pipe -fomit-frame-pointer
Link: x86_64-pc-linux-gnu-gcc -DHAVE_LIBSSL -O2 -march=native -pipe
-fomit-frame-pointer -Wl,-O1 -Wl,--as-needed -lssl -lcrypto -lpcre
-lrt ftp-opie.o openssl.o ../lib/libgnu.a
--------------------


reply via email to

[Prev in Thread] Current Thread [Next in Thread]