bug#16168: uniq mis-handles UTF8 (8bit) characters
From: Shlomo Urbach
Subject: bug#16168: uniq mis-handles UTF8 (8bit) characters
Date: Mon, 16 Dec 2013 15:50:15 +0200
Lines containing CJK letters are deemed equal whenever they have the same
length; the characters themselves seem to be ignored in the comparison.
I understand this is due to the locale.
Still, it would be nice if a simple flag did a locale-free comparison
(i.e. two lines are equal only if all their bytes are equal).
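
To make the requested behaviour concrete, here is a minimal sketch of a byte-wise comparison, written in Python rather than taken from uniq itself. The function name uniq_bytes and the sample data are made up for illustration. Lines are treated as raw bytes, so no locale or character decoding is involved, and only adjacent duplicates are collapsed.

```python
from itertools import groupby


def uniq_bytes(lines):
    """Yield the first line of each run of adjacent, byte-identical lines.

    `lines` is an iterable of bytes objects. The trailing newline is
    ignored when comparing, so a final line without a newline still
    matches the identical line before it.
    """
    for _, run in groupby(lines, key=lambda line: line.rstrip(b"\n")):
        yield next(run)


# Hypothetical sample: three one-character CJK lines of equal byte length,
# encoded as UTF-8. Only the last two are actually identical.
sample = [
    "中\n".encode("utf-8"),
    "文\n".encode("utf-8"),
    "文\n".encode("utf-8"),
]

for line in uniq_bytes(sample):
    print(line)
```

Until such a flag exists, running uniq under the C locale (for example, LC_ALL=C uniq) should give the same byte-wise result, since the C locale compares lines as plain bytes.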