bug-gnu-utils
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

major gawk bug


From: Stanislav Ievlev
Subject: major gawk bug
Date: Tue, 8 Jun 2004 15:51:19 +0400

Hello friends!

Why gawk uses setlocale(), but have a hardcoded table  (const char casetable[] )
for case-independent regexp matching?

This table is correct for latin1 charset only, but incorrect for others,
e.g. for KOI8-R (russian).

KOI8-R encoding is fully compatible with 7-bit ASCII (so gawk compiles well),
but has other symbols for codes greater then 128.

So gawk supports only latin1, but ignore cp1251,koi8-r,koi8-u, etc.

As I understand, it's not a problem to fill this table with locale
specific symbols at start.

--
With best regards
Stanislav Ievlev

ALT Linux Team.





reply via email to

[Prev in Thread] Current Thread [Next in Thread]