bug-gnu-utils
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: grep single-ascii-char matches Chinese


From: Alain Magloire
Subject: Re: grep single-ascii-char matches Chinese
Date: Sun, 29 Jul 2001 11:31:42 -0400 (EDT)

> 
> I do
> $ grep @ file
> and grep finds all the Chinese characters with '@'
> as one of their bytes, including the very common "yi1" meaning "one".
> Emacs' isearch-forward wouldn't do that, thank goodness.
> 
> $ grep A file, grep B file... same problem.
> 
> I have
> LC_MESSAGES=zh_TW.Big5
> LC_TIME=zh_TW.Big5
> LC_NUMERIC=zh_TW.Big5
> LC_CTYPE=zh_TW.Big5
> LC_MONETARY=zh_TW.Big5
> LC_COLLATE=zh_TW.Big5
> 
> grep (GNU grep) 2.4.2 on Mandrake 7.2
> by the way, I wish grep --version would tell what options grep was
> compiled with.

grep --version is consistent with the other GNU utilities.
If you want this, you will have to push if for _all_ GNU utilities.

> I do
> $ grep '[Chinese chars]' file
> and it finds them ok...
> 
> [Perhaps I discovered a big bug because not often do folks grep for
> single ascii chars?   Or is there some fundamental concept I'm not
> grabbing here as usual?]

The new maintainer of GNU grep will be putting support for multibyte
etc .. which will make GNU grep workable with other languages.

--
alain




reply via email to

[Prev in Thread] Current Thread [Next in Thread]