bug-gnulib
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: diacrit: mark deprecated


From: Bruno Haible
Subject: Re: diacrit: mark deprecated
Date: Wed, 17 Jul 2019 23:31:01 +0200
User-agent: KMail/5.1.3 (Linux/4.4.0-154-generic; KDE/5.18.0; x86_64; ; )

Bernhard Voelker wrote:
> In GNU coreutils, we now get this warning during bootstrap:
> 
>   Notice from module diacrit:
>     This module is deprecated. Use the module 
> 'uninorm/canonical-decomposition' instead.
> 
> And indeed, the 'diacrit' module is still in use by 1 source:
> 
>   $ GIT_PAGER= git grep -En 'todiac|tobase'
>   src/ptx.c:1053:          diacritic = todiac (character);
>   src/ptx.c:1056:              base = tobase (character);
>   src/ptx.c:1338:        edited_flag[character] = todiac (character) != 0;

Indeed, 'ptx' does not yet support multibyte locales.

$ echo "Böse Bübchen" | ptx -
                                       Böse Bübchen
                               Böse   Bübchen
                           Böse Bü   bchen
                                 Bö   se Bübchen

It looks even weirder with Greek input:

$ echo "Το τέλος του Ψυχρού Πολέμου και η διάλυση της Σοβιετικής Ένωσης άφησαν 
τις Ηνωμένες Πολιτείες για ένα διάστημα ως τη μόνη υπερδύναμη." | ptx -

(No output at all!)

Bruno

reply via email to

[Prev in Thread] Current Thread [Next in Thread]