bug-gnu-emacs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#3687: 23.1.50; inconsistency in multibyte eight-bit regexps [PATCH]


From: Mattias Engdegård
Subject: bug#3687: 23.1.50; inconsistency in multibyte eight-bit regexps [PATCH]
Date: Fri, 28 Jun 2019 17:00:33 +0200

28 juni 2019 kl. 16.40 skrev Eli Zaretskii <eliz@gnu.org>:
> 
> So this means \240 is no longer the same as NBSP and \300 is no longer
> the same as À?  But \176 is still the same as ~?

This has been the case for quite a while; the patch does not change any of this.

> So you are saying that we will consider the raw bytes as if they
> followed ASCII characters in the lexicographical order?  But non-ASCII
> characters whose codepoints start at 0x80? where are they in this
> order?

Again, this is existing semantics and the patch does not change any of it.

It sounds like you misunderstand the patch, which means that I have been bad at 
explaining it. It just fixes a few edge cases related to raw bytes in regexp 
matching. It does not attempt to change existing semantics, other than where 
they are clearly buggy, such as "\x9f" and "[\x9f]" not being equivalent 
regexps.






reply via email to

[Prev in Thread] Current Thread [Next in Thread]