[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte
From: |
Mike Miller |
Subject: |
[Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters |
Date: |
Mon, 29 Jul 2019 12:36:06 -0400 (EDT) |
User-agent: |
Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/75.0.3770.142 Safari/537.36 |
Follow-up Comment #16, bug #35910 (project octave):
It's similar, but not bug #50409, because in that bug report you are clearly
pointing to something being initialized in the Windows terminal emulator.
I guess the behavior I'm seeing can be grouped into two bugs / questions:
* This change to regular expressions seems to require a UTF-8 locale be
active, is this simply a call to 'setlocale'? And where should this be done
for it to work correctly in octave-cli?
* If the above is correct, what should we do in cases where a user is running
Octave in a non-UTF-8 locale, for example LC_ALL=C? Should we only
conditionally add the "(*UTF8)" prefix to regular expressions when the locale
will support it? Or should we just let the error message I've shown happen?
_______________________________________________________
Reply to this item at:
<https://savannah.gnu.org/bugs/?35910>
_______________________________________________
Message sent via Savannah
https://savannah.gnu.org/
- [Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters, Markus Mützel, 2019/07/21
- [Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters, Mike Miller, 2019/07/21
- [Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters, Markus Mützel, 2019/07/22
- [Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters, Rik, 2019/07/22
- [Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters, Mike Miller, 2019/07/22
- [Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters, Andrew Janke, 2019/07/27
- [Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters, Mike Miller, 2019/07/28
- [Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters, Mike Miller, 2019/07/28
- [Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters, Markus Mützel, 2019/07/29
- [Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters,
Mike Miller <=
- [Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters, Mike Miller, 2019/07/29
- [Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters, Mike Miller, 2019/07/31
- [Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters, Rik, 2019/07/31