[Octave-bug-tracker] [bug #57107] regexp functions fail on ISO-8859-1 in
From: Markus Mützel
Subject: [Octave-bug-tracker] [bug #57107] regexp functions fail on ISO-8859-1 input
Date: Mon, 19 Jun 2023 12:37:38 -0400 (EDT)
Follow-up Comment #43, bug #57107 (project octave):
> My question is: if the search pattern consists purely of single-byte
> characters (say, ASCII only), must the regexp input string be a valid UTF-8
> string? I don't see how this could ever be a problem. No?
Consider this test, which contains only ASCII characters in the pattern:
regexp('aäiöü', '[^\w]')
In Octave 4.4.1:
>> test_regexp_utf8
ans =
2 3 5 6 7 8
In Octave 6.4.0:
>> test_regexp_utf8
ans =
2 5 7
The old result was wrong: indices 3, 6, and 8 point to the second byte of a
two-byte UTF-8 character (ä, ö, and ü respectively). There should be only
three matches, one per non-ASCII character, as in newer versions of Octave.
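
The two index lists can be reproduced outside Octave. The following is a minimal sketch in Python, used here only as a stand-in: it is not how either Octave version implements regexp, and the variable names (`byte_hits`, `char_hits`) are made up for illustration. It assumes the input is stored as UTF-8 and that `\w` covers ASCII word characters only.

```python
import re

# "aäiöü", written with escapes so the example does not depend on the
# encoding of this source file.
text = "a\u00e4i\u00f6\u00fc"
raw = text.encode("utf-8")  # 8 bytes: a, C3 A4, i, C3 B6, C3 BC

# Byte-wise matching: every byte is treated as its own "character".
# With a bytes pattern, \w is ASCII-only, so all six non-ASCII bytes
# match [^\w], including the second byte of each two-byte sequence.
byte_hits = [m.start() + 1 for m in re.finditer(rb"[^\w]", raw)]

# Character-wise matching: match on decoded characters (ASCII-only \w),
# then report the 1-based byte index at which each matched character starts.
char_hits = [
    len(text[: m.start()].encode("utf-8")) + 1
    for m in re.finditer(r"[^\w]", text, flags=re.ASCII)
]

print(byte_hits)  # [2, 3, 5, 6, 7, 8]
print(char_hits)  # [2, 5, 7]
```

The first list corresponds to the Octave 4.4.1 output and the second to the Octave 6.4.0 output, which shows that an ASCII-only pattern is still affected by whether the input is interpreted as UTF-8.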
_______________________________________________________
Reply to this item at:
<https://savannah.gnu.org/bugs/?57107>