octave-bug-tracker
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Octave-bug-tracker] [bug #57107] regexp functions fail on ISO-8859-1 in


From: Markus Mützel
Subject: [Octave-bug-tracker] [bug #57107] regexp functions fail on ISO-8859-1 input
Date: Mon, 19 Jun 2023 12:37:38 -0400 (EDT)

Follow-up Comment #43, bug #57107 (project octave):


> My question is if the search pattern is purely single-byte patterns (say
ascii only), does regexp input string must be a valid utf-8 string? I don't
see this could ever be a problem. no?

This test containing only ASCII characters in the pattern:

regexp('aäiöü', '[^\w]')


In Octave 4.4.1:

>> test_regexp_utf8

ans =

   2   3   5   6   7   8


In Octave 6.4.0:

>> test_regexp_utf8

ans =

   2   5   7


The old result was wrong (pointing to second bytes of a character). There
should be only three matches like in newer versions of Octave.


    _______________________________________________________

Reply to this item at:

  <https://savannah.gnu.org/bugs/?57107>

_______________________________________________
Message sent via Savannah
https://savannah.gnu.org/




reply via email to

[Prev in Thread] Current Thread [Next in Thread]