[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Chicken-janitors] #345: utf8 regexp bug with underscores
From: |
Chicken Trac |
Subject: |
[Chicken-janitors] #345: utf8 regexp bug with underscores |
Date: |
Fri, 20 Aug 2010 01:09:29 -0000 |
#345: utf8 regexp bug with underscores
----------------------------+-----------------------------------------------
Reporter: zbigniew | Owner:
Type: defect | Status: new
Priority: minor | Milestone: 4.6.0
Component: core libraries | Version: 4.5.x
Keywords: |
----------------------------+-----------------------------------------------
Only in utf8 mode, regexp seems to have a bug with negated charsets
containing underscores in either POSIX REs or SREs. I'm not sure but I
think any charset with one underscore and one or more other chars is
affected.
Irregex has the same problem but only when called with option 'utf8.
I'm using chicken-experimental 4.5.8 without irregex 0.8, so perhaps this
bug is fixed in irregex 0.8? I haven't checked. I know this occurs back
to Chicken 4.5.0 at least.
{{{
#;> (use utf8)
#;> (regexp "[^_]")
#<regexp>
#;> (regexp "[^a_b]")
Error: (cddr) bad argument type: ()
#;> (regexp "[^a_]")
Error: (cddr) bad argument type: ()
#;> (regexp "[^_a]")
Error: (cddr) bad argument type: ()
#;> (regexp '(~ #\_ #\a))
Error: (cddr) bad argument type: ()
#;> (irregex '(~ #\_ #\a))
#(*irregex-tag* ...)
#;> (irregex '(~ #\_ #\a) 'utf8)
Error: (cddr) bad argument type: ()
}}}
--
Ticket URL: <https://bugs.call-cc.org/ticket/345>
Chicken Scheme <http://www.call-with-current-continuation.org/>
Chicken Scheme is a compiler for the Scheme programming language.
- [Chicken-janitors] #345: utf8 regexp bug with underscores,
Chicken Trac <=