chicken-janitors
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Chicken-janitors] #345: utf8 regexp bug with underscores


From: Chicken Trac
Subject: [Chicken-janitors] #345: utf8 regexp bug with underscores
Date: Fri, 20 Aug 2010 01:09:29 -0000

#345: utf8 regexp bug with underscores
----------------------------+-----------------------------------------------
 Reporter:  zbigniew        |       Owner:       
     Type:  defect          |      Status:  new  
 Priority:  minor           |   Milestone:  4.6.0
Component:  core libraries  |     Version:  4.5.x
 Keywords:                  |  
----------------------------+-----------------------------------------------
 Only in utf8 mode, regexp seems to have a bug with negated charsets
 containing underscores in either POSIX REs or SREs.  I'm not sure but I
 think any charset with one underscore and one or more other chars is
 affected.

 Irregex has the same problem but only when called with option 'utf8.

 I'm using chicken-experimental 4.5.8 without irregex 0.8, so perhaps this
 bug is fixed in irregex 0.8?  I haven't checked.  I know this occurs back
 to Chicken 4.5.0 at least.
 {{{
 #;> (use utf8)
 #;> (regexp "[^_]")
 #<regexp>
 #;> (regexp "[^a_b]")
 Error: (cddr) bad argument type: ()
 #;> (regexp "[^a_]")
 Error: (cddr) bad argument type: ()
 #;> (regexp "[^_a]")
 Error: (cddr) bad argument type: ()
 #;> (regexp '(~ #\_ #\a))
 Error: (cddr) bad argument type: ()

 #;> (irregex '(~ #\_ #\a))
 #(*irregex-tag* ...)
 #;> (irregex '(~ #\_ #\a) 'utf8)
 Error: (cddr) bad argument type: ()

 }}}

-- 
Ticket URL: <https://bugs.call-cc.org/ticket/345>
Chicken Scheme <http://www.call-with-current-continuation.org/>
Chicken Scheme is a compiler for the Scheme programming language.

reply via email to

[Prev in Thread] Current Thread [Next in Thread]