[MIT-Scheme-devel] UTF-8 sequences
From: Matt Birkholz
Subject: [MIT-Scheme-devel] UTF-8 sequences
Date: Wed, 18 Feb 2015 18:24:33 -0700
> From: Matt Birkholz <address@hidden>
> Date: Wed, 18 Feb 2015 14:52:48 -0700
>
> [...] if 0244 was not slashified, why was 0237? Smells like a bug.
Sorry, I was smelling something else. :-o What is it about the Send
key that makes the lightbulb go on over my head? :-}
0244 (octal; 0xA4) is the ISO-8859-1 (nee Latin-1, nee ASCII) "generic
currency sign" -- a graphical character. 0237 (octal; 0x9F) is undefined,
non-graphical, and therefore slashified.
https://en.wikipedia.org/wiki/ISO/IEC_8859-1
So I think we're OK.
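For anyone who wants to check the two octal values without squinting at a code chart, here is a minimal sketch in Python (not MIT Scheme). The helper name `describe` is made up for illustration. It assumes that "graphical" lines up with Unicode general categories that do not start with "C", which is a guess at the criterion and not a statement about what the MIT Scheme printer actually tests. Note also that Python's latin-1 codec maps 0x80-0x9F onto the C1 control code points, whereas ISO/IEC 8859-1 itself leaves that range undefined.

```python
import unicodedata

def describe(octal_text):
    """Return (code point, Unicode general category, is_graphic) for an octal byte."""
    code = int(octal_text, 8)
    # ISO-8859-1 bytes map one-to-one onto the first 256 Unicode code points.
    char = bytes([code]).decode("latin-1")
    category = unicodedata.category(char)
    # Assumption: categories starting with "C" (control, unassigned, ...) are
    # treated as non-graphic; everything else counts as graphic.
    return code, category, not category.startswith("C")

for text in ("0244", "0237"):
    code, category, graphic = describe(text)
    print(f"{text} = 0x{code:02X}, category {category}, graphic: {graphic}")
```

Under those assumptions 0244 comes out as a currency symbol (category Sc) and 0237 as a control character (category Cc), which matches one being printed as-is and the other being slashified.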
I'm probably NOT supposed to say "Latin-1" or even "ASCII". Our
documentation mentions only "ISO-8859-1".
If you are looking for something to fix, the documentation also
says this:
@strong{Note}: Although character objects can represent all of Unicode,
the model of alphabetic case used covers only @acronym{ASCII} letters,
which means that case-insensitive comparisons and case conversions are
incorrect for non-@acronym{ASCII} letters.  This will eventually be
fixed.
You are forewarned.
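To make the documented limitation concrete, here is a rough sketch in Python (again, not MIT Scheme) of what an ASCII-only case model implies. The helpers `ascii_only_downcase` and `ascii_only_ci_equal` are hypothetical stand-ins written for this message; they are not the MIT Scheme procedures and only imitate the behavior the note describes.

```python
def ascii_only_downcase(text):
    """Lowercase only A-Z, leaving every other character untouched (hypothetical helper)."""
    return "".join(
        chr(ord(ch) + 32) if "A" <= ch <= "Z" else ch
        for ch in text
    )

def ascii_only_ci_equal(a, b):
    """Case-insensitive comparison under the ASCII-only case model."""
    return ascii_only_downcase(a) == ascii_only_downcase(b)

upper = "\u00c9T\u00c9"   # "ETE" with acute accents on both E's, upper case
lower = "\u00e9t\u00e9"   # the same word in lower case

print(ascii_only_ci_equal("SCHEME", "scheme"))  # ASCII letters compare equal
print(ascii_only_ci_equal(upper, lower))        # accented letters are not folded
print(upper.lower() == lower)                   # full Unicode lowercasing, for contrast
```

The first comparison succeeds, the second fails because only the "T" gets folded, and the third shows what a Unicode-aware conversion would give instead.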