Re: u32_normalize UNINORM_NFKC on 0xD800
From: Simon Josefsson
Subject: Re: u32_normalize UNINORM_NFKC on 0xD800
Date: Fri, 27 May 2011 20:13:25 +0200
User-agent: Gnus/5.110018 (No Gnus v0.18) Emacs/23.2 (gnu/linux)
Bruno Haible <address@hidden> writes:
> Simon Josefsson wrote:
>> I'm calculating this IDNA2008 property
>>
>> toNFKC(toCaseFold(toNFKC(cp))) != cp
>>
>> for all code points.
>
> It makes no sense to consider non-character code points here. Citing again
> the Unicode standard, chapter 3 [1], section 3.8:
>
> "High-surrogate and low-surrogate code units are used only in the context
> of the UTF-16 character encoding form."

It seems Mark Davis believes toNFKC should be defined for all code points:

http://www.alvestrand.no/pipermail/idna-update/2011-May/007106.html

The issue turned out to be irrelevant for me, so I don't care strongly
either way.
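
For illustration only, here is a minimal sketch of the property quoted
above, written in Python rather than with the libunistring calls this
thread is about. It assumes that unicodedata.normalize("NFKC", ...) and
str.casefold() are acceptable stand-ins for toNFKC and toCaseFold; the
function names to_nfkc and is_unstable are hypothetical, and the results
depend on the Unicode version bundled with the Python build, so they may
differ from tables computed against another Unicode version.

```python
import unicodedata


def to_nfkc(text: str) -> str:
    """Stand-in for toNFKC (assumption: Python's NFKC is acceptable here)."""
    return unicodedata.normalize("NFKC", text)


def is_unstable(cp: int) -> bool:
    """Evaluate toNFKC(toCaseFold(toNFKC(cp))) != cp for one code point.

    Python strings may hold lone surrogates such as U+D800, so the
    expression can be evaluated for them too; whether it *should* be
    defined there is exactly the question discussed in this thread.
    """
    original = chr(cp)
    return to_nfkc(to_nfkc(original).casefold()) != original


if __name__ == "__main__":
    # A few sample code points: uppercase A, lowercase a,
    # the "fi" ligature, and the first high surrogate.
    for cp in (0x0041, 0x0061, 0xFB01, 0xD800):
        print(f"U+{cp:04X}: {is_unstable(cp)}")
```

In this sketch a surrogate simply passes through both steps unchanged,
which is one possible reading of "defined for all code points"; a
library may just as reasonably reject it as invalid input.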
/Simon