From: | Stefan Bidi |
Subject: | Re: Merging CoreBase into Base |
Date: | Mon, 12 Aug 2013 10:56:46 -0500 |
On 12 Aug 2013, at 16:26, Stefan Bidi <address@hidden> wrote:

> (2) I'm working towards making corebase use Unicode (ie UTF-16) internally wherever possible. I believe this is a saner choice than trying to deal with UTF-8.

I find this an odd observation. UTF-16 is a variable-length encoding, so it comes with all of the same pain as UTF-8, but has the disadvantage that it's almost always larger than UTF-8 (most two-byte characters in UTF-8 are also two-byte characters in UTF-16, and ASCII doubles in size). You also start hitting endian issues with UTF-16, whereas UTF-8 is endian-independent. Finally, UTF-8 is the format that you typically want for input or output, as it's well supported by most libc functions, terminals, and so on.
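
A minimal sketch of the size and byte-order points, in Python purely for illustration. The sample strings and the helper name encoded_sizes are assumptions made for this example, not anything from CoreBase. It counts encoded bytes without a BOM; note that the CJK sample is the case where UTF-16 comes out smaller.

```python
# Compare encoded sizes (in bytes, no BOM) for a few hypothetical samples.
samples = {
    "ascii": "hello",        # 5 code points, all below U+0080
    "latin": "\u00e9",       # e with acute accent, precomposed
    "cjk": "\u65e5\u672c",   # two BMP ideographs
    "emoji": "\U0001F600",   # outside the BMP: surrogate pair in UTF-16
}


def encoded_sizes(text):
    """Return (utf8_bytes, utf16_bytes) for text."""
    return len(text.encode("utf-8")), len(text.encode("utf-16-le"))


for name, text in samples.items():
    utf8_len, utf16_len = encoded_sizes(text)
    print(f"{name}: UTF-8 {utf8_len} bytes, UTF-16 {utf16_len} bytes")

# UTF-16 has two possible byte orders for the same text;
# UTF-8 has a single serialisation.
little = "\u00e9".encode("utf-16-le")
big = "\u00e9".encode("utf-16-be")
print(little.hex(), big.hex())
```

The emoji sample takes four bytes in both encodings, which is the variable-length case that UTF-16 code has to handle just as UTF-8 code does.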
David