Re: [Lynx-dev] Zero-width space
From: Thomas Dickey
Subject: Re: [Lynx-dev] Zero-width space
Date: Fri, 23 Aug 2024 20:18:40 -0400
On Fri, Aug 23, 2024 at 08:38:11AM -0600, rbell--- via Lynx-dev wrote:
> Why doesn't this in UCdomap.c:
>
> else if (ucs == 0xfeff || (ucs >= 0x200b && ucs <= 0x200f)) {
> /*
> * Zero-width space.
> */
> return ucZeroWidth;
>
> cause 200b (zero-width space, aka 8203) to be undisplayed?
>
>
> It seems to make this in LYCharUtils.c :
>
> /*
> * Ignore 8204 (zwnj), 8205 (zwj) 8206 (lrm), and 8207 (rlm),
> * for now, if we got this far without finding a representation
> * for them.
> */
> } else if (code == 8204 || code == 8205 ||
> code == 8206 || code == 8207) {
> CTRACE((tfp, "LYUCFullyTranslateString: Ignoring '%"
> PRI_UCode_t "'.\n", CAST_UCode_t (code)));
> replace_buf[0] = '\0';
> state = S_got_outstring;
> break;
>
>
> unnecessary. And why does it neglect 8203?
>
> 8203 appears in
> https://www.newyorker.com/news/the-lede/the-democratic-party-rebrands-itself-before-viewers-eyes
I see U+200B in one place in the HTML file (two occurrences immediately
after a left-double-quote), and in lynx it doesn't use any spacing,
which is what I'd expect for zero-width.
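
To compare the two quoted conditions side by side, here is a minimal
sketch. It is not code from the lynx tree: the function names are
hypothetical, and only the comparisons themselves are copied from the
quoted snippets. It shows that 8203 (0x200b) falls inside the range
tested in the UCdomap.c excerpt but is not among the four codes listed
in the LYCharUtils.c excerpt.

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * Hypothetical helper (not a lynx function): mirrors the condition
 * quoted from UCdomap.c, i.e. U+FEFF or anything in U+200B..U+200F.
 */
static bool is_zero_width(uint32_t ucs)
{
    return ucs == 0xfeff || (ucs >= 0x200b && ucs <= 0x200f);
}

/*
 * Hypothetical helper (not a lynx function): mirrors the condition
 * quoted from LYCharUtils.c, which names 8204 (zwnj), 8205 (zwj),
 * 8206 (lrm) and 8207 (rlm), but not 8203.
 */
static bool is_ignored_in_translate(uint32_t code)
{
    return code == 8204 || code == 8205 ||
           code == 8206 || code == 8207;
}
```

Calling is_zero_width(8203) returns true, while
is_ignored_in_translate(8203) returns false, which is the asymmetry the
question is about. Whether either path is reached for a given document
depends on the surrounding lynx code, which this sketch does not model.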
--
Thomas E. Dickey <dickey@invisible-island.net>
https://invisible-island.net