lynx-dev
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Lynx-dev] Zero-width space


From: Thomas Dickey
Subject: Re: [Lynx-dev] Zero-width space
Date: Fri, 23 Aug 2024 20:18:40 -0400

On Fri, Aug 23, 2024 at 08:38:11AM -0600, rbell--- via Lynx-dev wrote:
>       Why doesn't this in UCdomap.c:
> 
>       else if (ucs == 0xfeff || (ucs >= 0x200b && ucs <= 0x200f)) { 
>         /*
>          * Zero-width space.
>          */
>         return ucZeroWidth;
> 
> cause 200b (zero-width space, aka 8203) to be undisplayed?
> 
> 
>       It seems to make this in LYCharUtils.c :
> 
>        /*
>                  * Ignore 8204 (zwnj), 8205 (zwj) 8206 (lrm), and 8207 (rlm),
>                  * for now, if we got this far without finding a 
> representation
>                  * for them.
>                  */
>             } else if (code == 8204 || code == 8205 ||
>                        code == 8206 || code == 8207) {
>                 CTRACE((tfp, "LYUCFullyTranslateString: Ignoring '%"
>                         PRI_UCode_t "'.\n", CAST_UCode_t (code)));
>                 replace_buf[0] = '\0';
>                 state = S_got_outstring;
>                 break;
> 
> 
> unnecessary.  And why does it neglect 8203?
> 
>       8203 appears in 
> https://www.newyorker.com/news/the-lede/the-democratic-party-rebrands-itself-before-viewers-eyes

I see U+200B in one place in the HTML file (two occurrences immediately
after a left-double-quote), and in lynx it doesn't use any spacing,
which is what I'd expect for zero-width.

-- 
Thomas E. Dickey <dickey@invisible-island.net>
https://invisible-island.net

Attachment: signature.asc
Description: PGP signature


reply via email to

[Prev in Thread] Current Thread [Next in Thread]