[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Org markup and non-ASCII punctuation (was: org parser and priorities
From: |
Ihor Radchenko |
Subject: |
Re: Org markup and non-ASCII punctuation (was: org parser and priorities of inline elements) |
Date: |
Tue, 18 Jul 2023 09:45:32 +0000 |
Tom Gillespie <tgbugs@gmail.com> writes:
>> We might probably generalize to
>> PRE = Zs Zl Pc Pd Ps Pi ' "
>> POST = Zs Zl Pc Pd Pe Pf . ; : ! ? ' " \ [
>
> If this works I think it is reasonable. We might want to
> specify what to do in cases where an org implementation
> might not fully support unicode,
Just fall back to ASCII subset? If the implementation does not support
unicode, it probably cannot properly work with UTF-encoded documents
anyway.
> ...and might want to do a
> review of related issues in syntax with respect to ascii
> vs unicode, because iirc there is some ambiguity in
> the current syntax doc.
> For example, I'm pretty sure that I'm mixing and matching
> unicode and ascii whitespace in the tokenizer I have in Racket.
Feel free to open new bug reports about such ambiguities.
--
Ihor Radchenko // yantar92,
Org mode contributor,
Learn more about Org mode at <https://orgmode.org/>.
Support Org development at <https://liberapay.com/org-mode>,
or support my work at <https://liberapay.com/yantar92>