[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [PATCH] standards: rewrite section on quoting
From: |
Paolo Bonzini |
Subject: |
Re: [PATCH] standards: rewrite section on quoting |
Date: |
Wed, 21 Dec 2011 12:04:58 +0100 |
User-agent: |
Mozilla/5.0 (X11; Linux x86_64; rv:7.0.1) Gecko/20110930 Thunderbird/7.0.1 |
On 12/21/2011 12:44 AM, Karl Berry wrote:
> It never occurred to me to associate '...' in error messages with the
> British quoting style and "..." with American. I know well that that is
> true typographically, but ASCII and computer messages in general are
> such a limited domain, compared to, say, book printing.
>
> However, I myself don't see any particular problem with the GCS allowing
> either "..." or '...'.
>
> Paolo, since Paul clearly feels strongly about this, can you change the
> wording to allow either one? Then we can see what rms says.
Sure. I just removed any reference to Unicode in the English locale,
since there's no way that it can be right. :)
Paolo
------------------- 8< ------------------------
>From b1e6cfcb360cd3cf8e6b2993fc518746efad4791 Mon Sep 17 00:00:00 2001
From: Paolo Bonzini <address@hidden>
Date: Tue, 20 Dec 2011 09:42:36 +0100
Subject: [PATCH v2] standards: rewrite section on quoting
GCC has been using apostrophes for quotes in the C locale for seven
years, and gnulib is following suit. Describe this in the GNU coding
standards and add instructions about translating quotes and providing
parseable output.
* doc/standards.texi (Quote Characters): Add note about translating
quote characters and providing parseable output. Do not anymore suggest
the grave accent as the opening quote character. Suggest adding an
option such as GNU ls's --quoting-style.
---
doc/standards.texi | 39 +++++++++++++++++++--------------------
1 files changed, 19 insertions(+), 20 deletions(-)
diff --git a/doc/standards.texi b/doc/standards.texi
index 99d76f0..e9099c3 100644
--- a/doc/standards.texi
+++ b/doc/standards.texi
@@ -3274,12 +3274,28 @@ Quote Characters
@cindex quote characters
@cindex locale-specific quote characters
@cindex left quote
address@hidden opening quote
address@hidden single quotes
@cindex grave accent
In the C locale, GNU programs should stick to plain ASCII for quotation
-characters in messages to users: preferably 0x60 (@samp{`}) for left
-quotes and 0x27 (@samp{'}) for right quotes. It is ok, but not
-required, to use locale-specific quotes in other locales.
+characters in messages to users: preferably 0x27 (@samp{'}) for both
+opening and closing quotes.
+
+It is ok, but not required, to use locale-specific quotes in other
+locales. Remember however that you should never use Unicode in the
+C locale.
+
+If you support internationalization, translators should be able to provide
+their own quote characters. By convention, the string @samp{"`"} will
+translate to the opening quote and the string @samp{"'"} will translate
+to the closing quote.
+
+If the output of your program is ever likely to be parsed by another
+program, provide an option that makes this parsing reliable. For example
+you could escape special characters using conventions from the C language
+or the Bourne shell. See for example the option @option{--quoting-style}
+of GNU @code{ls}.
The @uref{http://www.gnu.org/software/gnulib/, Gnulib} @code{quote} and
@code{quotearg} modules provide a reasonably straightforward way to
@@ -3287,23 +3303,6 @@ Quote Characters
other issues, such as quoting a filename that itself contains a quote
character. See the Gnulib documentation for usage details.
-In any case, the documentation for your program should clearly specify
-how it does quoting, if different than the preferred method of @samp{`}
-and @samp{'}. This is especially important if the output of your
-program is ever likely to be parsed by another program.
-
-Quotation characters are a difficult area in the computing world at
-this time: there are no true left or right quote characters in Latin1;
-the @samp{`} character we use was standardized there as a grave
-accent. Moreover, Latin1 is still not universally usable.
-
-Unicode contains the unambiguous quote characters required, and its
-common encoding UTF-8 is upward compatible with Latin1. However,
-Unicode and UTF-8 are not universally well-supported, either.
-
-This may change over the next few years, and then we will revisit
-this.
-
@node Mmap
@section Mmap
--
1.7.7.1
- [PATCH] standards: rewrite section on quoting, Paolo Bonzini, 2011/12/20
- Re: [PATCH] standards: rewrite section on quoting, Paul Eggert, 2011/12/20
- Re: [PATCH] standards: rewrite section on quoting, Paolo Bonzini, 2011/12/20
- Re: [PATCH] standards: rewrite section on quoting, Karl Berry, 2011/12/20
- Re: [PATCH] standards: rewrite section on quoting,
Paolo Bonzini <=
- Re: [PATCH] standards: rewrite section on quoting, Paul Eggert, 2011/12/22
- Re: [PATCH] standards: rewrite section on quoting, Karl Berry, 2011/12/22
- Re: [PATCH] standards: rewrite section on quoting, Paul Eggert, 2011/12/23
- Re: [PATCH] standards: rewrite section on quoting, Paul Smith, 2011/12/23
- Re: [PATCH] standards: rewrite section on quoting, Paolo Bonzini, 2011/12/23
- Re: [PATCH] standards: rewrite section on quoting, Paul Eggert, 2011/12/23
- Re: [PATCH] standards: rewrite section on quoting, Paolo Bonzini, 2011/12/23
- Re: [PATCH] standards: rewrite section on quoting, Paul Eggert, 2011/12/23
- Re: [PATCH] standards: rewrite section on quoting, Karl Berry, 2011/12/23
- Re: [PATCH] standards: rewrite section on quoting, Paul Eggert, 2011/12/24