Incorrect character encoding in received messages

info-gnus-english

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Incorrect character encoding in received messages

From:	Garjola Dindi
Subject:	Incorrect character encoding in received messages
Date:	Wed, 25 Dec 2019 19:13:21 +0100
User-agent:	Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux)

Hi all,

I have recently been having trouble with Gnus decoding some e-mails as
ASCII when actually they should be decoded as unicode.

For instance, in French, the “à” char gets displayed as “\340”.

If I go to «edit mode» with 'gnus-summary-edit-article' and just do C-c
C-c (with no real edit), the message gets displayed correctly.

Another example with the "é" char which appears as 'i' in an HTML
message. Describe char gives me this:
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> > >
>             character: i (displayed as i) (codepoint 105, #o151, #x69)        
>             >
>               charset: ascii (ASCII (ISO646 IRV))                             
>             >
> code point in charset: 0x69                                                   
>             >
>                script: latin                                                  
>             >
>                syntax: w      which means: word                               
>             >
>              category: .:Base, L:Left-to-right (strong), a:ASCII, l:Latin, 
> r:Roman        >
>              to input: type "C-x 8 RET 69" or "C-x 8 RET LATIN SMALL LETTER 
> I"            >
>           buffer code: #x69                                                   
>             >
>             file code: #x69 (encoded by coding system utf-8-unix)             
>             >
>               display: by this font (glyph code)                              
>             >
>     xfthb:-PfEd-DejaVu Sans-normal-normal-normal-*-16-*-*-*-*-0-iso10646-1 
> (#x4C)         >
>                                                                               
>             >
> Character code properties: customize what to show                             
>             >
>   name: LATIN SMALL LETTER I                                                  
>             >
>   general-category: Ll (Letter, Lowercase)                                    
>             >
>   decomposition: (105) ('i')                                                  
>             >
> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >> 
> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> > >

And after 'gnus-summary-edit-article' followed by C-c C-c:

>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> > >
>             character: é (displayed as é) (codepoint 233, #o351, #xe9)        
>                          >
>               charset: unicode (Unicode (ISO10646))                           
>                          >
> code point in charset: 0xE9                                                   
>                          >
>                script: latin                                                  
>                          >
>                syntax: w      which means: word                               
>                          >
>              category: .:Base, L:Left-to-right (strong), c:Chinese, 
> j:Japanese, l:Latin, v:Viet        >
>              to input: type "C-x 8 RET e9" or "C-x 8 RET LATIN SMALL LETTER E 
> WITH ACUTE"              >
>           buffer code: #xC3 #xA9                                              
>                          >
>             file code: #xC3 #xA9 (encoded by coding system utf-8-unix)        
>                          >
>               display: by this font (glyph code)                              
>                          >
>     xfthb:-PfEd-DejaVu Sans-normal-normal-normal-*-16-*-*-*-*-0-iso10646-1 
> (#xAB)                      >
>                                                                               
>                          >
> Character code properties: customize what to show                             
>                          >
>   name: LATIN SMALL LETTER E WITH ACUTE                                       
>                          >
>   old-name: LATIN SMALL LETTER E ACUTE                                        
>                          >
>   general-category: Ll (Letter, Lowercase)                                    
>                          >
>   decomposition: (101 769) ('e' '́')                                          
>                           >
> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >> 
> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> > >

Any idea of what may be happening here?

I am on emacs master, but there is no difference with 26.3.

Thanks!

G.

--

[Prev in Thread]

Current Thread

[Next in Thread]

Incorrect character encoding in received messages, Garjola Dindi <=
- Re: Incorrect character encoding in received messages, Lars Ingebrigtsen, 2019/12/27

Prev by Date: Re: Reverse thread sorting by most recent number
Next by Date: Re: Incorrect character encoding in received messages
Previous by thread: Reverse thread sorting by most recent number
Next by thread: Re: Incorrect character encoding in received messages
Index(es):
- Date
- Thread