lynx-dev
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: lynx-dev hyphenation


From: Vlad Harchev
Subject: Re: lynx-dev hyphenation
Date: Thu, 29 Jul 1999 23:13:41 +0500 (SAMST)

On Thu, 29 Jul 1999, Klaus Peter Wegge wrote:

> > 1) how to get information about the language of the current html file (based
> > on the charset name of the current document or user setups).
> Most specs in german site are wrong. I tried to use this mechanism
> for choosing the right speech synthesizer for reading the site to a
> multitasking user. I think the wrong specs come with the common usage
> of generators for html-files, which are not configured very well.
> I think, it's the same for other languages.
> A collegue of mine played arround with a small word statistic tool:
> very fast, heuristic and good detection for a lot of language.
> As I remember implementation was done in about 500 lines pascal.
> If you are interested I'll give you more details.

 Please provide the details about word statistic tool (how big dictionary
files does it need, is there an URL for this tool, is it OpenSource, does it 
handle multiply charsets for a given language...). 
 And seems that we need a mapping from charset name to language name (if
mapping in strict sense is possible, ie the given charset name is used for
encoding only one language) - otherwise the user will have to select right
language for current document manually.


  
> Klaus
> 

 Best regards,
  -Vlad


reply via email to

[Prev in Thread] Current Thread [Next in Thread]