aramorph-users
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Aramorph-users] A contribution for AraMorph


From: Ahmed El-dawy
Subject: Re: [Aramorph-users] A contribution for AraMorph
Date: Mon, 13 Jun 2005 10:18:51 +0300

Yes, I got your idea. You can work with the previous changes that
makes the startup just fine.
For the SolutionsHandler, I don't think it is very useful to enhance
it unless it is really a bottle neck.

Expect an incoming patch for the ArabicTokenizer that will use a range
set (I will make it soon) for recognizing Arabic letters instead of a
long list of if statements.

My next step is to translate the dictionaries into Arabic instead of
the translitered format. I think I saw this in the TODO list. If I
succeded in this, we will not have to romanize the words before
running the Tim Buckwalter algorithm.

After that, I will try to make a JDBC dictionary handler. For this, I
will use IBM CloudScape database. It is open source and also very
simple.

On 6/12/05, Pierrick Brihaye <address@hidden> wrote:
> Hi,
> 
> Ahmed El-dawy a écrit :
> 
> > Now I have made my old changes over the latest version.
> 
> Fine :-)
> 
> > I attached the patch file you told me about.
> 
> It looks OK as such.
> 
> > I have made more changes that will make it start more faster.
> 
> See below...
> 
> > I don't know if the latest changes are acceptable or not.
> 
> This list is dedicated to discussion ;-)
> 
> > I have changed the format of the dictionary and compatiblity files.
> 
> Mmmh... I'd prefer to keep the compatibility with Buckwalter's
> dictionaries. Anyway, if you have a small patch taken from one of your
> modified dictionaries...
> 
> > First, I loaded them using the ordinary method from the text files.
> > After that I wrote them to binary files using the java serializable
> > format. In order to accomplish this, I had to implement the interface
> > serializable in the Dictionary Entry class. Also I changed some of its
> > attributes to transient to keep the binary files small.
> 
> My idea was rather to have a dedicated interface to access a binary
> format for the dictionaries. Currently, we have an
> InMemoryDictionaryHandler ; I would like it to implement a
> DictionaryHandler interface and maybe to override an AbstractDirectory
> class.
> 
> Then, we could easily design a JDBCDictionaryHandler, a
> LuceneDictionaryHandler and - why not ? - a SerializedDictionaryHandled.
> 
> I urge you to consider this kind of design that would have a small
> impact on existing classes (do remember that small patches mean early
> commits and conversely).
> 
> We could have the same design for the InMemorySolutionsHandler class,
> although it is probably less useful.
> 
> What's your mind ?
> 
> Cheers,
> 
> p.b.
> 
> 
> _______________________________________________
> Aramorph-users mailing list
> address@hidden
> http://lists.nongnu.org/mailman/listinfo/aramorph-users
> 


-- 
Regards,
Ahmed Saad




reply via email to

[Prev in Thread] Current Thread [Next in Thread]