[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Aramorph-users] A contribution for AraMorph
From: |
Ahmed El-dawy |
Subject: |
Re: [Aramorph-users] A contribution for AraMorph |
Date: |
Mon, 13 Jun 2005 10:18:51 +0300 |
Yes, I got your idea. You can work with the previous changes that
makes the startup just fine.
For the SolutionsHandler, I don't think it is very useful to enhance
it unless it is really a bottle neck.
Expect an incoming patch for the ArabicTokenizer that will use a range
set (I will make it soon) for recognizing Arabic letters instead of a
long list of if statements.
My next step is to translate the dictionaries into Arabic instead of
the translitered format. I think I saw this in the TODO list. If I
succeded in this, we will not have to romanize the words before
running the Tim Buckwalter algorithm.
After that, I will try to make a JDBC dictionary handler. For this, I
will use IBM CloudScape database. It is open source and also very
simple.
On 6/12/05, Pierrick Brihaye <address@hidden> wrote:
> Hi,
>
> Ahmed El-dawy a écrit :
>
> > Now I have made my old changes over the latest version.
>
> Fine :-)
>
> > I attached the patch file you told me about.
>
> It looks OK as such.
>
> > I have made more changes that will make it start more faster.
>
> See below...
>
> > I don't know if the latest changes are acceptable or not.
>
> This list is dedicated to discussion ;-)
>
> > I have changed the format of the dictionary and compatiblity files.
>
> Mmmh... I'd prefer to keep the compatibility with Buckwalter's
> dictionaries. Anyway, if you have a small patch taken from one of your
> modified dictionaries...
>
> > First, I loaded them using the ordinary method from the text files.
> > After that I wrote them to binary files using the java serializable
> > format. In order to accomplish this, I had to implement the interface
> > serializable in the Dictionary Entry class. Also I changed some of its
> > attributes to transient to keep the binary files small.
>
> My idea was rather to have a dedicated interface to access a binary
> format for the dictionaries. Currently, we have an
> InMemoryDictionaryHandler ; I would like it to implement a
> DictionaryHandler interface and maybe to override an AbstractDirectory
> class.
>
> Then, we could easily design a JDBCDictionaryHandler, a
> LuceneDictionaryHandler and - why not ? - a SerializedDictionaryHandled.
>
> I urge you to consider this kind of design that would have a small
> impact on existing classes (do remember that small patches mean early
> commits and conversely).
>
> We could have the same design for the InMemorySolutionsHandler class,
> although it is probably less useful.
>
> What's your mind ?
>
> Cheers,
>
> p.b.
>
>
> _______________________________________________
> Aramorph-users mailing list
> address@hidden
> http://lists.nongnu.org/mailman/listinfo/aramorph-users
>
--
Regards,
Ahmed Saad
- Re: [Aramorph-users] A contribution for AraMorph, Ahmed El-dawy, 2005/06/05
- Re: [Aramorph-users] A contribution for AraMorph, Pierrick Brihaye, 2005/06/05
- Re: [Aramorph-users] A contribution for AraMorph, Ahmed El-dawy, 2005/06/08
- Re: [Aramorph-users] A contribution for AraMorph, Pierrick Brihaye, 2005/06/08
- Re: [Aramorph-users] A contribution for AraMorph, Ahmed El-dawy, 2005/06/12
- Re: [Aramorph-users] A contribution for AraMorph, Pierrick Brihaye, 2005/06/12
- Re: [Aramorph-users] A contribution for AraMorph,
Ahmed El-dawy <=
- Re: [Aramorph-users] A contribution for AraMorph, Pierrick Brihaye, 2005/06/13
- Re: [Aramorph-users] A contribution for AraMorph, Ahmed El-dawy, 2005/06/13
- Re: [Aramorph-users] A contribution for AraMorph, Pierrick Brihaye, 2005/06/13