[Ifile-discuss] naive bayes algorithm in ifile?

From: C. Fischer
Subject: [Ifile-discuss] naive bayes algorithm in ifile?
Date: Mon, 07 Mar 2005 12:47:21 +0100
User-agent: Gnus/5.110003 (No Gnus v0.3) Emacs/22.0.50 (berkeley-unix)

another idea i'm toying with is writing a portable, standard-prolog
implementation of naive bayes for text classification of email and usenet
messages.  the free prologs have improved a lot over the years, and i want to
know whether a prolog implementation would be fast enough for this.

given n categories, with t[i] tokens and m[i] messages in category i (i in
{1..n}), and for every token a record (age, c[i]; i in {1..n}), could somebody
please give a simple, english description of the algorithm needed to classify
a message?  i also need to understand how token ageing can be used to keep the
database small, so that it contains only the tokens that contribute the most
to classification and drops the rest.
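
to make the question concrete, here is a minimal python sketch of how i
picture a textbook multinomial naive bayes classifier using these counts.  it
is not necessarily what ifile does.  assumptions: c[i] is the number of times
the token occurred in category i, t[i] is the total number of token
occurrences in category i, add-one (laplace) smoothing is used, and the
ageing rule in prune() with its max_age and min_count parameters is purely
hypothetical.

```python
import math

def classify(tokens, db, t, m):
    """Return the index of the most probable category for a message.

    tokens : list of token strings taken from the message
    db     : dict token -> (age, c), c[i] = count of the token in category i
    t      : list, t[i] = total token occurrences in category i
    m      : list, m[i] = number of messages in category i
    """
    n = len(t)
    vocab = len(db)                # number of distinct tokens in the database
    total_msgs = sum(m)
    best, best_score = None, -math.inf
    for i in range(n):
        # smoothed log prior: share of training messages in category i
        score = math.log((m[i] + 1) / (total_msgs + n))
        for tok in tokens:
            _, c = db.get(tok, (0, [0] * n))
            # add-one smoothing avoids log(0) for tokens unseen in category i
            score += math.log((c[i] + 1) / (t[i] + vocab))
        if score > best_score:
            best, best_score = i, score
    return best

def prune(db, max_age, min_count):
    """Hypothetical ageing rule: drop tokens that are old and still rare."""
    return {tok: (age, c) for tok, (age, c) in db.items()
            if age < max_age or sum(c) >= min_count}
```

summing logarithms instead of multiplying probabilities keeps long messages
from underflowing, and is the reason the sketch needs floating point.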

do i really need floating point operations or can i get away with integer
arithmetic?  could rational numbers be a better solution?
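
as a rough illustration of the rational-number option, the sketch below
scores each category with exact fractions instead of floating-point
logarithms.  it is written in python only for brevity and assumes the same
hypothetical layout as above: db maps a token to (age, c) with c[i] the
token's count in category i, and add-one smoothing is applied.

```python
from fractions import Fraction

def score_exact(tokens, db, t, m, i):
    """Exact (unnormalised) naive bayes score of category i as a rational."""
    n = len(t)
    vocab = len(db)                # number of distinct tokens in the database
    # smoothed prior: share of training messages in category i
    score = Fraction(m[i] + 1, sum(m) + n)
    for tok in tokens:
        _, c = db.get(tok, (0, [0] * n))
        # smoothed probability of the token given category i
        score *= Fraction(c[i] + 1, t[i] + vocab)
    return score

def classify_exact(tokens, db, t, m):
    """Pick the category with the largest exact score (first one on ties)."""
    return max(range(len(t)), key=lambda i: score_exact(tokens, db, t, m, i))
```

exact rationals cannot underflow, but the numerator and denominator grow
with every token in the message, so this presumably trades speed and memory
for exactness.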

