1 Sep 2005 02:23
Re: plese help me...urgent
Ashraf M. Kibriya <amk14 <at> cs.waikato.ac.nz>
2005-09-01 00:23:23 GMT
2005-09-01 00:23:23 GMT
> > >> Hi, I'm a student of a Bicocca University in Milan, Italy and is the >> first time that I use Weka. >> >> I want to create a text classifier for some documents of my University. >> >> I have got only a database of 142 entry both for train and >> test.(multilabel problem with around 100 categories) >> >> I have made the preprocessing phase myself, because my documents are >> written in italian and this language is very difficoult to preprocess >> with a automatic tool. >> >> Now I have built all file in arff format, and I want to ask if there is >> some algorithm that is best for my problem. Now I have used Multiclass >> in order to aggregate my file and a neural network, but it take more >> than two hour to make a model...It's normal? > > Try using Multinomial Naive Bayes or the Complement Naive Bayes, they are generally more well suited for text classification problems. For higher accuracy, you might want to try SVM (i.e. SMO classifier in Weka), although it would take longer to run. Regards, Ashraf
RSS Feed