Dr. Arie Ben David | 3 Jan 2005 07:55
Picon

WEKA Documentation

Hi everyone
I am considering using WEKA as a software tool for an undergraduate course in machine learning (we currently use Clementine). Can you kindly recommend a web site where students can find  theoretical background, updated description, examples, bibliography, etc of  all (or most) models which are currently used in WEKA  (I am not talking about object level details).
Thank you
Happy New Year
Dr. Arie Ben David
 
 
 
_______________________________________________
Wekalist mailing list
Wekalist <at> list.scms.waikato.ac.nz
https://list.scms.waikato.ac.nz/mailman/listinfo/wekalist
ted pedersen | 3 Jan 2005 08:13
Picon
Gravatar

Re: WEKA Documentation


I recommend the book :

Data Mining: Practical Machine Learning Tools and Techniques with Java
Implementations by Ian H. Witten, Eibe Frank
http://www.cs.waikato.ac.nz/~ml/weka/book.html

Yes, I know this is sort of obvious and maybe not what you think you want,
but if you are teaching an undergrad class in machine learning, you really
do want this book. It's great. It's clear, it's concise, and it's even
sort of fun. I routinely refer students who are new to machine learning to
this book and they like it - they can understand it and it doesn't even
cost too much (compared to other Machine Learning books that shall remain
nameless ;) Besides, it's written with Weka in mind. It may not include
all the latest bells and whistles in Weka, but in an undergrad class
you'll probably be dealing with decision trees and Naive Bayesian
classifiers, etc. rather than freaky kernels and the like.

Also, I think Weka is an excellent choice for a classroom tool. It's
stable, easy to use, and has lots of room for growth. So it doesn't limit
very bright or ambitious students, while not being impossible for the more
average ones.

Cordially,
Ted

On Mon, 3 Jan 2005, Dr. Arie Ben David wrote:

> Hi everyone
> I am considering using WEKA as a software tool for an undergraduate course in machine learning (we
currently use Clementine). Can you kindly recommend a web site where students can find  theoretical
background, updated description, examples, bibliography, etc of  all (or most) models which are
currently used in WEKA  (I am not talking about object level details).
> Thank you
> Happy New Year
> Dr. Arie Ben David
>
>
>

--
Ted Pedersen
http://www.d.umn.edu/~tpederse
Leo | 3 Jan 2005 16:15
Picon
Favicon

CrossValidation Training

Thanks for the previous answer, and forgive my english.
 
I used the multilayerperceptron classifier, with 10-fold-crossvalidation. Nine folds should be training sets(training_instances), and 1 the testing set(testing_instances), and this process is made 10 times, with different key_folds(testing fold), right?
 
My questions, about the resuts at the CSV file:
The Percent_Incorrect is the error occurred at the testing_instances?
How do i know the error that occurred at the training_instances? It's possible?
What is the training error at this csv file, after all?
_______________________________________________
Wekalist mailing list
Wekalist <at> list.scms.waikato.ac.nz
https://list.scms.waikato.ac.nz/mailman/listinfo/wekalist
Emanuel Amaral Schimidt | 3 Jan 2005 18:01
Picon
Favicon

Saving results

Hi everyone!

It's my first message on the list, and I'm a new user
of Weka.

I would like to save the trained set, and recover it
again. By this way, I don't have to run the data
everytime.

But...I didn't find how to do this. Can you help me,
please? How can I do that? Or Weka doesn't has this
feature?

Sincerely

Emanuel

__________________________________________________
Converse com seus amigos em tempo real com o Yahoo! Messenger 
http://br.download.yahoo.com/messenger/ 
Emanuel Amaral Schimidt | 3 Jan 2005 20:56
Picon
Favicon

Re: Saving results

I'm using Weka in my code (linking to the algorithms).
I print the results, but I would like to save this
results and reuse it on future minings (then I won't
need to mine de database again).

I hope you understand my poor english. Please, correct
me, than I can learn :-)

Thanks!

Emanuel

 --- David <pythonner <at> gmail.com> escreveu: 
> Hello,
> 
> are you running Weka through the GUI or are you
> playing in the code directly?
> 
> David
> 

	
	
		
_______________________________________________________ 
Yahoo! Acesso Grátis - Instale o discador do Yahoo! agora. http://br.acesso.yahoo.com/ - Internet
rápida e grátis
Grazia Russo-Lassner | 3 Jan 2005 21:25
Picon
Favicon

Re: Saving results


If I am not mistaken, a flag on the command line (-o) allows you to 
specify the output filename.

Grazia

On Mon, 3 Jan 2005, Emanuel Amaral Schimidt wrote:

> I'm using Weka in my code (linking to the algorithms).
> I print the results, but I would like to save this
> results and reuse it on future minings (then I won't
> need to mine de database again).
>
> I hope you understand my poor english. Please, correct
> me, than I can learn :-)
>
> Thanks!
>
> Emanuel
>
> --- David <pythonner <at> gmail.com> escreveu:
>> Hello,
>>
>> are you running Weka through the GUI or are you
>> playing in the code directly?
>>
>> David
>>
>
>
>
>
>
> _______________________________________________________
> Yahoo! Acesso Grátis - Instale o discador do Yahoo! agora. http://br.acesso.yahoo.com/ - Internet
rápida e grátis
>
> _______________________________________________
> Wekalist mailing list
> Wekalist <at> list.scms.waikato.ac.nz
> https://list.scms.waikato.ac.nz/mailman/listinfo/wekalist
>
_______________________________________________
Wekalist mailing list
Wekalist <at> list.scms.waikato.ac.nz
https://list.scms.waikato.ac.nz/mailman/listinfo/wekalist
Grazia Russo-Lassner | 4 Jan 2005 20:16
Picon
Favicon

Re: Saving results


Yes, you can do it.

For instance, in a Perl script
$cmd = "nice /path-to-java/java -cp /path-to-weka.jar/weka.jar -Dfile.encoding=ISO8859-1
weka.classifiers.trees.j48.J48 (or any other classifier you are using) -t train_file -d
output-file-for-model -T test-file > filename-in-which-to-save-statistics";
system($cmd);
With the -d option you are saving your model in a file and you can call it 
when you need it.

look in Chapter 8 (page 296) of Data Mining by Witten and Frank.

Grazia

On Tue, 4 Jan 2005, Emanuel Amaral Schimidt wrote:

> Hello Grazia!
>
> Thanks for your answer!
>
> I would like to know if I can reuse the learned model
> in Weka, tell weka that this is a model that have
> already learned, then I won't need to run the model
> again each time (I use the saved one).
>
> I'm using weka in my own code, so, how could I save
> and  call the saved model by it?
>
> Thanks for your help and time
>
> Emanuel
>
> --- Grazia Russo-Lassner <glassner <at> umiacs.umd.edu>
> escreveu:
>>
>> If I am not mistaken, a flag on the command line
>> (-o) allows you to
>> specify the output filename.
>>
>> Grazia
>>
>>
>> On Mon, 3 Jan 2005, Emanuel Amaral Schimidt wrote:
>>
>>> I'm using Weka in my code (linking to the
>> algorithms).
>>> I print the results, but I would like to save this
>>> results and reuse it on future minings (then I
>> won't
>>> need to mine de database again).
>>>
>>> I hope you understand my poor english. Please,
>> correct
>>> me, than I can learn :-)
>>>
>>> Thanks!
>>>
>>> Emanuel
>>>
>>> --- David <pythonner <at> gmail.com> escreveu:
>>>> Hello,
>>>>
>>>> are you running Weka through the GUI or are you
>>>> playing in the code directly?
>>>>
>>>> David
>>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>
> _______________________________________________________
>>> Yahoo! Acesso Grátis - Instale o discador do
>> Yahoo! agora. http://br.acesso.yahoo.com/ - Internet
>> rápida e grátis
>>>
>>> _______________________________________________
>>> Wekalist mailing list
>>> Wekalist <at> list.scms.waikato.ac.nz
>>>
>>
> https://list.scms.waikato.ac.nz/mailman/listinfo/wekalist
>>>
>
> __________________________________________________
> Converse com seus amigos em tempo real com o Yahoo! Messenger
> http://br.download.yahoo.com/messenger/
>
_______________________________________________
Wekalist mailing list
Wekalist <at> list.scms.waikato.ac.nz
https://list.scms.waikato.ac.nz/mailman/listinfo/wekalist
Eibe Frank | 4 Jan 2005 22:55
Picon
Picon

Re: Wekalist Digest, Vol 22, Issue 24


On Dec 26, 2004, at 11:03 PM, Samatha Kottha wrote:

>> The Resample filters in Weka perform sampling WITH replacement.
>> (Unfortunately the documentation hasn't been very clear about that but
>> we have fixed it recently in CVS.)
>>
>
> I am using weka for trainig and testing for C45, but for SVM I am 
> using LibSVM.
> So, till now I am using the resample to generate the random training 
> set and
> saving it and using that saved set for LibSVM. Does the resample with
> uniformbiastoclass as false(0) also does the replacement. When I 
> checked it, it
> does not chainging the proportion of class that much.

Yes, Resample always does sampling with replacement. So you might end 
up with duplicate instances. However, the per-class proportions should 
be similar to the original data if you set that parameter to zero.

> We have holidays till the first week of January, I will
> send you the data some time in second week.

Great, thanks.

Cheers,
Eibe
Clifford Lyon | 5 Jan 2005 05:15
Picon

Factor Analysis

Hi, is there anyone on the list who has used Weka for factor analysis? 
I want to write a factor analysis class, probably maximum likelihood, 
but don't want to reinvent the wheel.

Thanks for any hints.
Baber Zaman | 5 Jan 2005 15:29
Picon

Text Classification And Weka/Judge

Hello All,

I am quite new in using Weka and need help to classify text.

Can any body help me how can I  use Weka / Judge  for text mining.

I want to perform text mining on large collection of documents. So I
would like to have

Stop words removal
Stemming
TF x IDF weighting
KNN Classification

Can some body provide me some example of using Text Classification.

Secondly I want to build the feature matrix and store it in some way,
and use this feature matrix to classify new documents in future .

Can any body help me in this regard, how can I get the feature matrix
and store it and later build classifier using this matrix.

Thanx in advance.

--

-- 
Baber Zaman
Master Student Software Systems Engineering
Aachen University Of Technology 
Germany.
Phone : (+)49-288-3066118
            (+)49-179-1489662(Handy)

Gmane