Anuj Kumar Gupta | 2 Feb 08:53

Sentiment Analysis.

Hi All-

I need to build a Sentiment Analysis component.

The flow is: input doc -> sentence splitting -> score lookup. There is a
table in the database with some words and a score for each. For every
sentence I need to check whether any of those words is present; if so, look
up its score in the database and add up a total score over all the
sentences in the doc.

Please suggest how I can read the values from the database and pass them in
as input.

What else do I need to do?

Thanks

Anuj
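A minimal Java sketch of the flow described above (split into sentences, look each word up, add the scores). It is only an illustration under assumptions: the Map stands in for the database table, the names LexiconScorer, splitSentences, scoreSentence and scoreDocument are hypothetical, the sample words and scores are made up, and the JDK's BreakIterator is used in place of whatever sentence splitter the real pipeline would have.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch: the Map stands in for the word/score database table.
class LexiconScorer {

    // Split a document into sentences using the JDK's BreakIterator.
    static List<String> splitSentences(String doc) {
        List<String> sentences = new ArrayList<>();
        BreakIterator it = BreakIterator.getSentenceInstance(Locale.ENGLISH);
        it.setText(doc);
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String sentence = doc.substring(start, end).trim();
            if (!sentence.isEmpty()) {
                sentences.add(sentence);
            }
        }
        return sentences;
    }

    // Sum the scores of all lexicon words that occur in one sentence.
    static double scoreSentence(String sentence, Map<String, Double> lexicon) {
        double total = 0.0;
        // Lower-case the sentence and split on anything that is not a letter.
        for (String token : sentence.toLowerCase(Locale.ENGLISH).split("[^a-z]+")) {
            total += lexicon.getOrDefault(token, 0.0);
        }
        return total;
    }

    // Add up the sentence scores for the whole document.
    static double scoreDocument(String doc, Map<String, Double> lexicon) {
        double total = 0.0;
        for (String sentence : splitSentences(doc)) {
            total += scoreSentence(sentence, lexicon);
        }
        return total;
    }

    public static void main(String[] args) {
        // Made-up sample entries; real values would come from the table.
        Map<String, Double> lexicon = Map.of("good", 0.5, "bad", 0.3);
        System.out.println(scoreDocument("Arnold is a good person. He is an actor.", lexicon));
    }
}
```

In a real setup the map would presumably be loaded once from the table (for example over JDBC) before any documents are processed, rather than queried once per word.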
Anuj Kumar Gupta | 2 Feb 10:08

Pass Multiple Docs.

How can we pass multiple docs as an input in UIMA?
Jörn Kottmann | 2 Feb 10:19

Re: Pass Multiple Docs.

> How can we pass multiple docs as an input in UIMA?

UIMA obtains documents, e.g. from a database, through a Collection Reader.
In your case this means that you have to implement your own Collection
Reader which can access your specific database. The UIMA examples contain a
sample which shows how to implement a Collection Reader that reads documents
from the file system. Have a look at the code.

You can find more about Collection Reader in the UIMA documentation:
http://incubator.apache.org/uima/downloads/releaseDocs/2.2.2-incubating/docs/html/tutorials_and_users_guides/tutorials_and_users_guides.html#ugr.tug.cpe.collection_reader.developing

Hope this helps,
Jörn
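
To illustrate the idea of feeding many documents one at a time, here is a small plain-Java sketch of the pull-style contract a reader follows. It is not the UIMA API and does not extend any UIMA class: RowDocumentSource and getDelivered are hypothetical names, and the List of strings stands in for rows fetched from a database. In an actual UIMA Collection Reader, getNext receives a CAS and the document text is set on it instead of being returned.

```java
import java.util.Iterator;
import java.util.List;

// Hypothetical stand-in for a Collection Reader; NOT a UIMA class.
class RowDocumentSource {
    private final Iterator<String> rows; // stands in for a database result set
    private int delivered = 0;           // number of documents handed out so far

    RowDocumentSource(List<String> rows) {
        this.rows = rows.iterator();
    }

    // True while there are documents left to process.
    boolean hasNext() {
        return rows.hasNext();
    }

    // Hand out the next document text; throws NoSuchElementException when exhausted.
    String getNext() {
        String text = rows.next();
        delivered++;
        return text;
    }

    // Simple progress counter.
    int getDelivered() {
        return delivered;
    }

    public static void main(String[] args) {
        RowDocumentSource source = new RowDocumentSource(List.of("First document.", "Second document."));
        // The driver pulls documents one by one until the source is empty.
        while (source.hasNext()) {
            System.out.println(source.getNext());
        }
    }
}
```

The point of the pattern is that the analysis code never sees the whole collection; it only asks for the next document until none are left.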

Thilo Goetz | 2 Feb 10:21

Re: Pass Multiple Docs.

Anuj Kumar Gupta wrote:
> How can we pass multiple docs as an input in UIMA?

Anuj, I think you would really benefit from working
through some of the UIMA documentation.  Take a week
or so, it'll be worth it in the end.

I believe the question you really want to ask is:
how can I create and drive a UIMA app programmatically?
That's described here:  http://tinyurl.com/aysawn
If that was not your question, you may want to give
more details.

--Thilo

Anuj Kumar Gupta | 2 Feb 10:25

Re: Pass Multiple Docs.

Thilo-

I am working on a text mining project in which I need to create several
components: Classifier, POS tagging, Coreferencing, Sentiment Analysis,
Negation Handling, and Aggregation Handling.

Classifier -> Classify the input data according to some given words.
POS tagging -> Add POS tags to that data.
Coreferencing -> Suppose there is a text like "Arnold is a good
person. He is an actor." Then "He" in the 2nd sentence would corefer to
Arnold.
Sentiment Analysis -> There would be a given list of words with points for
each, like 'good .5, bad .3 , ass .6'.
Sentiment Analysis would find out whether any words in the text match that
list, and show the results and a score for the sentence accordingly. For
the sentence above the score would be .5

Negation Handling -> "Arnold is a not good person. He is an actor." Then the
score would be -.5

Aggregation Handling -> Show the aggregated score.

Something like this.

I want to use UIMA and GATE for this project, and I am at a very early
stage.
So please help me as much as possible.

If anyone has something similar to these components, please share it with me.
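
The negation and aggregation steps described above could be sketched in plain Java as follows. This is a rough illustration under assumptions, not an established algorithm: the rule "a negation word flips the sign of the next lexicon word in the same sentence" is a simplification chosen here, the cue list contains only "not" (the one word the example uses), and NegationScorer, scoreSentence and aggregate are hypothetical names.

```java
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch of negation handling plus aggregation.
class NegationScorer {
    // Assumed negation cues; the example in the thread only uses "not".
    static final Set<String> NEGATIONS = Set.of("not");

    // Score one sentence; a negation cue flips the sign of the next lexicon word.
    static double scoreSentence(String sentence, Map<String, Double> lexicon) {
        double total = 0.0;
        boolean negated = false;
        for (String token : sentence.toLowerCase(Locale.ENGLISH).split("[^a-z]+")) {
            if (NEGATIONS.contains(token)) {
                negated = true;
                continue;
            }
            Double score = lexicon.get(token);
            if (score != null) {
                total += negated ? -score : score;
                negated = false; // the cue applies to one lexicon word only
            }
        }
        return total;
    }

    // Aggregation: add up the scores of all sentences.
    static double aggregate(String[] sentences, Map<String, Double> lexicon) {
        double total = 0.0;
        for (String sentence : sentences) {
            total += scoreSentence(sentence, lexicon);
        }
        return total;
    }

    public static void main(String[] args) {
        Map<String, Double> lexicon = Map.of("good", 0.5, "bad", 0.3);
        String[] sentences = {"Arnold is a not good person.", "He is an actor."};
        System.out.println(aggregate(sentences, lexicon));
    }
}
```

A rule this simple will misjudge many real sentences (for example negation that applies to a later clause), so it is best read as a starting point for experiments.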

Sharma, Kishor | 2 Feb 10:34

RE: Pass Multiple Docs.

Hi Anuj,
I was also trying to develop a text mining system using UIMA but couldn't succeed; I used the vector space model
of data mining for my project.
If you are able to do it, please let me know your approach.

Thanks,
Kishor 

-----Original Message-----
From: Anuj Kumar Gupta [mailto:virgoanuj@...] 
Sent: Monday, February 02, 2009 2:56 PM
To: uima-user@...
Subject: Re: Pass Multiple Docs.

[...]

Thilo Goetz | 2 Feb 10:39

Re: Pass Multiple Docs.

Anuj Kumar Gupta wrote:
> Thilo-
> 
> I am working on a text Mining Project in which I need to create some
> component like Classifier, POS tagging , Co referencing, Sentiment Analysis,
> Negation Handling, Aggregation Handling.
[...]

Yes, I think you have your hands full.  That's
a lot of work.  I don't know what you need this
for, but there are companies out there making
money with that kind of analysis.

You'll find POS tagging in the UIMA sandbox,
and there are any number of open source classifiers
out there.  Coreference resolution and sentiment
analysis is another matter.  I don't know of any
open source components for those, but maybe
someone else does.

--Thilo

Thilo Goetz | 2 Feb 10:40

Re: Pass Multiple Docs.

Sharma, Kishor wrote:
> Hi Anuj,
> I was also trying to develop a text mining system using UIMA but couldn't succeed; I used the vector space model
> of data mining for my project.
> If you are able to do it, please let me know your approach.

Check out ClearTK: http://code.google.com/p/cleartk/

--Thilo

Jörn Kottmann | 2 Feb 10:46

Re: Pass Multiple Docs.

> You'll find POS tagging in the UIMA sandbox,
> and there are any number of open source classifiers
> out there.  Coreference resolution and sentiment
> analysis is another matter.  I don't know of any
> open source components for those, but maybe
> someone else does.

OpenNLP provides most of the tools you need. It has a sentence
detector, tokenizer, POS tagger, chunker, parser and coreference
resolution.

It is currently licensed under the LGPL, but this will be changed to the
ASL for the next major release to ease integration with Apache projects.

You can find more information about it at the project website:

http://opennlp.sourceforge.net/

Jörn
Anuj Kumar Gupta | 2 Feb 10:46

Re: Pass Multiple Docs.

I have already checked out the UIMA Sandbox annotators,
but I am not able to run the POS tagger.

Can you please let me know the process?

On Mon, Feb 2, 2009 at 3:09 PM, Thilo Goetz <twgoetz@...> wrote:

> Anuj Kumar Gupta wrote:
> > Thilo-
> >
> > I am working on a text Mining Project in which I need to create some
> > component like Classifier, POS tagging , Co referencing, Sentiment Analysis,
> > Negation Handling, Aggregation Handling.
> [...]
>
> Yes, I think you have your hands full.  That's
> a lot of work.  I don't know what you need this
> for, but there are companies out there making
> money with that kind of analysis.
>
> You'll find POS tagging in the UIMA sandbox,
> and there are any number of open source classifiers
> out there.  Coreference resolution and sentiment
> analysis is another matter.  I don't know of any
> open source components for those, but maybe
> someone else does.
>
> --Thilo
>