Anuj Gupta | 2 Jan 06:47
Picon

How can I use UIMA for Text Mining Project.

Hello All,

 

I want to use UIMA in my new project which is text mining.

I jus simple download all required things like UIMA SDK. Configure it in my Eclipse. And try to run given examples.

Some examples are working fine but some are not.

Then I also am creating a very small application.

But I am getting some Errors. Please see below for more details.

 

1. While saving Analysis Engine Descriptor file.

 

 

 

2. While try to test my Annotator by Document Analyzer.

 

 

Can any body please help me on this?

As I want to use UIMA in text mining. So please give me more inputs so that I can use it more efficiently.

 

Thanks in Advance. J

 

Regards,

Anuj Kumar Gupta | Software Engineer | Persistent Systems

anuj_kgupta-BI6C26+1BusSpmxQ0ygm9Q@public.gmane.org  |

Innovation in software product design, development and delivery- www.persistentsys.com

 

DISCLAIMER ========== This e-mail may contain privileged and confidential information which is the property of Persistent Systems Ltd. It is intended only for the use of the individual or entity to which it is addressed. If you are not the intended recipient, you are not authorized to read, retain, copy, print, distribute or use this message. If you have received this communication in error, please notify the sender and delete all copies of this message. Persistent Systems Ltd. does not accept any liability for virus infected mails.

Tong Fin | 4 Jan 17:17
Picon

Re: How can I use UIMA for Text Mining Project.

Hi,
It looks like there is a problem with the text of your post since I cannot
see the error texts that you try to get helps.

Please post your message one more time (a plain text of message is the
best).

-- Tong

On Fri, Jan 2, 2009 at 12:47 AM, Anuj Gupta <anuj_kgupta@...>wrote:

>  Hello All,
>
>
>
> I want to use UIMA in my new project which is *text mining*.
>
> I jus simple download all required things like UIMA SDK. Configure it in my
> Eclipse. And try to run given examples.
>
> Some examples are working fine but some are not.
>
> Then I also am creating a very small application.
>
> But I am getting some Errors. Please see below for more details.
>
>
>
> 1. While saving Analysis Engine Descriptor file.
>
>
>
>
>
>
>
> 2. While try to test my Annotator by Document Analyzer.
>
>
>
>
>
> Can any body please help me on this?
>
> As I want to use UIMA in text mining. So please give me more inputs so that
> I can use it more efficiently.
>
>
>
> Thanks in Advance. J
>
>
>
> *Regards,*
>
> *Anuj Kumar Gupta **|** Software Engineer **|** Persistent Systems*
>
> *anuj_kgupta@...**  **|*
>
> *Innovation in software product design, development and delivery-* *
> www.persistentsys.com* <http://www.persistentsys.com/>
>
>
>
> DISCLAIMER ========== This e-mail may contain privileged and confidential
> information which is the property of Persistent Systems Ltd. It is intended
> only for the use of the individual or entity to which it is addressed. If
> you are not the intended recipient, you are not authorized to read, retain,
> copy, print, distribute or use this message. If you have received this
> communication in error, please notify the sender and delete all copies of
> this message. Persistent Systems Ltd. does not accept any liability for
> virus infected mails.
>

--

-- 
 Tong
Anuj Gupta | 5 Jan 07:25
Picon

UIMA related Issues

Hello All,

 

I want to use UIMA in my new project which is text mining.

I jus simple download all required things like UIMA SDK. Configure it in my Eclipse. And try to run given examples.

Some examples are working fine but some are not.

Then I also am creating a very small application.

But I am getting some Errors. Please see below for more details.

 

1. While saving Analysis Engine Descriptor file.

 

 

 

2. While try to test my Annotator by Document Analyzer.

 

 

Can any body please help me on this?

As I want to use UIMA in text mining. So please give me more inputs so that I can use it more efficiently.

 

Thanks in Advance. J

 

Regards,

Anuj Kumar Gupta | Software Engineer | Persistent Systems

anuj_kgupta-BI6C26+1BusSpmxQ0ygm9Q@public.gmane.org  |

Innovation in software product design, development and delivery- www.persistentsys.com

 

 

DISCLAIMER ========== This e-mail may contain privileged and confidential information which is the property of Persistent Systems Ltd. It is intended only for the use of the individual or entity to which it is addressed. If you are not the intended recipient, you are not authorized to read, retain, copy, print, distribute or use this message. If you have received this communication in error, please notify the sender and delete all copies of this message. Persistent Systems Ltd. does not accept any liability for virus infected mails.

Marshall Schor | 5 Jan 14:36

Re: UIMA related Issues

Hi Anuj -

The images in the email do not survive the internal processing done by
the email hosting system here.

I did manage to see the original images, but others cannot.  It would be
good if you can in the future copy and paste the message texts from the
images, and just include the text, so others can read it.

The problem appears to be that you are running some software examples
for UIMA that go with a much older version of UIMA (one that existed
before UIMA became an open source project at Apache).

The error message says the file in the location:
C:\Program
Files\apache-uima\docs\examples\descriptors\tutorial\ex3\TutorialDateTime.xml
has a line in it that contains:

"com.ibm.uima.java"

This file in the current download of UIMA from Apache (release 2.2.2)
doesn't have that. So, it appears that you obtained a version of UIMA
that is corrupt.  Please delete that version and download it again.

-Marshall

Anuj Gupta wrote:
>
> Hello All,
>
>  
>
> I want to use UIMA in my new project which is *text mining*.
>
> I jus simple download all required things like UIMA SDK. Configure it
> in my Eclipse. And try to run given examples.
>
> Some examples are working fine but some are not.
>
> Then I also am creating a very small application.
>
> But I am getting some Errors. Please see below for more details.
>
>  
>
> 1. While saving Analysis Engine Descriptor file.
>
>  
>
>  
>
>  
>
> 2. While try to test my Annotator by Document Analyzer.
>
>  
>
>  
>
> Can any body please help me on this?
>
> As I want to use UIMA in text mining. So please give me more inputs so
> that I can use it more efficiently.
>
>  
>
> Thanks in Advance. J
>
>  
>
> *Regards,*
>
> *Anuj Kumar Gupta **|** Software Engineer **|** Persistent Systems*
>
> *anuj_kgupta@...
<mailto:anuj_kgupta@...>** 
> **|*
>
> *Innovation in software product design, development and delivery-*
> *www.persistentsys.com* <http://www.persistentsys.com/>
>
>  
>
>  
>
> DISCLAIMER ========== This e-mail may contain privileged and
> confidential information which is the property of Persistent Systems
> Ltd. It is intended only for the use of the individual or entity to
> which it is addressed. If you are not the intended recipient, you are
> not authorized to read, retain, copy, print, distribute or use this
> message. If you have received this communication in error, please
> notify the sender and delete all copies of this message. Persistent
> Systems Ltd. does not accept any liability for virus infected mails.
>

Marshall Schor | 5 Jan 18:00

Re: Anyone got UIMA to work on Vista 64bit?

Dan figured out he needed to change the default heap space for Java. 
See his post on this on the uima wiki, here:
http://cwiki.apache.org/UIMA/compiling-and-running-under-microsoft-vista-64bit.html

-Marshall

Marshall Schor wrote:
> Dan McCreary wrote:
>   
>> Hello UIMA people,
>>
>> First of all, a warm "thanks!" to all involved.  UIMA is a wonderful vision
>> and I hope to help out making it easy for everyone else to use.
>>
>> My quetion is: has anyone got UIMA to work on a 64-bit Vista system?
>>
>> I have attempted to do this but I can not seem to get UIMA running on VISTA
>> with no luck yet.  First I tried the binary downloads for Windows.  When I
>> attempt launch tools such as documentAnalyzer.bat the JVM crashes and Vista
>> creates very large binary dump in your ~home/AppData/Local/Temp folder.  
>>     
> What error messages accompany the JVM crash?  One approach may be to
> take the most interesting terms from that, add "java" "crash" "vista"
> and "64", and put them into Google :-)  and see what comes up.
>   
>> The
>> program adjustExamplePaths.bat seems to work however.
>>
>> Next I tried to download all the source from subversion using Eclipse and
>> the standard subclipse plugin.  When it go to the DOS shell and type "mvn
>> clean install" I get many sucessful complies and many successful test runs
>> but then I get a "BUILD FAILURE".  I have gone into the
>> uimaj-core\core\target\surefile-reports as the directions indicate but when
>> I grep through the results all the files have a failure="0" in them.
>>   
>>     
> The maven install runs maven on many projects. Are you sure the
> uimaj-core project is the one where the failure was?
>   
>> I am somewhat new to UIMA and Mavin so I apologize for my lack of
>> understanding of how UIMA and Mavin works.
>>   
>>     
> No problem - keep posting details and asking questions :-)
>   
>> My versions are:
>> Java: 1.6.0_10
>> Eclipse: 3.4.1
>> Mavin: 2.0.9
>>
>> Thanks - Dan
>>
>> PS - I have a friend with a 4 year-old Mac notebook that got UIMA running in
>> about 30 minutes....
>>
>>   
>>     
>
>
>   

Dan McCreary | 5 Jan 18:23
Picon

Re: Anyone got UIMA to work on Vista 64bit?

Hi Marshall,

I am starting to really like the UIMA framework.  A beautiful architecture.
We just got a demo of the POS analyzer running last week.

I would like to blog on UIMA on OReilly.com.  Are you available for a short
interview?  Either over the phone or perhaps a chat-based Q and A?

- Dan

Dan McCreary
Senior Enterprise Data Architecture and Strategy Consulting
(952) 931-9198
cell: (612) 986-1552
dan@...
http://www.danmccreary.com
Marshall Schor | 5 Jan 20:50

Re: Anyone got UIMA to work on Vista 64bit?

Sure, happy to chat - I'll contact you off the list.

-Marshall

Dan McCreary wrote:
> Hi Marshall,
>
> I am starting to really like the UIMA framework.  A beautiful architecture.
> We just got a demo of the POS analyzer running last week.
>
> I would like to blog on UIMA on OReilly.com.  Are you available for a short
> interview?  Either over the phone or perhaps a chat-based Q and A?
>
> - Dan
>
> Dan McCreary
> Senior Enterprise Data Architecture and Strategy Consulting
> (952) 931-9198
> cell: (612) 986-1552
> dan@...
> http://www.danmccreary.com
>
>   

Marshall Schor | 5 Jan 21:56

Re: P2P UIMA


Yosi Mass wrote:
> Hi,
>
> I would like to suggest a scale-out of UIMA by enabling it to run in a P2P
> environment.
>
> >From my understanding, the CPE is a 1st generation scaleout, and it can run
> a distributed pipeline using vinci/soap but the machines involved in the
> pipeline are predefined in the UIMA descriptors.
>
> The 2nd generation scaleout is called UIMA-AS (AS = Asynchronous Scaleout),
> and is based on some Java and web standards, such as JMS (Java Messaging
> Service).  It is now officially released on Apache UIMA.  This allows users
> to selectively choose which parts of their pipeline to run in this mode,
> which in turn allows scaling out individual parts of the pipeline, as
> needed. Again there is no dynamic discovery of resources after startup.
>   
Hmm, I think this may not be quite accurate.  In UIMA-AS, connections
are made using a JMS infrastructure, such as ActiveMQ.  Each service has
an associated "address" in the network space, made up of a Broker URL
and Port.

The actual service implementation is done by 1 or more servers that
register themselves with the Broker URL and Port.  During a run, servers
can be dynamically added or removed; this changes the "capacity" of the
service.  Of course, if all of the servers for a particular service are
removed, then the service "fails". 

But maybe what is meant, is, rather, the ability of the system to
recognize when a service becomes available, rather than merely changing
its capacity.  For instance, in the UIMA-AS case, this could mean
several kinds of things:

1) allowing a service to be configured with 0 servers available at startup

2) having the flow controller "know" more explicitly about service
"availablilty", for instance, the number of "servers" there might be for
a particular service.  Here, the idea would be that a flow controller
could dynamically decide, based on what the service level of different
steps in the pipeline were, how to "route" a CAS for a particular aggregate.

Are these the kinds of function that are desired?
> I would like to suggest a 3rd generation scaleout using a fully
> decentralized P2P network. Assume that each peer can publish its
> capabilities (namely which annotators it can run) and its current
> availability, then we may extend UIMA/UIMA-AS pipeline to discover an
> available and capable peer for running an annotator and thus achieve better
> load balancing and thus better performance than previous generations.
>   
The "publication" would need to include the type system of the
annotators, and some notion of which annotators would ever "want" to be
run together in a pipeline, because a key part of the UIMA design is the
"merging" of type systems to allow interoperability among the parts.

Is there a "reservation" idea here too?  For instance, in an open
environment, where there are lots of clients and services and servers
for those services, a particular client might want to reserve some
amount of processing capability for itself, (not necessarily all of the
capability).

Finally, I wonder -- are there systems / infrastructure / middleware
already out there that do this kind of thing that we could perhaps
easily adapt / adopt for these purposes?

-Marshall
> What people on the list think about this?
>
> Thanks, Yosi
>
>
>
>
>
>   

Anuj Kumar Gupta | 6 Jan 05:50
Picon

Re: UIMA related Issues

Hello Marshall-

 Thanks a lot for reply. J

As I am a very beginner in UIMA so I need some more help.

So can you please give me your GTalk or IM Id. So that I can communicate you
easily.

Marshall I want to know some queries as how can we use UIMA in text mining
Project?

Can we extract information from any Database rather than any Documents?

Can we do classification of that data?

Can we do Co-referencing by UIMA?

GATE is a Part of UIMA or UIMA is a part of GATE?

My aim would be like this flow.

*Fetch **à** Classify **à** Extraction **à** Sentiment **à** Display
(Charts/Reports)*

Waiting for your reply

Thanks in Advance. J

On Mon, Jan 5, 2009 at 7:06 PM, Marshall Schor <msa <at> schor.com> wrote:

> Hi Anuj -
>
> The images in the email do not survive the internal processing done by
> the email hosting system here.
>
> I did manage to see the original images, but others cannot.  It would be
> good if you can in the future copy and paste the message texts from the
> images, and just include the text, so others can read it.
>
> The problem appears to be that you are running some software examples
> for UIMA that go with a much older version of UIMA (one that existed
> before UIMA became an open source project at Apache).
>
> The error message says the file in the location:
> C:\Program
>
> Files\apache-uima\docs\examples\descriptors\tutorial\ex3\TutorialDateTime.xml
> has a line in it that contains:
>
> "com.ibm.uima.java"
>
> This file in the current download of UIMA from Apache (release 2.2.2)
> doesn't have that. So, it appears that you obtained a version of UIMA
> that is corrupt.  Please delete that version and download it again.
>
> -Marshall
>
>
>
> Anuj Gupta wrote:
> >
> > Hello All,
> >
> >
> >
> > I want to use UIMA in my new project which is *text mining*.
> >
> > I jus simple download all required things like UIMA SDK. Configure it
> > in my Eclipse. And try to run given examples.
> >
> > Some examples are working fine but some are not.
> >
> > Then I also am creating a very small application.
> >
> > But I am getting some Errors. Please see below for more details.
> >
> >
> >
> > 1. While saving Analysis Engine Descriptor file.
> >
> >
> >
> >
> >
> >
> >
> > 2. While try to test my Annotator by Document Analyzer.
> >
> >
> >
> >
> >
> > Can any body please help me on this?
> >
> > As I want to use UIMA in text mining. So please give me more inputs so
> > that I can use it more efficiently.
> >
> >
> >
> > Thanks in Advance. J
> >
> >
> >
> > *Regards,*
> >
> > *Anuj Kumar Gupta **|** Software Engineer **|** Persistent Systems*
> >
> > *anuj_kgupta <at> persistent.co.in <mailto:anuj_kgupta <at> persistent.co.in>**
> > **|*
> >
> > *Innovation in software product design, development and delivery-*
> > *www.persistentsys.com* <http://www.persistentsys.com/>
>  >
> >
> >
> >
> >
> > DISCLAIMER ========== This e-mail may contain privileged and
> > confidential information which is the property of Persistent Systems
> > Ltd. It is intended only for the use of the individual or entity to
> > which it is addressed. If you are not the intended recipient, you are
> > not authorized to read, retain, copy, print, distribute or use this
> > message. If you have received this communication in error, please
> > notify the sender and delete all copies of this message. Persistent
> > Systems Ltd. does not accept any liability for virus infected mails.
> >
>
Anuj Kumar Gupta | 6 Jan 05:55
Picon

Re: UIMA related Issues

Hello Marshall-

Can you please let me know.
From where I can download the latest version of UIMA as well as examples.
as http://incubator.apache.org/uima/downloads.cgi
from this link I am not getting any working file. :(

Thanks anuj.

On Mon, Jan 5, 2009 at 7:06 PM, Marshall Schor <msa@...> wrote:

> Hi Anuj -
>
> The images in the email do not survive the internal processing done by
> the email hosting system here.
>
> I did manage to see the original images, but others cannot.  It would be
> good if you can in the future copy and paste the message texts from the
> images, and just include the text, so others can read it.
>
> The problem appears to be that you are running some software examples
> for UIMA that go with a much older version of UIMA (one that existed
> before UIMA became an open source project at Apache).
>
> The error message says the file in the location:
> C:\Program
>
> Files\apache-uima\docs\examples\descriptors\tutorial\ex3\TutorialDateTime.xml
> has a line in it that contains:
>
> "com.ibm.uima.java"
>
> This file in the current download of UIMA from Apache (release 2.2.2)
> doesn't have that. So, it appears that you obtained a version of UIMA
> that is corrupt.  Please delete that version and download it again.
>
> -Marshall
>
>
>
> Anuj Gupta wrote:
> >
> > Hello All,
> >
> >
> >
> > I want to use UIMA in my new project which is *text mining*.
> >
> > I jus simple download all required things like UIMA SDK. Configure it
> > in my Eclipse. And try to run given examples.
> >
> > Some examples are working fine but some are not.
> >
> > Then I also am creating a very small application.
> >
> > But I am getting some Errors. Please see below for more details.
> >
> >
> >
> > 1. While saving Analysis Engine Descriptor file.
> >
> >
> >
> >
> >
> >
> >
> > 2. While try to test my Annotator by Document Analyzer.
> >
> >
> >
> >
> >
> > Can any body please help me on this?
> >
> > As I want to use UIMA in text mining. So please give me more inputs so
> > that I can use it more efficiently.
> >
> >
> >
> > Thanks in Advance. J
> >
> >
> >
> > *Regards,*
> >
> > *Anuj Kumar Gupta **|** Software Engineer **|** Persistent Systems*
> >
> > *anuj_kgupta@... <mailto:anuj_kgupta@...>**
> > **|*
> >
> > *Innovation in software product design, development and delivery-*
> > *www.persistentsys.com* <http://www.persistentsys.com/>
>  >
> >
> >
> >
> >
> > DISCLAIMER ========== This e-mail may contain privileged and
> > confidential information which is the property of Persistent Systems
> > Ltd. It is intended only for the use of the individual or entity to
> > which it is addressed. If you are not the intended recipient, you are
> > not authorized to read, retain, copy, print, distribute or use this
> > message. If you have received this communication in error, please
> > notify the sender and delete all copies of this message. Persistent
> > Systems Ltd. does not accept any liability for virus infected mails.
> >
>

Gmane