James Baker | 24 Jul 14:16 2014
Picon

Passing additional parameters through to CPE components

Is it possible to provide additional configuration parameters in a CPE
descriptor XML file that aren't specified in the annotator/collection
reader descriptor XML file?

I have a collection reader that accepts the classname of a class to use to
do the content extraction as a parameter. This works fine, but I'd like to
be able to pass additional parameters to the content extractor via the XML.
The parameters will be dependant on the content extractor though, so I
can't specify them in the collection reader descriptor. For example,
ContentExtractor1 might need a parameter 'encoding', and ContentExtractor2
might need a parameter 'baseUrl'.

I have been able to achieve this with UimaFIT by creating the collection
reader without the XML and injecting the parameters, but when I try and do
it from the XML file the parameters don't make it through to my content
extractor (I pass the UimaContext object through to the content extractor).
I suspect they might be being ignored by UIMA because they aren't in the
descriptor. How can I work around this?

Thanks,
James
James Baker | 24 Jul 14:19 2014
Picon

Passing additional parameters through to CPE components

Is it possible to provide additional configuration parameters in a CPE
descriptor XML file that aren't specified in the annotator/collection
reader descriptor XML file?

I have a collection reader that accepts the classname of a class to use to
do the content extraction as a parameter. This works fine, but I'd like to
be able to pass additional parameters to the content extractor via the XML.
The parameters will be dependant on the content extractor though, so I
can't specify them in the collection reader descriptor. For example,
ContentExtractor1 might need a parameter 'encoding', and ContentExtractor2
might need a parameter 'baseUrl'.

I have been able to achieve this with UimaFIT by creating the collection
reader without the XML and injecting the parameters, but when I try and do
it from the XML file the parameters don't make it through to my content
extractor (I pass the UimaContext object through to the content extractor).
I suspect they might be being ignored by UIMA because they aren't in the
descriptor. How can I work around this?

Thanks,
James
Dr. Armin Wegner | 24 Jul 09:53 2014

DKpro StanfordNamedEntityRecognizer ClassCastException

Hi!

I tried to do the introductory example for DKpro. But DKpro's
StanfordNamedEntityRecognizer throws a ClassCastException. Any ideas?
I had to replace some code snippets with [...].

Thanks
Armin

Jul 24, 2014 9:27:42 AM
de.tudarmstadt.ukp.dkpro.core.api.resources.ResourceObjectProviderBase
loadResource
INFO: Producing resource from
[jar:file:[...].jar!/de/tudarmstadt/ukp/dkpro/core/stanfordnlp/lib/ner-en-all.3class.distsim.crf.ser.gz]
redirected from
[jar:file:[...].jar!/de/tudarmstadt/ukp/dkpro/core/stanfordnlp/lib/ner-en-all.3class.distsim.crf.properties]
Jul 24, 2014 9:27:45 AM
de.tudarmstadt.ukp.dkpro.core.api.resources.ResourceObjectProviderBase
loadResource
INFO: Producing resource from
jar:file:[...]!/de/tudarmstadt/ukp/dkpro/core/stanfordnlp/lib/ner-en-all.3class.distsim.crf.map
Jul 24, 2014 9:27:45 AM
org.apache.uima.analysis_engine.impl.PrimitiveAnalysisEngine_impl
callAnalysisComponentProcess(417)
SEVERE: Exception occurred
org.apache.uima.analysis_engine.AnalysisEngineProcessException:
Annotator processing failed.
	at org.apache.uima.analysis_engine.impl.PrimitiveAnalysisEngine_impl.callAnalysisComponentProcess(PrimitiveAnalysisEngine_impl.java:401)
	at org.apache.uima.analysis_engine.impl.PrimitiveAnalysisEngine_impl.processAndOutputNewCASes(PrimitiveAnalysisEngine_impl.java:309)
	at org.apache.uima.analysis_engine.asb.impl.ASB_impl$AggregateCasIterator.processUntilNextOutputCas(ASB_impl.java:569)
(Continue reading)

Armin.Wegner | 22 Jul 17:05 2014
Picon

uimaFIT - types.txt

Hi,

The final runnable jar contains the META-INF/org.apache.uima.fit/types.txt from a maven dependency
and not from the project itself. Can something be done about this?

Cheers
Armin

Egbert van der Wal | 22 Jul 08:51 2014
Picon

Building UIMA-CPP on (K)Ubuntu 14.04

Hi,

I've been trying to add an annotator in C++ to an Annotation Engine in Java. 
However, building UIMA-CPP is not a trivial task, so it seems.

So far, I've identified dependencies on ActiveMQ-CPP, APR, a Java JRE, Xerces 
and ICU. Maybe there's more key dependencies but those do not appear to be a 
problem on Ubuntu.

libxerces was easy to fix as a compatible version is in the Ubuntu repository.

APR is harder: Ubuntu 14.04 ships with 1.5.x while the configure script checks 
for 1.2, 1.3 or 1.4. Hacking the configure-script to also accept 1.5.x works 
but I didn't get to compiling yet so I don't know about the API-differences 
and if this will work.

ActiveMQ-CPP is not in the Ubuntu repository. I had to locate and built this 
myself, but this actually didn't prove to be so hard.

ICU is harder. The configure script wants a --with-icu path, but then assumes 
other facts. In Ubuntu, icu-config is located in /usr/bin while the header 
filers are located in /usr/include/x86-64-linux-gnu/unicode/. The configure-
script seems to have problems to recognize this difference. I would've thought 
that just having the icu-config script in the path would be sufficient, as 
icu-config spits out the rest of the required information, but this doesn't 
seem to be the case.

Java JRE is also a problem. It wants jni.h but is not able to locate it. 
Ubuntu installs JREs in /usr/lib/jvm/≤name of JVM>/ but specyfing, for 
example, --with-jre=/usr/lib/jvm/java-7-oracle/ or --with-
(Continue reading)

Jaroslaw Cwiklik | 17 Jul 19:51 2014
Picon

[ANNOUNCE] Apache UIMA-AS 2.6.0 released

The Apache UIMA team is pleased to announce the release of version 2.6.0 of
the Apache
UIMA AS, which includes asynchronous scaleout capabilities for the UIMA
annotators.

The UIMA AS 2.6.0 binaries and source are available for download from:
http://uima.apache.org/downloads.cgi

UIMA AS includes the base UIMA SDK and augments it with scaleout
capability; it is a next-generation replacement for the original CPM
(Collection Processing Management) scaleout that is part of the core UIMA
Framework. For more information, please visit:

http://uima.apache.org/doc-uimaas-what.html

This release contains a number of improvements and bug fixes. Notable
updates
in this release include:

- Replaced ActiveMQ version 5.6.0 with version 5.7.0

- Added HTTP support

- Improved error handling and recovery

- Performance improvements

For a complete list of bugs and improvements included in this release
please see
https://uima.apache.org/d/uima-as-2.6.0/issuesFixed/jira-report.html.
(Continue reading)

Markus K. | 10 Jul 12:35 2014
Picon

UIMA CAS Editor question

Hi,

Lately i encountered a problem including the CAS-Editor.
Its more a development question, ill go more into detail:

I wrote a view which is able to display some Annotations of the same Type.
Its intention is to apply a given action to those Annotations one by one. As
soon a user finished the changes of a given Annotation [Problem Start] I
want the CAS editor to Highlight ( or mark whatever it is called when you
mark some text with your mouse) the next Annotation of the given Type BUT it
should not move the cursor ( or scroll) to that Annotation ( User should
scroll by himself) [Problem End] All i found was the selectAndReveal method
but it also moves the cursor to the said Annotation. If anyone knows a
solution to that issue i would be very thankful

Markus K.

Debbie Zhang | 9 Jul 13:41 2014
Picon

Read file name in an annotator

Hi,

Can anyone tell me how to read the file name in an annotator using the
JCas? It seems the DocumentAnnotation does't contain file name. Thank
you!

Best regards,

Debbie Zhang

Manuel Fiorelli | 7 Jul 16:36 2014
Picon

UIMA and OSGi

Dear list,

I need to use UIMA in a project, which uses a runtime platform (Apache
Karaf [1]), which is based on an OSGi container (e.g., Apache Felix [2]).

Unfortunately, the artifacts on Maven Central (both uimaj-core and
uimafit-core) are not OSGi bundles, nor they could be easily migrated to
OSGi, because of the management of the class loader performed by OSGi.

In [3] I found a thoughtful analysis of the problem, and an explanation of
how the Clerezza project [4] managed to integrate UIMA [5].

I don't know the extent to the solution above is bound to Clerezza, or if
the UIMA community has developed a general-purpose solution for the
integration with OSGi.

[1] http://karaf.apache.org/
[2] http://felix.apache.org/
[3]
http://www.osgi.org/wiki/uploads/CommunityEvent2012/Adapting%20Apache%20UIMA%20to%20OSGi-%20Tommaso%20Teofili.pdf
[4] http://clerezza.apache.org/
[5] http://clerezza.apache.org/clerezza-uima/
--

-- 
Manuel Fiorelli
reshu.agarwal | 7 Jul 11:03 2014

Infinte initialization of a process even after restarting DUCC


Hi,

I have faced a problem in DUCC after continuous processing in DUCC, the 
job initialization or ending processing go in to infinite loop. So, a 
new job can not be started even after restarting of DUCC and job is 
showing end of job status internally initialization waiting time is 
increasing.

I have read a issue on JIRA where you have talked about this same 
problem . i.e. https://issues.apache.org/jira/browse/UIMA-3645. 
<https://issues.apache.org/jira/browse/UIMA-3645>

When will you release UIMA DUCC 1.1.0 where you have fixed this issue? 
As we have to restart DUCC after certain period of time.
<https://issues.apache.org/jira/browse/UIMA-3645>

--

-- 
Thanks,
Reshu Agarwal

Pedro Santos | 27 Jun 17:35 2014
Picon

[ANNOUNCE] DKPro Core 1.6.1 released

We are pleased to announce the release of

  DKPro Core, version 1.6.1 (ASL & GPL)

a collection of interoperable software components for natural language processing
(NLP) based on the Apache UIMA framework.

  http://code.google.com/p/dkpro-core-asl
  http://code.google.com/p/dkpro-core-gpl

Changed requirements:
- UIMAJ SDK 2.6.0
- uimaFIT 2.1.0

Major improvements:
- Many writers cannot write to ZIP files
- Better support for reading/writing binary CAS formats

Major bug fixes:
- treetagger - NPE when explicitly specifying a model
- stanfordnlp - StanfordPosTagger not applying PTB3 escaping

A more detailed overview of the changes in this release can be found here

https://code.google.com/p/dkpro-core-asl/issues/list?can=1&q=label:Milestone-1.6.1

When upgrading, please mind that you should not mix different versions of DKPro Core components in your
projects - they may not be compatible with each other. 

-- Pedro Santos, for the DKPro Core development team
(Continue reading)


Gmane