Picon
Favicon

[jira] Created: (UIMA-1299) Contribution of Lucene CAS Indexer

Contribution of Lucene CAS Indexer
----------------------------------

                 Key: UIMA-1299
                 URL: https://issues.apache.org/jira/browse/UIMA-1299
             Project: UIMA
          Issue Type: New Feature
          Components: Sandbox
            Reporter: Rico Landefeld

Lucas is a UIMA CAS consumer component which writes CAS data into a Lucene index. It is based on a XML-based
"mapping configuration  file" in which the user can determine which UIMA annotations should be put into
which Lucene field, and how this field is set up (e.g. indexed and/or stored). In addition, some basic
functionality for (ontolgical) hypernym indexing is provided.

Additionally, Lucas is able to perform offset-based token stream alignment and merging of UIMA
annotations (via token position increment) in the same Lucene field (e.g. "documenttext" or "title")

--

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Picon
Favicon

[jira] Updated: (UIMA-1299) Contribution of Lucene CAS Indexer


     [
https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rico Landefeld updated UIMA-1299:
---------------------------------

    Attachment: lucene-indexer.tar.gz

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox
>            Reporter: Rico Landefeld
>         Attachments: lucene-indexer.tar.gz
>
>
> Lucas is a UIMA CAS consumer component which writes CAS data into a Lucene index. It is based on a XML-based
"mapping configuration  file" in which the user can determine which UIMA annotations should be put into
which Lucene field, and how this field is set up (e.g. indexed and/or stored). In addition, some basic
functionality for (ontolgical) hypernym indexing is provided.
> Additionally, Lucas is able to perform offset-based token stream alignment and merging of UIMA
annotations (via token position increment) in the same Lucene field (e.g. "documenttext" or "title")

--

-- 
This message is automatically generated by JIRA.
(Continue reading)

Picon
Favicon

[jira] Created: (UIMA-1300) The client API needs to support optional callback listener before sendCas

The client API needs to support optional callback listener before sendCas
-------------------------------------------------------------------------

                 Key: UIMA-1300
                 URL: https://issues.apache.org/jira/browse/UIMA-1300
             Project: UIMA
          Issue Type: New Feature
          Components: Async Scaleout
            Reporter: Eddie Epstein

When using a collection reader with the client API, application control is limited to callbacks on the
completion of commands. In order to measure processing latency, a client must in addition be notified
just before each process request is sent.

--

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Picon
Favicon

[jira] Closed: (UIMA-1294) Enable access of service's ipaddr from process Cas replies


     [
https://issues.apache.org/jira/browse/UIMA-1294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Eddie Epstein closed UIMA-1294.
-------------------------------

    Resolution: Fixed

Added optional logging of each process request with service IP address, send time and duration.

> Enable access of service's ipaddr from process Cas replies
> ----------------------------------------------------------
>
>                 Key: UIMA-1294
>                 URL: https://issues.apache.org/jira/browse/UIMA-1294
>             Project: UIMA
>          Issue Type: Improvement
>          Components: Async Scaleout
>            Reporter: Eddie Epstein
>            Assignee: Eddie Epstein
>            Priority: Minor
>
> Process Cas reply messages contain the service's host ipaddr, but there is no mechanism to retrieve this
info. Also, would be nice for the sample program, RunRemoteAsyncAE to show how to access this info and to
display it.

--

-- 
This message is automatically generated by JIRA.
-
(Continue reading)

Picon
Favicon

[jira] Closed: (UIMA-1300) The client API needs to support optional callback listener before sendCas


     [
https://issues.apache.org/jira/browse/UIMA-1300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jerry Cwiklik closed UIMA-1300.
-------------------------------

    Resolution: Fixed

Modified Uima AS client API to use UimaAsBasicCallbackListener class. This abstract class implements a
new method onBeforeMessageSend() that is called by the client API just before a message is sent to the
service. The application can extend this class and override the method to receive notifications

> The client API needs to support optional callback listener before sendCas
> -------------------------------------------------------------------------
>
>                 Key: UIMA-1300
>                 URL: https://issues.apache.org/jira/browse/UIMA-1300
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Async Scaleout
>            Reporter: Eddie Epstein
>
> When using a collection reader with the client API, application control is limited to callbacks on the
completion of commands. In order to measure processing latency, a client must in addition be notified
just before each process request is sent.

--

-- 
This message is automatically generated by JIRA.
-
(Continue reading)

Thilo Goetz | 4 Mar 11:16
Picon
Picon

Re: [jira] Updated: (UIMA-1299) Contribution of Lucene CAS Indexer

In order to move this along, I'll call for a vote.
No use prevaricating about the bush ;-)

--Thilo

Rico Landefeld (JIRA) wrote:
>      [
https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
> 
> Rico Landefeld updated UIMA-1299:
> ---------------------------------
> 
>     Attachment: lucene-indexer.tar.gz
> 
>> Contribution of Lucene CAS Indexer
>> ----------------------------------
>>
>>                 Key: UIMA-1299
>>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>>             Project: UIMA
>>          Issue Type: New Feature
>>          Components: Sandbox
>>            Reporter: Rico Landefeld
>>         Attachments: lucene-indexer.tar.gz
>>
>>
>> Lucas is a UIMA CAS consumer component which writes CAS data into a Lucene index. It is based on a XML-based
"mapping configuration  file" in which the user can determine which UIMA annotations should be put into
which Lucene field, and how this field is set up (e.g. indexed and/or stored). In addition, some basic
functionality for (ontolgical) hypernym indexing is provided.
(Continue reading)

Thilo Goetz (JIRA | 4 Mar 11:21
Picon
Favicon

[jira] Commented: (UIMA-1299) Contribution of Lucene CAS Indexer


    [
https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12678678#action_12678678
] 

Thilo Goetz commented on UIMA-1299:
-----------------------------------

The md5 signature of lucene-indexer.tar.gz is 48 15 80 58 4D D0 96 3A  4E 90 74 E3 45 97 51 12

> Contribution of Lucene CAS Indexer
> ----------------------------------
>
>                 Key: UIMA-1299
>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>             Project: UIMA
>          Issue Type: New Feature
>          Components: Sandbox
>            Reporter: Rico Landefeld
>         Attachments: lucene-indexer.tar.gz
>
>
> Lucas is a UIMA CAS consumer component which writes CAS data into a Lucene index. It is based on a XML-based
"mapping configuration  file" in which the user can determine which UIMA annotations should be put into
which Lucene field, and how this field is set up (e.g. indexed and/or stored). In addition, some basic
functionality for (ontolgical) hypernym indexing is provided.
> Additionally, Lucas is able to perform offset-based token stream alignment and merging of UIMA
annotations (via token position increment) in the same Lucene field (e.g. "documenttext" or "title")

--

-- 
(Continue reading)

Thilo Goetz | 4 Mar 11:32
Picon
Picon

[VOTE] Accept contribution of Lucene CAS Indexer into the sandbox

Please vote to accept the contribution of the Lucene
CAS indexer into the sandbox.  See Jira issue UIMA-1299
(https://issues.apache.org/jira/browse/UIMA-1299) for
the tar ball.

  [ ] +1 Accept Lucene CAS indexer into UIMA sandbox
  [ ] -1 Do not accept contribution of Lucene CAS indexer

You're all encouraged to vote, even if you're not a
UIMA committer.  If you vote to reject the contribution,
please remember to give a reason.

--Thilo

Thilo Goetz | 4 Mar 11:35
Picon
Picon

Re: [VOTE] Accept contribution of Lucene CAS Indexer into the sandbox


+1

Thilo Goetz wrote:
> Please vote to accept the contribution of the Lucene
> CAS indexer into the sandbox.  See Jira issue UIMA-1299
> (https://issues.apache.org/jira/browse/UIMA-1299) for
> the tar ball.
> 
>   [ ] +1 Accept Lucene CAS indexer into UIMA sandbox
>   [ ] -1 Do not accept contribution of Lucene CAS indexer
> 
> You're all encouraged to vote, even if you're not a
> UIMA committer.  If you vote to reject the contribution,
> please remember to give a reason.
> 
> --Thilo
jochen.leidner | 4 Mar 16:32
Favicon

RE: [VOTE] Accept contribution of Lucene CAS Indexer into the sandbox

Accept Lucene CAS Indexer into the sandbox

  [+1] +1 Accept Lucene CAS indexer into UIMA sandbox
  [  ] -1 Do not accept contribution of Lucene CAS indexer

Jochen

--
Dr. Jochen Leidner
Research Scientist

Thomson Reuters Corporation
Research & Development
610 Opperman Drive
St. Paul, MN 55123
USA

http://www.ThomsonReuters.com


Gmane