Alberto Lavelli | 1 Jul 10:57 2004
Picon

annotation tools

Dear CORPORA list members,

I'm interested in graphical tools for manual annotation of texts.  The
goal is to manually annotate documents to train/test IE systems.  In
particular, I'm interested in tools that allow to annotate not only
entities (e.g. Named Entities) but also relations between such
entities (e.g., the relations of the Template Relation task in MUC-7:
employee_of connecting person and organization, or location_of
connecting organization and location).  Tools I'm already aware of:

 - the ALEMBIC Workbench by MITRE (already downloaded; it's the tool
   I'm more familiar with)
 - WordFreak (downloaded a few days ago; one of the problems with
   WordFreak seems to be the shortage of documentation)
 - the ACE annotation tools by LDC

I have had a look at the old messages of the list without being able
to find anything interesting.  I have already consulted the web page
on Linguistic Annotation by Steven Bird and Mark Liberman
(http://www.ldc.upenn.edu/annotation/) but I have found nothing which
appears to be relevant (BTW, the last update was in December 2001).
I'm particularly interested in first-hand experience of use of the
tools (including the ones mentioned above).

I'll post a summary of the replies.

thanks in advance

	alberto

(Continue reading)

Anil Kumar Singh | 1 Jul 11:40 2004
Picon

Wanted: Robotics / Human-computer interaction corpora

Dear All.

I am working on a speech-cum-natural language
interface for a robot. For this I need a corpus (for
the speech recognizer) of sentences used to control a
robot.

A corpora of sentences for controlling some other
computing device will also be useful.

Regards,

Anil Kumar Singh
PhD (Computationl Linguistics)
LTRC, IIIT, Hyderabad, India.

________________________________________________________________________
Yahoo! India Careers: Over 50,000 jobs online
Go to: http://yahoo.naukri.com/

Vincenzo Pallotta | 1 Jul 16:46 2004
Picon
Picon

Call for participation: 3rd ROMAND workshop at COLING 2004

(apologies for multiple copies)

*************************************************************************
CALL FOR PARTICIPATION

ROMAND 2004
3rd  workshop on RObust Methods in Analysis of Natural language Data

A satelite event of COLING 2004
University of Geneva - Switzerland
29th August  2004

[ http://lithwww.epfl.ch/romand2004/ ]

*************************************************************************

Robustness in Computational Linguistics has been recently recognized as 
a central issue for the design of interactive applications based on 
natural language communication. If a failure of the system can be 
acceptable in batch applications requiring a human intervention, an 
on-line system should be capable of dealing with unforeseen situations 
in a more flexible way. When we talk about system failure we do not 
think at inherent program failures like infinite loops or system 
exception, we consider, rather, failures related to the processing of 
the input and its assimilation in the system's knowledge base. A failure 
of this kind means simply that the system does not "understand" the 
input. The automated analysis of natural language data has become a 
central issue in the design of Intelligent Information Systems. 
Processing unconstrained natural language data is still considered an 
AI-hard task. However, various analysis techniques have been proposed in 
(Continue reading)

Miles Osborne | 5 Jul 12:18 2004
Picon
Picon

Faculty Job at Edinburgh: Statistical Natural Language Processing

a number of us will be at ACL and/or Coling, so feel free to ask questions then

Miles
********************************************************************************

Expiry Date 	30-Jul-2004

Permanent Position

The successful candidate will be highly-qualified and have expertise in
statistical natural language processing, in one or more of the following areas:
statistical parsing, interpretation, or generation, multimodal processing,
machine learning for NLP, statistical machine translation, information
retrieval/extraction, or language resources.  A track record of research in
semantics or discourse, or an interest in how human models of language
processing can inform language engineering would also be an advantage.  You
should demonstrate a world-class research record and both interest and ability
in teaching.  Informal enquiries to Professor Johanna Moore (J.Moore <at> ed.ac.uk),
tel. 0131 651 1336 or Professor Mark Steedman (steedman <at> inf.ed.ac.uk), tel.
0131 650 4631.  

https://www.jobs.ed.ac.uk/jobs/index.cfm?action=jobdet&jobid=3002169

78303018 | 5 Jul 08:51 2004
Picon

About Inside outside algorithm,


Dear Collegues,
Does any body know any paper or reference for the result of applying 
Pure Inside outside grammar induction algorithm [Lari & Young 1990] on PENN
TREEBANK.
I Know the paper of applying inside outside on Penn Treebank : 

"[Schabes93] Y. Schabes, M. Roth and R. Obsorne, “Parsing the Wall Street
Journal with the inside-outside Algorithm”, In Proceedings of the Sixth
Conference of the European Chapter of the ACL. Pages 341-347, 1993."

but it is relatd the verfied version of inside-outside described by [Peirra &
Schabes 1992].

regards

-------------------------------------------------
This mail sent through : http://mehr.sharif.ir/

Steve Renals | 5 Jul 14:01 2004
Picon
Picon

Another faculty job at Edinburgh


In addition to the statistical NLP job, there is also a vacancy for a
lecturer (= assistant professor) in Speech Technology.
Steve Renals

---------

Lectureship in Speech Technology
Department of Theoretical and Applied Linguistics /
Centre for Speech Technology Research

Following the appointment of Simon King to an EPSRC Advanced
Fellowship at Edinburgh, applications are invited for a fixed term
Lectureship in Speech Technology in the Centre for Speech Technology
Research (CSTR). The successful applicant will be highly qualified,
have expertise in spoken language processing, particularly speech
recognition, speech synthesis or speech signal processing, and will
demonstrate an excellent research record as well as interest and
ability in teaching. He or she will be expected to develop and
participate in research activities at CSTR, which is a collaborative
activity of the Department of Theoretical and Applied Linguistics
(TAAL) and the School of Informatics. 

Informal enquiries to  Steve Renals, Director of CSTR
(s.renals <at> ed.ac.uk, tel: +44 131 650 4589) or Ellen Gurman Bard, Head
of TAAL (ellen <at> ling.ed.ac.uk, tel: +44 131 650 3951).  For further
information about this role, and application procedure please go to:
  https://www.jobs.ed.ac.uk/jobs/index.cfm?action=jobdet&jobid=3002230

The appointment is tenable from 1 January 2005 for a fixed term of
(Continue reading)

Antonio Branco | 5 Jul 17:35 2004
Picon

Conf. on Anaphora: call for participation


               ** Call for Participation **


                        DAARC 2004
            The 5th International Conference on
         Discourse Anaphora and Anaphor Resolution
               http://daarc2004.di.fc.ul.pt

         S. Miguel, Azores, September 23 - 24, 2004

Following the success of the previous international colloquia
on Discourse Anaphora and Anaphor Resolution at Lancaster
in 1996, 1998 and 2000, and at Lisbon in 2002, the next colloquium
in the series will be held in Furnas, S. Miguel island, Azores,
Portugal, in 2004, on the 23-24th September.

The programme of DAARC2004 includes 2 invited talks, by Tony
Sanford and Amit Bagga, and 30 contributed presentations,
which reflect the most recent advances of the work on anaphora,
ranging from theoretical linguistic approaches through
psycholinguistic and cognitive work to computational modelling
of anaphor resolution.

Detailed information on the venue, accommodation and registration
can be found at: http://daarc2003.di.fc.ul.pt . July 15 is the 
deadline for registration with reduced fees.

This is the colloquium programme:

(Continue reading)

Cormac O'Brien | 6 Jul 10:46 2004
Picon
Picon

Casement Diaries


Hi,

Does anyone know where I could get a machine readable copy of Sir Roger
Casement's diaries. I'm interested in using statistical experiments 
to test their authenticity.

Many thanks,

Cormac

-----------------------------------------
Cormac O'Brien
Postgraduate Student (M.Sc. by research)
Computational Linguistics Group
Trinity College, Dublin

Tel: 00353 1 608 2866

Ken Litkowski | 6 Jul 17:28 2004

Calling for OUP licensees

About a year ago, Oxford University Press invited research proposals for 
making use of its lexical resources.  I was one of about 15 proposers 
who was granted a license.  I wish to make contact with others who were 
granted a license and who are willing to make themselves known to me. 
(Oxford, naturally, will not make these known.)  I have three topics I'd 
like to discuss with other licensees.  I hope that each will be mutually 
beneficial.

Thanks,
	Ken
--

-- 
Ken Litkowski                     TEL.: 301-482-0237
CL Research                       EMAIL: ken <at> clres.com
9208 Gue Road
Damascus, MD 20872-1025 USA       Home Page: http://www.clres.com

Serge Sharoff | 7 Jul 09:11 2004
Picon

RE: Calling for OUP licensees

Ken,

We got four bilingual dictionaries (French, German, Russian and Spanish) and used them for experiments in
machine translation and training of translators.  There're many problems with encoding of translation
equivalents.  I developed a Perl script that tres to extract some information and store it in the TEI
format, but the results are far from being perfect (and I mostly worked on the Russian dictionary, but the
script also extracts smthg from the other three).  I'm ready to share the script and I'll also be interested
in knowing your experience in dealing with the data.

Best wishes,
Serge
> -----Original Message-----
> From: owner-corpora <at> lists.uib.no [mailto:owner-corpora <at> lists.uib.no] On
> Behalf Of Ken Litkowski
> Sent: Tuesday, July 06, 2004 4:28 PM
> To: corpora <at> hd.uib.no
> Subject: [Corpora-List] Calling for OUP licensees
> 
> About a year ago, Oxford University Press invited research proposals for
> making use of its lexical resources.  I was one of about 15 proposers
> who was granted a license.  I wish to make contact with others who were
> granted a license and who are willing to make themselves known to me.
> (Oxford, naturally, will not make these known.)  I have three topics I'd
> like to discuss with other licensees.  I hope that each will be mutually
> beneficial.
> 
> Thanks,
> 	Ken
> --
> Ken Litkowski                     TEL.: 301-482-0237
(Continue reading)


Gmane