Diana Inkpen | 1 Jan 2009 10:00
Picon
Favicon

Reminder: NAACL HLT 2009 Student Research Workshop

   Reminder: NAACL HLT 2009 Student Research Workshop

  Submission deadline: January 4, 2009

CALL FOR PAPERS

Paper Submission Deadline: Sunday, January 4th, 2009

Unless otherwise stated, all submissions are due by 11:59 PM EST
on the specified day.

1. General Invitation for Submissions

The Student Research Workshop is an established tradition at ACL
conferences. The workshop provides a venue for student researchers
investigating topics in Computational Linguistics and Natural Language
Processing to present their work and receive feedback from a general
audience as well as from panelists. The panelists are experienced
researchers who will prepare in-depth comments and questions in
advance of the presentation.

We would like to invite student researchers to submit their work to
the workshop. Since this workshop is an excellent opportunity to ask
for suggestions, to receive useful feedback and to run your ideas by
an international audience of researchers, the emphasis of the workshop
will be on work in progress. The research being presented can come
from any topic area within computational linguistics and is understood
to be applied to speech and/or text. A list of topic areas is provided
in the Call for Papers for the NAACL HLT 2009 Conference available at:

(Continue reading)

Kristiina Jokinen | 5 Jan 2009 13:22
Picon
Picon

NODALIDA 2009: Final CALL FOR PAPERS

				NODALIDA 2009
	The 17th Nordic Conference of Computational Linguistics

				May 14-16, 2009
				Odense, Denmark

			  FINAL CALL FOR FULL PAPERS

IMPORTANT DATES

Submission full regular and student papers: 	January 12, 2009
Submission all short papers/demos: 			February 18, 2009
Notification of acceptance (all paper types): 	March 23, 2009
Submission camera-ready papers: 			April 14, 2009

MORE INFORMATION on the call and submission details on the conference website:
	http://beta.visl.sdu.dk/nodalida2009/

Questions about submissions can be sent to: 	nodalida2009 <at> visl.sdu.dk.

PROGRAM COMMITTEE

Kristiina Jokinen (Chair), University of Helsinki
Robin Cooper, University of Gothenburg
Anna Korhonen, University of Cambridge
Kaili Müürisep, University of Tartu
Joakim Nivre, Uppsala University
Patrizia Paggio, University of Copenhagen
Koenraad de Smedt, University of Bergen
Roman Yangarber, University of Helsinki
(Continue reading)

Khalil Sima'an | 6 Jan 2009 10:19
Picon
Picon
Favicon

Vacancy POSTDOC (2 years), Statistical NLP

Vacancy postdoc reseacher, 2 years, Statistical MT and Parsing 
<http://staff.science.uva.nl/%7Esimaan/postdoc_adv.html>*

The Institute for Logic, Language and Computation 
<http://www.illc.uva.nl> at the University of Amsterdam 
<http://www.uva.nl> has a vacancy for a
*

  Postdoctoral Researcher

    1.0 fte (38h per week)
    For both internal and external candidates

Summary

    * Position: POSTDOC researcher
    * Duration: 2 years (full time, 38hrs per week)
    * Salary (gross per month): minimum Euro 2379 and maximum Euro 3755
      in the first year.
    * The Collective Employment Agreement of the Dutch Universities is
      applicable.
    * Last date for application: February 15, 2009.

<http://staff.science.uva.nl/%7Esimaan/postdoc_adv.html>
Project

Priors for the Estimation of Probabilistic Grammars from Incomplete 
Natural Language Data
VIDI project Sima'an [2007-2011]

(Continue reading)

Nuno Cardoso | 6 Jan 2009 11:42
Picon

Call for Participation in GikiCLEF 2009

Apologies for cross-postings ========================================================================= GikiCLEF - Cross-language Geographic Information Retrieval from Wikipedia A CLEF 2009 track ========================================================================= http://www.linguateca.pt/GikiCLEF/ ---------------------- Call for Participation ---------------------- You are invited to participate in GikiCLEF 2009, a CLEF track, whose aim is to evaluate systems which find Wikipedia entries / documents that answer a particular information need which requires geographical reasoning of some sort. GikiCLEF is the follow-up of the GikiP 2008 pilot task which ran under GeoCLEF 2008, and is one of the tracks under CLEF, whose workshop will take place in Corfu Greece in connection with ECDL 2009. TASK DESCRIPTION: ================= Systems will receive a list of 50 questions and will have to return a list of answers (in the form of titles of Wikipedia entries) for each, from the GikiCLEF collection. GikiCLEF LANGUAGES: =================== Questions and answers are to be found in the nine languages (and ten Wikipedia versions) below: Bulgarian (BG), Dutch (NL), English (EN), German (DE), Italian (IT), Norwegian (NN and NO), Portuguese (PT), Romanian (RO) and Spanish (ES). Systems may participate in any language subset, although the best system would have to process all languages. GikiCLEF COLLECTION: =================== The GikiCLEF collection comprehends the June 2008 Wikipedia document collections for the above mentioned languages, processed by the WikiXML tool developed by the University of Amsterdam (http://ilps.science.uva.nl/WikiXML/xmlformat.php). Also available to the participants are the June 2008 MediaWiki SQL and HTML dumps for the GikiCLEF languages. TOPICS: ======= Topics (or rather, questions) will be released early March 2009, in all GikiCLEF languages. The topic choice committee will devise topics with crosslingual and cultural interest, so that the need for looking in Wikipedia in different languages is not artificial. See the Web site for topic examples and previous GikiP topics. EVALUATION: =========== After pooling all answers returned by the participant systems, they will be manually assessed by the organization. The systems will be evaluated according to the number of correct hits and precision in all languages, so that multilinguality is rewarded. IMPORTANT DATES: ================ 10 January 2009: Collection release. 18 January 2009: Final registration required. February 2009: Discussion / Publication of the final definition of the GikiCLEF task. March 2009: Topic release and run submission. (2 weeks after topic release): Deadline for run submission. June 2009: Assessment and GikiCLEF results made available. 14 August 2009: Submission of Papers for Working Notes 30 Sep/2 Oct 2009: CLEF Workshop (in Corfu, Greece) To indicate your interest and/or preliminary register, use the form on the site.
_______________________________________________
Corpora mailing list
Corpora <at> uib.no
http://mailman.uib.no/listinfo/corpora
Erik Tjong Kim Sang | 6 Jan 2009 14:47
Picon
Picon
Favicon

TLT 7 - Final Call for Participation

THE SEVENTH INTERNATIONAL WORKSHOP ON TREEBANKS AND LINGUISTIC THEORIES

January 23-24, 2009
Groningen, The Netherlands
http://www.let.rug.nl/tlt/

FINAL CALL FOR PARTICIPATION

The Seventh International Workshop on Treebanks and Linguistic
Theories will be held on January 23 and 24, 2009 in Groningen,
The Netherlands.

INVITED SPEAKERS

Adam Przepiorkowski - Linguistic annotation for valence acquisition
   and for its evaluation
Robert Malouf - Treebanks and evolutionary simulation for explaining
   typological patterns

WORKSHOP PROGRAM

Towards a multi-representational treebank
   Fei Xia, Rajesh Bhatt, Owen Rambow, Martha Palmer and Dipti Misra
   Sharma
PASSAGE Syntactic Representation (Talk & Demo)
   Patrick Paroubek, Eric de la Clergerie, Sylvain Loiseau, Anne
   Vilnat and Gil Francopoulo
Huge Parsed Corpora in LASSY
   Gertjan van Noord
Cultivating Trees: Adding Several Semantic Layers to the Lassy
   Treebank in SoNaR
   Ineke Schuurman, Veronique Hoste and Paola Monachesi
The Distribution of Weak and Strong Object Reflexives in Dutch
   Gosse Bouma and Jennifer Spenader
Similarity Rules! Exploring Methods for Ad-Hoc Rule Detection
   Markus Dickinson and Jennifer Foster
MonaSearch - A Tool for Querying Linguistic Treebanks
   Hendrik Maryns and Stephan Kepser
Constructing a Valence Lexicon for a Treebank of German
   Erhard Hinrichs and Heike Telljohann
TePaCoC - A Testsuite for Testing Parser Performance on Complex German
   Grammatical Constructions
   Sandra Kuebler, Ines Rehbein and Josef van Genabith
A Data-Driven Dependency Parser for Romanian
   Mihaela Calacean and Joakim Nivre
Automatic Annotation of Morpho-Syntactic Dependencies in a Modern
   Hebrew Treebank
   Noemie Guthmann, Yuval Krymolowski, Adi Milea and Yoad Winter
A Quechua-Spanish Parallel Treebank
   Annette Rios Gonzales, Anne Gohring and Martin Volk
Extracting and Annotating Wikipedia Sub-Domains
   Gisle Ytrestol, Stephan Oepen and Daniel Flickinger
Semantic Annotation of Genitive Attributes in a German Treebank (Poster)
   Maya Bangerter
To Use a Treebank or Not - Which Is Better for Hypernym Extraction?
   (Poster)
   Erik Tjong Kim Sang
LFG Parsebanker: A Tool for Building and Searching a Treebank as a
   Parsed Corpus (Poster & Demo)
   Victoria Rosen, Paul Meurer and Koenraad De Smedt

FURTHER INFORMATION

For more information on the registration procedure, venue and
other aspects of the workshop, please see the workshop website:
http://www.let.rug.nl/tlt/

_______________________________________________
Corpora mailing list
Corpora <at> uib.no
http://mailman.uib.no/listinfo/corpora

Barbara Plank | 6 Jan 2009 15:12
Picon
Favicon

CLIN 19 - Final Call for Participation

CLIN 19 - FINAL CALL FOR PARTICIPATION

Computational Linguistics in The Netherlands Thursday 22 January 2009
http://www.let.rug.nl/clin/

The Nineteenth Annual Meeting of Computational Linguistics in The
Netherlands (CLIN) will be held on Thursday 22 January 2009 in
Groningen, The Netherlands. We invite everyone with an interest in
computational linguistics to take part in the meeting.

ACCEPTED PAPERS

CLIN 19 will contain 60 talks and 17 posters on different aspects of
computational linguistics such as parsing, machine translation, language
modeling, information extraction, ontologies, corpus linguistics, corpus
annotation, and others. A complete list of the 77 accepted papers can be
found at the conference website.

INVITED TALK

The invited talk at CLIN 19 on "Constraint-based Sentence Compression"
will be presented by Mirella Lapata from the University of Edinburgh.

REGISTRATION

Online registration is possible via the conference website:
http://www.let.rug.nl/clin/

Payments can be made on-site (cash only) during the conference or via
bank transfer (before January 8, 2009). No credit-card payments are
possible.

CO-LOCATED EVENTS

CLIN 19 will be co-located with TLT 7, the 7th International Workshop on
Treebanks and Linguistic Theories, which will be held on 23-24 January
2009, in Groningen.

Two linguistic events are held in Groningen in the same week: the
conference on Relating Asymmetries between Speech & Comprehension in the
Acquisition of Language (24-25 January 2009) and the winter edition of
the Dutch Graduate School for Linguistics (LOT, 19-30 January 2009).

ORGANIZATION

CLIN 19 is organized by Erik Tjong Kim Sang and Barbara Plank, with
valuable help from Gertjan van Noord, Gosse Bouma, Jelena Prokic,
Cagri Coltekin and Tim van de Cruys.

See you in Groningen on Thursday 22 January!

_______________________________________________
Corpora mailing list
Corpora <at> uib.no
http://mailman.uib.no/listinfo/corpora

Barbara Plank | 6 Jan 2009 14:45
Picon
Favicon

CLIN 19 - Final Call for Participation

CLIN 19 - FINAL CALL FOR PARTICIPATION

Computational Linguistics in The Netherlands Thursday 22 January 2009
http://www.let.rug.nl/clin/

The Nineteenth Annual Meeting of Computational Linguistics in The
Netherlands (CLIN) will be held on Thursday 22 January 2009 in
Groningen, The Netherlands. We invite everyone with an interest in
computational linguistics to take part in the meeting.

ACCEPTED PAPERS

CLIN 19 will contain 60 talks and 17 posters on different aspects of
computational linguistics such as parsing, machine translation, 
language
modeling, information extraction, ontologies, corpus linguistics, 
corpus
annotation, and others. A complete list of the 77 accepted papers can 
be
found at the conference website.

INVITED TALK

The invited talk at CLIN 19 on "Constraint-based Sentence Compression"
will be presented by Mirella Lapata from the University of Edinburgh.

REGISTRATION

Online registration is possible via the conference website:
http://www.let.rug.nl/clin/

Payments can be made on-site (cash only) during the conference or via
bank transfer (before January 8, 2009). No credit-card payments are
possible.

CO-LOCATED EVENTS

CLIN 19 will be co-located with TLT 7, the 7th International Workshop 
on
Treebanks and Linguistic Theories, which will be held on 23-24 January
2009, in Groningen.

Two linguistic events are held in Groningen in the same week: the
conference on Relating Asymmetries between Speech & Comprehension in 
the
Acquisition of Language (24-25 January 2009) and the winter edition of
the Dutch Graduate School for Linguistics (LOT, 19-30 January 2009).

ORGANIZATION

CLIN 19 is organized by Erik Tjong Kim Sang and Barbara Plank, with
valuable help from Gertjan van Noord, Gosse Bouma, Jelena Prokic,
Cagri Coltekin and Tim van de Cruys.

See you in Groningen on Thursday 22 January!

_______________________________________________
Corpora mailing list
Corpora <at> uib.no
http://mailman.uib.no/listinfo/corpora

Linguistic Data Consortium | 6 Jan 2009 23:54
Favicon

New from the LDC

LDC2008T25
AQUAINT-2 Information-Retrieval Text Research Collection  -

LDC2008L03
Global Yoruba Lexical Database v. 1.0  -

The Linguistic Data Consortium (LDC) would like to announce the availability of two new publications.

New Publications

(1) AQUAINT-2 Information-Retrieval Text Research Collection was developed by LDC for NIST's (National Institute for Standards and Technology) AQUAINT 2007 Question-Answer (QA) track. It consists of approximately 2.5 GB of English news text from six distinct sources collected by LDC (Agence France Presse, Associated Press, Central News Agency (Taiwan), Los Angeles Times-Washington Post, New York Times and Xinhua News Agency) covering the period from October 2004 through March 2006. The AQUAINT-2 collection is the second part of a series intended to provide data useful for developing, evaluating and testing information extraction and retrieval systems. It follows the publication of The AQUAINT Corpus of English News Text (LDC2002T31).

The AQUAINT (Advanced Question-Answering for Intelligence)  program addresses interactivity with scenarios or tasks. The scenario provides a context in which questions will be asked and answered, and the task reflects the overall assignment. The program is committed to solve a single problem: how to find topically relevant, semantically related, timely information in massive amounts of data in diverse languages, formats, and genres.

For each source, all of the usable data collected by LDC was processed into a consistent XML format in which the stories for a given month are concatenated in chronological order into a single "DOCSTREAM" element; each story is a single "DOC" element within that stream and has a globally unique "id" attribute.

(2) The Global Yoruba Lexical Database v. 1.0 is a set of related dictionaries providing definitions and translations for over 450,000 words from the Yoruba language and its variants: Standard Yoruba (over 368,000 words), Gullah (over 3,600 words), Lucumí (over 8,000 words) and Trinidadian (over 1,000 words).

Yoruba is a Niger-Congo language (sub classification: Kwa > Yoruboid) spoken natively by nearly 20 million people, the vast majority of them in southwestern Nigeria.  The  Yoruba language diaspora is wide, stretching from southwestern Nigeria and Benin westward to the Caribbean and islands along the southeastern United States coast.  Throughout the region, Yoruba dialects blended with each other and with languages like Spanish and French to form a variety of creoles such as Gullah in the United States and Nagô in Brazil.  The ultimate goal of this dictionary is to provide coverage for all Yoruba dialects across the globe. For that reason, it will continue to be a work in progress.

The Yoruba dialect continuum consists of over fifteen varieties, with considerable phonological and lexical differences among them and some grammatical ones as well. Peripheral areas of dialectal regions often have some similarities to adjoining dialects. Standard Yoruba is a koine used for education, writing, broadcasting, and contact between speakers of different dialects.

The dictionaries in this publication are presented in two formats, Toolbox databases and XML. Short for The Field Linguist's Toolbox, Toolbox is a lexicographical database system published by SIL. SIL makes Toolbox freely available for download. In order to use the Global Yoruba Lexical Database v. 1.0, Toolbox must first be installed on the user's local computer.



--------------------------------------------------------------------
Linguistic Data Consortium Phone: (215) 573-1275 University of Pennsylvania Fax: (215) 573-2175 3600 Market St., Suite 810 ldc <at> ldc.upenn.edu Philadelphia, PA 19104 USA http://www.ldc.upenn.edu

_______________________________________________
Corpora mailing list
Corpora <at> uib.no
http://mailman.uib.no/listinfo/corpora
Joel Tetreault | 7 Jan 2009 01:47
Picon
Favicon

2009 Computational Linguistics Conference Calendar


Hello all, I've been maintaining a calendar of NLP and CL conferences for
the past five years, and have started a new listing for 2009 that people
might find useful:

http://www.cs.rochester.edu/~tetreaul/conferences.html

If you wish to have a conference added or report an error, please feel
free to email me.  I cull together conferences on this list from ACL
emails, linguistlist and corporalist, as well as from fellow researchers
emailing in, and update daily.  I'm hoping to make the calendar a little
less static in the future given recent feedback.

Cheers,
Joel

_______________________________________________
Corpora mailing list
Corpora <at> uib.no
http://mailman.uib.no/listinfo/corpora

Francis Tyers | 7 Jan 2009 07:42
Favicon
Gravatar

Re: 2009 Computational Linguistics Conference Calendar

El mar, 06-01-2009 a las 19:47 -0500, Joel Tetreault escribió:
> Hello all, I've been maintaining a calendar of NLP and CL conferences for
> the past five years, and have started a new listing for 2009 that people
> might find useful:
> 
> http://www.cs.rochester.edu/~tetreaul/conferences.html
> 
> If you wish to have a conference added or report an error, please feel
> free to email me.  I cull together conferences on this list from ACL
> emails, linguistlist and corporalist, as well as from fellow researchers
> emailing in, and update daily.  I'm hoping to make the calendar a little
> less static in the future given recent feedback.

There is something similar on the ACL Wiki:

 http://aclweb.org/aclwiki/index.php?title=Conferences_and_workshops

but yours seems more complete.

Fran

_______________________________________________
Corpora mailing list
Corpora <at> uib.no
http://mailman.uib.no/listinfo/corpora


Gmane