E. Glockner | 12 Aug 2006 10:50

Commented Romanian stop word list

Hello Mr. Porter, hello Mr. Boulton,

we realised that the stop word list we sent you is the uncommented
one. The commented version is attached. Please excuse the
late follow-up.
I also have a question of my own about evaluation. You receive many stemmers
for different languages, and I assume you do not speak every one of those
languages. How, then, do you evaluate the stemmers? Do you
just rely on the results that are sent in (e.g. diffs.txt), or do you have
your own way of evaluating them?

With kind regards,
E. Glockner and colleagues.


| A Romanian stop word list. Comments begin with vertical bar. Each stop
| word is at the start of a line.

a			|to (verb infinitive particle)
abia			|only, just
acea			|that (adj sg fem)
aceasta, această	|this (adj/ pron sg fem)
aceea			|that (adj/ pron sg fem)
acelaşi			|the same (adj/ pron sg masc)
aceia			|those (adj/ pron pl masc)
acel			|that (adj sg masc)
acela			|that (adj/ pron sg masc)
acelaşi			|the same (adj/ pron sg masc)
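
For readers who want to load a list in this format, the following is a minimal sketch of a parser, not part of the original attachment. It follows the two rules stated in the list header (comments begin with a vertical bar, each stop word is at the start of a line). The function name `parse_stop_words` is hypothetical, and treating a line such as "aceasta, această" as two comma-separated variants is an assumption, as is dropping repeated entries.

```python
def parse_stop_words(text):
    """Return the stop words found in `text`, in order, without repeats."""
    words = []
    seen = set()
    for line in text.splitlines():
        # Everything from the first vertical bar onward is a comment.
        entry = line.split("|", 1)[0].strip()
        if not entry:
            continue  # blank line or comment-only line
        # Assumption: commas separate spelling variants on one line.
        for word in entry.split(","):
            word = word.strip()
            if word and word not in seen:
                seen.add(word)
                words.append(word)
    return words


# A few lines in the same format as the list (tabs written as \t).
sample = """\
| A Romanian stop word list.
a\t\t\t|to (verb infinitive particle)
abia\t\t\t|only, just
aceasta, această\t|this (adj/ pron sg fem)
acelaşi\t\t\t|the same
acelaşi\t\t\t|the same
"""

print(parse_stop_words(sample))
```

Commas inside a comment (as in "only, just") are ignored because the comment is cut off before the entry is split.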