12 Aug 2006 10:50
Commented Romanian stopword-list
E. Glockner <eglockner <at> hotmail.com>
2006-08-12 08:50:55 GMT
Hello Mr. Porter, hello Mr. Boulton,

We realised that the stop word list we sent you is the non-commented one. The attachment contains the one with the comments. Please excuse the late reaction.

I personally have a question about evaluation. You receive a lot of stemmers for different languages. I assume that you don't speak the language of each stemmer, do you? If so, how do you evaluate the stemmers? Do you just use the results that are sent in (e.g. diffs.txt), or do you have your own way of evaluating?

With kind regards,
E. Glockner and colleagues
 | A Romanian stop word list. Comments begin with vertical bar. Each stop
 | word is at the start of a line.

a |to (verb infinitive particle)
abia |only, just
acea |that (adj sg fem)
aceasta, această |this (adj/ pron sg fem)
aceea |that (adj/ pron sg fem)
acelaşi |the same (adj/ pron sg masc)
aceia |those (adj/ pron pl masc)
acel |that (adj sg masc)
acela |that (adj/ pron sg masc)
acelaşi |the same (adj/ pron sg masc)
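For readers who want to load a list in this layout, here is a minimal Python sketch. It is not part of the original message. It assumes only what the attachment header states: a vertical bar starts a comment, and each stop word sits at the start of a line. The function name `parse_stopwords` is hypothetical, and splitting an entry on commas is an assumption based on the "aceasta, această" line.

```python
def parse_stopwords(text):
    """Return the stop words from text in the 'word |comment' layout."""
    words = []
    for line in text.splitlines():
        # Everything from the first vertical bar onward is a comment.
        entry = line.split("|", 1)[0].strip()
        if not entry:
            continue  # blank line or comment-only line
        # Assumption: a comma before the bar separates spelling variants.
        for word in entry.split(","):
            word = word.strip()
            if word:
                words.append(word)
    return words


sample = """\
 | A Romanian stop word list.
a |to (verb infinitive particle)
abia |only, just
aceasta, această |this (adj/ pron sg fem)
"""

print(parse_stopwords(sample))
```

The comment is cut off before the entry is split on commas, so a comma inside a comment (as in "only, just") does not produce extra words.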