Re: Contributing to a Yiddish stemmer
2011-04-05 18:06:23 GMT
Will, Hi. Your request is a bit unusual ... let's see what I could suggest. If you're not a programmer, I would not advise trying to write programs. What you might do is to formulate a set of rules for normalising the vocabulary of Yiddish, and present it on the internet as a "challenge" for others to code up. The rules could be set out like one of the stemmer definitions in the snowball site, http://snowball.tartarus.org/algorithms/german/stemmer.html I think you should also try & contact others with an interest in retrieval of texts in Yiddish. Searching in Google is perhaps the best way forward here. I did not realise the stemming algorithms might be useful in translation. I'm so involved in IR I tend to think of them as just an adjuct to IR work. I take it that willhelton.com is your eponymous website. I may mail again after thinking it over further, meanwhile (if you don't mind) I'll post this to snowball-discuss, which sometimes generates extra useful ideas, and to Pat Miles, who helped create the German and Russsian stemmers at snowball, Martin At 02:58 PM 4/5/2011 +0100, Will Helton wrote: > >Dear Dr Porter, >(Continue reading)
RSS Feed