2 May 2011 11:13
Re: less than zero (enriching Italian stemmer)
Martin Porter <martin <at> porterloo.wanadoo.co.uk>
2011-05-02 09:13:52 GMT
2011-05-02 09:13:52 GMT
Adriano, Yes, you alter the .sbl file, compile the snowball compiler, and translate the .sbl file into C or java. If you're a software beginner I suggest you get local help if you can. This will be easier than trying to take you through the steps from the snowball-discuss board. I'm not quite sure where python comes into all this ... Martin At 10:19 AM 4/29/2011 -0400, adriano allora wrote: >Hi Martin! > >Wow, you've just opened a new world to me! It's my first C script and, you >see, I can be stupid, but I never be coward when I see something >completely... °___° > >No. ok, seriously: I downloaded a gzip archive named snowball_web_and_code >which contains all the source code for snowball. >I opened it and see several things very interesting (for instance adding >some stopwords, but it's not necessary doing all now: there is time for >further improvements), so: thank you for this. >But I beg your pardon: now I'm not sure about what I have to do. >1) can I simply change the files stem_ISO_8859.sbl and stem_MS_DOS_LATIN.sbl >and in the directory named algorithms/italian? if not: where is the source >file I have to change in order to add morphemes to Italian algorithm? >2) after changing the algorithm what I exactly have to do? It's reasonable >to assume that compiling it (gcc -O -o Snowball compiler/*.c) will not(Continue reading)
RSS Feed