Neal Richter | 27 Apr 2006 23:27
Favicon

Stopword licenses

Hi all,

   We're big fans of your software.. thanks for all of your hard work. 
We've used them in our own software as well as added it to HtDig (open 
source search engine).

   Question:

   What license applies to the stopword lists?  We can't use them in the 
form you provide them, so we exact just the words (no comments), remove 
some, add others, transform to another character-set etc.

   We'd like to give credit where credit is due.. but these are stored in a 
pseudo DB that doesn't have an great mechanism to make the full BSD 
license visible when the user is editing the list.

   What would you recommend?  Can we put a note in our documentation?  Or a 
simple one-line statement at the top of the editing list?

   We do put notice of our use of the snowball software here: 
http://opensource.rightnow.com

Thanks!

--

-- 
Neal Richter
Sr. Researcher and Machine Learning Lead
Software Development
RightNow Technologies, Inc.
Customer Service for Every Web Site
(Continue reading)

Martin Porter | 28 Apr 2006 10:59
Picon

Re: Stopword licenses


Neal,

Very interesting to read of your use of the snowball work. I'm glad it's
been useful. (Perhaps we could add a reference to HtDig on our 'projects'
page, which is updated all too infrequently.)

The stopwords lists are BSD I guess, but I tend to be even more relaxed
about their use than the stemmers themselves. As you say, they usually
undergo a lot of massaging and adjusting before they become serviceable in
another piece of work. Given that you are not using them in a way that makes
acknowledgement easy, I suggest you don't bother to credit snowball as the
source.

Martin

P.S. -- I don't suppose you have a stopword list for Finnish?
Neal Richter | 28 Apr 2006 17:00
Favicon

Re: Stopword licenses


In fact we have a finnish stop word list!  I'll ask about send it your 
way.

Thanks!

On Fri, 28 Apr 2006, Martin Porter wrote:

>
> Neal,
>
> Very interesting to read of your use of the snowball work. I'm glad it's
> been useful. (Perhaps we could add a reference to HtDig on our 'projects'
> page, which is updated all too infrequently.)
>
> The stopwords lists are BSD I guess, but I tend to be even more relaxed
> about their use than the stemmers themselves. As you say, they usually
> undergo a lot of massaging and adjusting before they become serviceable in
> another piece of work. Given that you are not using them in a way that makes
> acknowledgement easy, I suggest you don't bother to credit snowball as the
> source.
>
> Martin
>
> P.S. -- I don't suppose you have a stopword list for Finnish?
>
>
>
>
> _______________________________________________
(Continue reading)


Gmane