Grant Ingersoll | 1 May 2009 01:37

Re: kamikaze

Does Kamikaze have a mailing list?  It seems like, to me anyway, this  
conversation would be more appropriate for that list as it is about  
Kamikaze, not Lucene.

-Grant

On Apr 30, 2009, at 2:42 PM, molz wrote:

>
> Right on.
>
> -1 if not found, index in the sorted set if found.
>
> Anmol
>
>
>
> Michael Mastroianni wrote:
>>
>> Thanks, Anmol. Just so I'm clear on this: findWithIndex(foo)  
>> returns -1
>> if foo is not found, and some positive integer if it is?
>>
>> regards,
>> Michael
>>
>> -----Original Message-----
>> From: molz [mailto:anmol.bhasin <at> gmail.com]
>> Sent: Thursday, April 30, 2009 3:33 PM
>> To: java-user <at> lucene.apache.org

Grant Ingersoll | 1 May 2009 01:39

Re: How to get the similarity between two string vectors?

Yes and no.  You can have a look at More Like This in the contrib
package.  Additionally, you can just get the TermVectors out of Lucene
and write your own.  You could also use a MemoryIndex which contains one
document, represent the other document as a query, and then "search".

But, no, there isn't a MethodName(Vector1, Vector2) option.
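
For the "write your own" route, a minimal sketch in plain Java might look like the following. It computes the cosine between two term-frequency vectors built from lists of strings. It makes no Lucene calls and is not Lucene's scoring formula; the class and method names (TermVectorCosine, termFrequencies, cosine, norm) are made up for illustration, and in practice the term counts would come from whatever term vectors you pull out of the index.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Lucene: plain cosine similarity
// over raw term counts.
final class TermVectorCosine {

    // Count how often each term occurs in one "vector" of strings.
    static Map<String, Integer> termFrequencies(List<String> terms) {
        Map<String, Integer> freq = new HashMap<>();
        for (String term : terms) {
            freq.merge(term, 1, Integer::sum);
        }
        return freq;
    }

    // Euclidean length of a term-frequency vector.
    static double norm(Map<String, Integer> freq) {
        double sum = 0.0;
        for (int count : freq.values()) {
            sum += (double) count * count;
        }
        return Math.sqrt(sum);
    }

    // Cosine of the angle between the two term-frequency vectors.
    // Returns 0.0 when either vector is empty.
    static double cosine(List<String> a, List<String> b) {
        Map<String, Integer> fa = termFrequencies(a);
        Map<String, Integer> fb = termFrequencies(b);

        // Dot product over the terms the two vectors share.
        double dot = 0.0;
        for (Map.Entry<String, Integer> e : fa.entrySet()) {
            Integer other = fb.get(e.getKey());
            if (other != null) {
                dot += (double) e.getValue() * other;
            }
        }

        double normA = norm(fa);
        double normB = norm(fb);
        if (normA == 0.0 || normB == 0.0) {
            return 0.0;
        }
        return dot / (normA * normB);
    }

    public static void main(String[] args) {
        List<String> v1 = Arrays.asList("term1", "term2", "term3");
        List<String> v2 = Arrays.asList("term4", "term5", "term5");
        // The two example vectors share no terms.
        System.out.println(cosine(v1, v2));
        // Two of three terms shared.
        System.out.println(cosine(v1, Arrays.asList("term1", "term2", "term4")));
    }
}
```

Raw counts are used as weights here to keep the sketch short; swapping in tf-idf weights would only change how the maps are filled, not the cosine step.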

-Grant

On Apr 30, 2009, at 1:00 PM, Kamal Najib wrote:

> Hi,
> I am new to Lucene and I want to get the similarity between two
> vectors of strings. Is there a method that does that?
> For example, if I have the vectors:
> Vector1 :<"term1","term2","term3">
> Vector2:<"term4","term5","term5">
> is there a method to get the similarity between them in Lucene, or is
> there any other way to do it?
> For example: MethodName(Vector1,Vector2).
> Thanks in advance.
> kamal.
> -- 
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe <at> lucene.apache.org
> For additional commands, e-mail: java-user-help <at> lucene.apache.org


Grant Ingersoll | 1 May 2009 01:42

Re: dbsight

Solr (http://lucene.apache.org/solr) can import from a DB, if that is  
what you are after.  I haven't done a full feature comparison between  
DB Sight and Solr, but it appears there is a fair amount of overlap  
based on the front page.

HTH,
Grant

On Apr 30, 2009, at 3:36 PM, Michael Masters wrote:

> I posted this on java-dev <at> lucene.apache.org and it was suggested that
> I pose this question here:
>
> Hello Everyone,
>
> I just started to use lucene recently. Great project BTW. I was
> wondering if anyone has suggested making an open source version of
> dbsight (www.dbsight.net/). I've just started using it and I think it
> would be awesome if it was open source. Does anyone know of a project
> that's like this that is OS?
>
> If not, then how can I propose a project that does a similar thing?
>
> Thanks,
> Mike

John Wang | 1 May 2009 02:19

Re: kamikaze

You are right, Grant. Michael, Anmol, let's move this to the kamikaze mailing
list:
http://groups.google.com/group/kamikaze-users

Michael, I have added you by default.

-John

On Thu, Apr 30, 2009 at 4:37 PM, Grant Ingersoll <gsingers <at> apache.org> wrote:

> Does Kamikaze have a mailing list?  It seems like, to me anyway, this
> conversation would be more appropriate for that list as it is about
> Kamikaze, not Lucene.
>
> -Grant
>
>
> On Apr 30, 2009, at 2:42 PM, molz wrote:
>
>
>> Right on.
>>
>> -1 if not found, index in the sorted set if found.
>>
>> Anmol
>>
>>
>>
>> Michael Mastroianni wrote:
>>

Michael Masters | 1 May 2009 04:32

Re: dbsight

Sweet! I'll look more into solr. I wasn't under the impression solr  
could index a database like dbsight.

-Mike

On Apr 30, 2009, at 4:42 PM, Grant Ingersoll <gsingers <at> apache.org>  
wrote:

> Solr (http://lucene.apache.org/solr) can import from a DB, if that  
> is what you are after.  I haven't done a full feature comparison  
> between DB Sight and Solr, but it appears there is a fair amount of  
> overlap based on the front page.
>
> HTH,
> Grant
>
> On Apr 30, 2009, at 3:36 PM, Michael Masters wrote:
>
>> I posted this on java-dev <at> lucene.apache.org and it was suggested that
>> I pose this question here:
>>
>> Hello Everyone,
>>
>> I just started to use lucene recently. Great project BTW. I was
>> wondering if anyone has suggested making an open source version of
>> dbsight (www.dbsight.net/). I've just started using it and I think it
>> would be awesome if it was open source. Does anyone know of a project
>> that's like this that is OS?
>>
>> If not, then how can I propose a project that does a similar thing?

Erik Hatcher | 1 May 2009 04:11

Re: dbsight


On Apr 30, 2009, at 10:32 PM, Michael Masters wrote:
> Sweet! I'll look more into solr. I wasn't under the impression solr  
> could index a database like dbsight.

It's not point-and-clickable, but Solr's DataImportHandler has  
sophisticated configuration capabilities for indexing any JDBC  
accessible database.

And there is also the LuSql project that has recently gotten a lot of
good press, and I've seen it demo'd first hand; it's quite powerful and
flexible.

	Erik

Otis Gospodnetic | 1 May 2009 04:43

Re: dbsight


----- Original Message ----

> From: Erik Hatcher <erik <at> ehatchersolutions.com>
> 
> On Apr 30, 2009, at 10:32 PM, Michael Masters wrote:
> > Sweet! I'll look more into solr. I wasn't under the impression solr could
> > index a database like dbsight.
>
> It's not point-and-clickable, but Solr's DataImportHandler has sophisticated
> configuration capabilities for indexing any JDBC accessible database.
>
> And there is also the LuSql project that has recently gotten a lot of good
> press, and I've seen it demo'd first hand; it's quite powerful and flexible.

And, probably the simplest of all options mentioned above:

  http://sematext.com/product-db-indexer.html

Otis

Anshum | 1 May 2009 06:18

Re: [ no subject ]

As far as I know, you'd have to index one of the docs and then run a
query (the second doc) to get the similarity score.
Also, the default similarity takes more factors into account than the
regular VSM, so you'd have to look into that as well.
You could write code that creates a volatile index on the fly, runs a query,
returns the similarity, and clears the index (which would happen implicitly
with a RAMDirectory approach).

--
Anshum Gupta
Naukri Labs!
http://ai-cafe.blogspot.com

The facts expressed here belong to everybody, the opinions to me. The
distinction is yours to draw............

On Thu, Apr 30, 2009 at 8:58 PM, Kamal Najib <kamal.najib <at> mytum.de> wrote:

> Hi,
> I am new to Lucene and I want to get the similarity between two vectors of
> strings. Is there a method that does that?
> For example, assume the vectors:
> Vector1 :<"term1","term2","term3">
> Vector2:<"term4","term5","term5">
> Is there a method to get the similarity between them in Lucene, or is there
> any other way to do it?
> For example: getTheSymilarity(Vector1,Vector2).
> Thanks in advance.
> kamal.
>

Kamal Najib | 1 May 2009 11:06

Re: Re: [ no subject ]

Thanks, Anshum Gupta, for the reply.
"As far as I know, you'd have to index one of the docs and then run a
query (the second doc) to get the similarity score."

Which docs do you mean? Do you mean I have to create a doc for each vector, something like this:
Vector1 =<"term1","term2","term3"> --->doc1.add(new Field("id", "term1 term2 term3", Field.Store.YES, Field.Index.TOKENIZED));
Vector2 =<"term4","term5","term6"> --->doc2.add(new Field("id", "term4 term5 term6", Field.Store.YES, Field.Index.TOKENIZED));
or
Vector1 =<"term1","term2","term3"> --->
doc1.add(new Field("id", "term1", Field.Store.YES, Field.Index.TOKENIZED));
doc1.add(new Field("id", "term2", Field.Store.YES, Field.Index.TOKENIZED));
doc1.add(new Field("id", "term3", Field.Store.YES, Field.Index.TOKENIZED));

Vector2 =<"term4","term5","term6"> --->
doc2.add(new Field("id", "term4", Field.Store.YES, Field.Index.TOKENIZED));
doc2.add(new Field("id", "term5", Field.Store.YES, Field.Index.TOKENIZED));
doc2.add(new Field("id", "term6", Field.Store.YES, Field.Index.TOKENIZED));

and then get the similarity score between the two docs?
Please help.
Thanks in advance.
Kamal

Original Message:

As far as I know, you'd have to index one of the docs and then run a
query (the second doc) to get the similarity score.
Also, the default similarity takes more factors into account than the
regular VSM, so you'd have to look into that as well.
You could write code that creates a volatile index on the fly, runs a query,

Kamal Najib | 1 May 2009 11:16

Re: Re: How to get the similarity between two string vectors?

Thank you, Grant, for the reply.
I will have a look at the mentioned package and classes.
kamal.

Original Message:

Yes and no.  You can have a look at More Like This in the contrib
package.  Additionally, you can just get the TermVectors out of Lucene
and write your own.  You could also use a MemoryIndex which contains one
document, represent the other document as a query, and then "search".

But, no, there isn't a MethodName(Vector1, Vector2) option.

-Grant

On Apr 30, 2009, at 1:00 PM, Kamal Najib wrote:

> Hi,
> I am new to Lucene and I want to get the similarity between two
> vectors of strings. Is there a method that does that?
> For example, if I have the vectors:
> Vector1 :<"term1","term2","term3">
> Vector2:<"term4","term5","term5">
> is there a method to get the similarity between them in Lucene, or is
> there any other way to do it?
> For example: MethodName(Vector1,Vector2).
> Thanks in advance.
> kamal.
> --
>