karl wettin | 1 Feb 2007 01:48
Picon

Re: Field methods and usage

31 jan 2007 kl. 12.25 skrev Christoph Pächter:
>
> I was wondering, if there is anywhere a table (similar to Table 1.2  
> An overview
> of different field types, their characteristics, and their usage in  
> Lucene in
> Action), listing the possible methods and their usage.

Implementations will differ, for example:

>
> Store   |TermVector              |Index          |reasonable |Usage
> YES     |NO                      |NO             |1          |URLs
>                                                              | 
> telephone number

You never have to store anything in the index, perhaps that  
information is persistent somewhere else?

If you use a term vector or not depends very little on what kind of  
information you store in there, it is up to what analysis you plan to  
include the documents in. Highlighting? More like this? Neural networks?

Some are more than happy with one large token. Other people might  
want to tokenize the exact same information.

An URL in [protocol://host:port/path], a phone number in country-,  
area, and district parts.

It really up to each and every implementer to decide what settings is  
(Continue reading)

DECAFFMEYER MATHIEU | 1 Feb 2007 09:10
Picon

RE: Score

Thank u Chris for your support. 

__________________________________
Matt

-----Original Message-----
From: Chris Hostetter [mailto:hossman_lucene <at> fucit.org] 
Sent: Thursday, February 01, 2007 12:54 AM
To: java-user <at> lucene.apache.org
Subject: RE: Score

*****  This message comes from the Internet Network *****


: >>Have you looked at the constructor for BooleanQuery and
: >>tried passing "true" to disable the Coord factor?
:
: Thanks Chris, this is exactly what I want,
: but I am working with lucene 1.4.3 because I have to for some reasons,
:
: Is there any equivalent ?!

if you look atteh source for it, it's fairly trivial ... you should be
able to putthe same logic into a simple little helper function you use
when making BooleanQueries.

-Hoss

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe <at> lucene.apache.org
(Continue reading)

Antony Bowesman | 1 Feb 2007 09:51

Re: Boost/Scoring question

Hi Chris,

> : If I search for a document where the field boost is 0.0 then the document is not
> : found I just search that field.  Is this expected???
> 
> you mean you search on:       A^0    and get no results even though
> documents contain A, and if you search on:       +A^0  B^1     you see
> those documents?

It's the index time boost, rather than query time boost.  This short example 
shows the behaviour of searches for

+A
+A +B
+B

where A was indexed with boost 0.0 and B with 1.0

     IndexWriter writer = new IndexWriter(TestTools.getRoot(),
                                          new StandardAnalyzer(), true);
     Field f1 = new Field("subject", "subject - boost factor 0.0F",
                          Field.Store.YES, Field.Index.TOKENIZED);
     f1.setBoost(0.0F);
     Field f2 = new Field("body", "body - boost factor 1.0F", Field.Store.YES,
                           Field.Index.TOKENIZED);
     f2.setBoost(1.0F);
     Document doc = new Document();
     doc.add(f1);
     doc.add(f2);
     writer.addDocument(doc);
(Continue reading)

DECAFFMEYER MATHIEU | 1 Feb 2007 09:53
Picon

Deleting document by file name

Hi,

I have a list of filenames like
Corporate.htm
Logistics.htm
Merchant.htm
....
that need to be deleted.

For now on I  give this list to my Search application that reads the idnex and give the results, and if the path contains one of the filenames, I don't display this hit ... Not really proper programming ...

Is there a way to delete the document in the index instead with this information ?

Thank u.

__________________________________

   Matt



============================================
Internet communications are not secure and therefore Fortis Banque Luxembourg S.A. does not accept legal responsibility for the contents of this message. The information contained in this e-mail is confidential and may be legally privileged. It is intended solely for the addressee. If you are not the intended recipient, any disclosure, copying, distribution or any action taken or omitted to be taken in reliance on it, is prohibited and may be unlawful. Nothing in the message is capable or intended to create any legally binding obligations on either party and it is not intended to provide legal advice.
============================================
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe <at> lucene.apache.org
For additional commands, e-mail: java-user-help <at> lucene.apache.org
liquideshark | 1 Feb 2007 12:15
Picon

problem with field.setboost(5.0f) on lucene 2.00


iam building up a search engine using lucene 2.0, and iam having problem
using the term boost "setboost" a part of my code is :
and my code is :

doc.add(new
Field("title",httpd.getTitle(),Field.Store.YES,Field.Index.TOKENIZED ));
doc.getField("title").setboost(5.0f);//  <=== the boost wont update to 5.0
it remain 1.0
writer.addDocument(doc);
writer.optimize();
writer.close();

but when i look up in the index created the field title is still 1.0
can some one help me thx
--

-- 
View this message in context: http://www.nabble.com/problem-with-field.setboost%285.0f%29-on-lucene-2.00-tf3154250.html#a8746530
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
Saroja Kanta Maharana | 1 Feb 2007 12:44
Picon

Please Help me on Lucene

Hi All,

I am new to LUCENE,I have an query that, I have to index a Product table,
Suppose that table has following Columns.

             product [ prod_Id as primary key, prod_name, prod_price]

I have an requirement like if someone searches for prod_id or prod_price, I
have to fetch the rows( prod_id, prod_name, prod_price individualy row-wise)
matches the search condition from the indexed files.

I am n't understanding how to index the table,

 { Like i have create one Document object  and fields Object as columns
value }
Or
 { More than one document object and search through the Documents }

How to search Please help me in this regards.

Regards
Saroja kanta Maharana
Erick Erickson | 1 Feb 2007 14:08
Picon

Re: Deleting document by file name

Believe it or not, you delete items with IndexReader <G>. You can either
delete by document ID or by Term. Be aware that currently open searchers
will still find these documents (even after they have been deleted) until
the *searcher* is closed and reopened.

Erick

On 2/1/07, DECAFFMEYER MATHIEU <MATHIEU.DECAFFMAYER <at> fortis.lu> wrote:
>
>  Hi,
>
> I have a list of filenames like
> Corporate.htm
> Logistics.htm
> Merchant.htm
> ....
> that need to be deleted.
>
> For now on I  give this list to my Search application that reads the idnex
> and give the results, and if the path contains one of the filenames, I don't
> display this hit ... Not really proper programming ...
>
> Is there a way to delete the document in the index instead with this
> information ?
>
> Thank u.
>
> *__________________________________*
>
> *   Matt*******
>
>
> ============================================
> Internet communications are not secure and therefore Fortis Banque
> Luxembourg S.A. does not accept legal responsibility for the contents of
> this message. The information contained in this e-mail is confidential and
> may be legally privileged. It is intended solely for the addressee. If you
> are not the intended recipient, any disclosure, copying, distribution or any
> action taken or omitted to be taken in reliance on it, is prohibited and may
> be unlawful. Nothing in the message is capable or intended to create any
> legally binding obligations on either party and it is not intended to
> provide legal advice.
> ============================================
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe <at> lucene.apache.org
> For additional commands, e-mail: java-user-help <at> lucene.apache.org
>
poeta simbolista | 1 Feb 2007 14:11
Picon

Use of only a prohibit search


HI guys,
I've read on 
http://www.nabble.com/Using-NOT-queries-inside-parentheses-tf1234775.html#a3272973
this  forum about the problem of using, for example, only a prohibit search
such as

-description:plot

I really would like to get this done on a good manner, I am constructing
queries where elements may vary greatly. 
I am using 1.9.1 at the moment. Is the solution provided in the post I
linked still the best one "nowadays" (it was stated for 1.4) ?  I find that
a bit "hacky" for such an operation, though...

Thanks a lot!
 D
--

-- 
View this message in context: http://www.nabble.com/Use-of-only-a-prohibit-search-tf3154742.html#a8747868
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
Erick Erickson | 1 Feb 2007 14:12
Picon

Re: problem with field.setboost(5.0f) on lucene 2.00

I haven't played with boosts, but I suspect your ordering is wrong. You've
already added the field to the document before you set the boost. Try
Field f = new Field()...

f.setBoost()

doc.add(f).

writer.addDoc(doc)..

Best
Erick

On 2/1/07, liquideshark <tandimail <at> gmail.com> wrote:
>
>
> iam building up a search engine using lucene 2.0, and iam having problem
> using the term boost "setboost" a part of my code is :
> and my code is :
>
> doc.add(new
> Field("title",httpd.getTitle(),Field.Store.YES,Field.Index.TOKENIZED ));
> doc.getField("title").setboost(5.0f);//  <=== the boost wont update to 5.0
> it remain 1.0
> writer.addDocument(doc);
> writer.optimize();
> writer.close();
>
> but when i look up in the index created the field title is still 1.0
> can some one help me thx
> --
> View this message in context:
> http://www.nabble.com/problem-with-field.setboost%285.0f%29-on-lucene-2.00-tf3154250.html#a8746530
> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe <at> lucene.apache.org
> For additional commands, e-mail: java-user-help <at> lucene.apache.org
>
>
liquideshark | 1 Feb 2007 14:50
Picon

Re: problem with field.setboost(5.0f) on lucene 2.00


Yes you are right 
but i have change it to:
------------------------------------------------
        Field tiTle = new
Field("title",httpd.getTitle(),Field.Store.YES,Field.Index.TOKENIZED );
        tiTle.setBoost(6.1f);
        doc.add(tiTle);
-----------------------------------------------
it still dont make any change on the boost value, for information i use
luke.jar to see if the value had change

nice reading you again
      Tandina

Erick Erickson wrote:
> 
> I haven't played with boosts, but I suspect your ordering is wrong. You've
> already added the field to the document before you set the boost. Try
> Field f = new Field()...
> 
> f.setBoost()
> 
> doc.add(f).
> 
> writer.addDoc(doc)..
> 
> Best
> Erick
> 
> On 2/1/07, liquideshark <tandimail <at> gmail.com> wrote:
>>
>>
>> iam building up a search engine using lucene 2.0, and iam having problem
>> using the term boost "setboost" a part of my code is :
>> and my code is :
>>
>> doc.add(new
>> Field("title",httpd.getTitle(),Field.Store.YES,Field.Index.TOKENIZED ));
>> doc.getField("title").setboost(5.0f);//  <=== the boost wont update to
>> 5.0
>> it remain 1.0
>> writer.addDocument(doc);
>> writer.optimize();
>> writer.close();
>>
>> but when i look up in the index created the field title is still 1.0
>> can some one help me thx
>> --
>> View this message in context:
>> http://www.nabble.com/problem-with-field.setboost%285.0f%29-on-lucene-2.00-tf3154250.html#a8746530
>> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe <at> lucene.apache.org
>> For additional commands, e-mail: java-user-help <at> lucene.apache.org
>>
>>
> 
> 

--

-- 
View this message in context: http://www.nabble.com/problem-with-field.setboost%285.0f%29-on-lucene-2.00-tf3154250.html#a8748508
Sent from the Lucene - Java Users mailing list archive at Nabble.com.

Gmane