Shamik Bandopadhyay | 30 Sep 23:51 2014
Picon

Boost Query (bq) syntax/usage

Hi,

  I'm little confused with the right syntax of defining boost queries. If I
use them in the following way:

http://localhost:8983/solr/testhandler?q=Application+Manager&bq=(Source2:sfdc^6
Source2:downloads^5 Source2:topics^3)&debugQuery=true

it gets translated to -->

<arr name="parsed_boost_queries">
   <str>
       +Source2:sfdc^6.0 +Source2:downloads^5.0 +Source2:topics^3.0
   </str>
</arr>

Now, if I use the following query:

http://localhost:8983/solr/testhandler?q=Application+Manager&bq=Source2:sfdc
^6&bq=Source2:downloads^5&bq=Source2:topics^3&debugQuery=true

gets translated as -->

<arr name="parsed_boost_queries">
    <str>Source2:sfdc^6.0</str>
    <str>Source2:downloads^5.0</str>
    <str>Source2:topics^3.0</str>
</arr>

Both queries generate different result in terms of relevancy. Just
(Continue reading)

shamik | 30 Sep 23:38 2014
Picon

Boost Query (bq) syntax/usage

Hi,

  I'm little confused with the right syntax of defining boost queries. If I
use them in the following way:

http://localhost:8983/solr/testhandler?q=Application+Manager&bq=(Source2:sfdc^6
Source2:downloads^5 Source2:topics^3)&debugQuery=true

it gets translated to -->

<arr name="parsed_boost_queries">
   <str>
       +Source2:sfdc^6.0 +Source2:downloads^5.0 +Source2:topics^3.0
   </str>
</arr>

Now, if I use the following query:

http://localhost:8983/solr/testhandler?q=Application+Manager&bq=Source2:sfdc^6&bq=Source2:downloads^5&bq=Source2:topics^3&debugQuery=true

gets translated as -->

<arr name="parsed_boost_queries">
    <str>Source2:sfdc^6.0</str>
    <str>Source2:downloads^5.0</str>
    <str>Source2:topics^3.0</str>
</arr>

Both queries generate different result in terms of relevancy. Just wondering
what is the right way of using bq ?
(Continue reading)

Luis Carlos Guerrero | 30 Sep 19:39 2014
Picon

Syllabification, readability metric

Hi,

Does Lucene support syllabification of words out of the box? If so is there
support for brazilian portuguese? I'm trying to setup a readability score
for short text descriptions and this would be really helpful.

thanks,

--

-- 
Luis Carlos Guerrero
about.me/luis.guerrero
Chris Johnson | 30 Sep 17:41 2014

complexphrase with wildcards

https://gist.github.com/kindkid/c9f0ed9ee417064c1245

I'm using Solr 4.10.0, and getting a couple of error messages for
invalid complexphrase queries that I don't understand. Are these known
bugs or am I just doing something wrong?

Relevant portion of schema.xml...
=================================

  <fieldType name="summaryText" class="solr.TextField"
positionIncrementGap="100" required="false" indexed="true"
stored="false" multiValued="true" omitNorms="true">
    <analyzer type="index">
      <tokenizer class="solr.WhitespaceTokenizerFactory"/>
      <filter class="solr.LowerCaseFilterFactory"/>
      <filter class="solr.ReversedWildcardFilterFactory"
withOriginal="true" maxPosAsterisk="1" maxPosQuestion="1"/>
    </analyzer>
    <analyzer type="query">
      <tokenizer class="solr.WhitespaceTokenizerFactory"/>
      <filter class="solr.LowerCaseFilterFactory"/>
    </analyzer>
  </fieldType>

  <field name="summary" type="summaryText"/>

Some valid queries, for comparison...
=====================================

summary:arrrr
(Continue reading)

Andreas Hubold | 30 Sep 17:14 2014

dismax query does not match with additional field in qf

Hi,

I ran into a problem with the Solr dismax query parser. We're using Solr 
4.10.0 and the field types mentioned below are taken from the example 
schema.xml.

In a test we have a document with rather strange content in a field 
named "name_tokenized" of type "text_general":

abc_<iframe src='loadLocale.js' onload='javascript:document.XSSed="name"' width=0 height=0>

(It's a test for XSS bug detection, but that doesn't matter here.)

I can find the document when I use the following dismax query with qf 
set to field "name_tokenized" only:

http://localhost:44080/solr/studio/editor?deftype=dismax&q=abc_%3Ciframe+src%3D%27loadLocale.js%27+onload%3D%27javascript%3Adocument.XSSed%3D%22name%22%27&debug=true&echoParams=all&qf=name_tokenized^2

If I submit exactly the same query but add another field "feederstate" 
to the qf parameter, I don't get any results anymore. The field is of 
type "string".

http://localhost:44080/solr/studio/editor?deftype=dismax&q=abc_%3Ciframe+src%3D%27loadLocale.js%27+onload%3D%27javascript%3Adocument.XSSed%3D%22name%22%27&debug=true&echoParams=all&qf=name_tokenized^2%20feederstate

The decoded value of q is: abc_<iframe src='loadLocale.js' 
onload='javascript:document.XSSed="name"' and it seems the trailing 
single-quote causes problems here. (In fact, I can find the document 
when I remove the last char)
The parsed query for the latter case is

(Continue reading)

J'roo | 30 Sep 17:08 2014
Picon

Search multiple values with wildcards

Hi,

I am using Solr 3.5.0 with JavaClient SolrJ which I cannot change.

I have following type of docs:

<doc>
<str
name="proprietaryMessage_tis">:20:13-900-C05-P001:21:REF12349:25:23456789:32A:130202USD100,00:52A:/123456</str>
</doc>

I want to be able to find docs containing :25:234* AND :32A:1302* using
wildcards, which I thought to do like:

&q=proprietaryMessage_tis:(\:25\:23456*+\:32A\:130202US*)

But this doesn't work. Have tried many variations, anyone got a good tip for
me?

Thanks!

--
View this message in context: http://lucene.472066.n3.nabble.com/Search-multiple-values-with-wildcards-tp4161916.html
Sent from the Solr - User mailing list archive at Nabble.com.

Michael | 30 Sep 15:02 2014
Picon

How can I pass in query request parameter at search time and know of it in my query analyzer/tokenizer?

Hi all,

I'm using Solr 4.7.2 to implement multilingual search in my application.

I have a need to pass in query locale on search request and to choose
between custom tokenizers dynamically based on provided locale value.

In Solr In Action - Chapter 14 (Multilingual Search), Listing 14.9 -
*Indexing and querying multiple languages within the same field*
MultiFieldTextTokenizer allow specifying query language as a prefix to the
terms that goes into the analyzer. For example: q=en,fr,es|abandon AND
en,fr,es|understanding AND en,fr,es|sagess.

From one side I have only one language per query, and from the other side I
allow users to use Lucene query syntax in queries including multi-term
queries. Therefore it seems that I have to do the nontrivial work of
parsing the user query according to the query parser rules and add the
prefix everywhere it's needed.
For example consider this user entered query: *one AND (two OR
field2:three)*
This will need to be non-trivially translated into:

*en:one AND (en:two OR field2:en|three). *
Is there other conventional way to pass in language string (one per search
request) to query analyzer/tokenizer ?

Thanks in advance,
Michael
Paul Bellora | 30 Sep 05:02 2014
Picon

Way to determine which terms of an OR query matched each document

My application typically searches Solr by strictly ANDing all terms.
However in some cases of too few results being returned I'd like to relax
the query by switching to OR and setting an appropriate mm ("min match")
value. The problem is that I want it to be clear to the user which terms
are missing from each result of these min match queries. See Google for
example
<https://www.google.com/search?q=monkey%20france%20tuba%20lymphocyte>,
which shows "Missing: foo" under certain results.

Highlighting seems like the closest solution but it's too heavyweight for
my use case. There's no need for the extra analysis to show the terms in
the context of a block of text - I simply want a set of matching terms (or
non-matching terms) per document. Are there other Solr features available
that I could leverage to do this?

Paul
Shamik Bandopadhyay | 30 Sep 00:29 2014
Picon

How to query certain fields filtered by a condition

Hi,

  Just wanted to understand if it's possible to limit a searchable field
only to specific documents during query time. Following are my searchable
fields.

<str name="qf">text^0.5 title^10.0 country^1.0</str>

What I want is to make country a searchable field only for documents which
contain "author:Robert". For remaining documents, "country" should not be
considered as a searchable field, only text and title will come into play.
So If I search for "usa", it should bring result from documents where
author=Robert (by matching country field), but not for remaining authors
even if they've a country field with value "usa".

I don't how it can be done during query time or if it's possible at all
through some function queries. The other option is to add the country value
as part of title or text for documents containing Author:Robert during
index time. But I would like to know if its possible during query time.

Appreciate your feedback.

-Thanks,
Shamik
Michael McCandless | 29 Sep 22:03 2014

[ANNOUNCE] Apache Solr 4.10.1 released

September 2014, Apache Solr™ 4.10.1 available

The Lucene PMC is pleased to announce the release of Apache Solr 4.10.1

Solr is the popular, blazing fast, open source NoSQL search platform
from the Apache Lucene project. Its major features include powerful
full-text search, hit highlighting, faceted search, dynamic
clustering, database integration, rich document (e.g., Word, PDF)
handling, and geospatial search. Solr is highly scalable, providing
fault tolerant distributed search and indexing, and powers the search
and navigation features of many of the world's largest internet sites.

Solr 4.10.1 is available for immediate download at:

    http://lucene.apache.org/solr/mirrors-solr-latest-redir.html

Solr 4.10.1 includes 6 bug fixes, as well as Lucene 4.10.1 and its 7 bug fixes.

See the CHANGES.txt file included with the release for a full list of
changes and further details.

Please report any feedback to the mailing lists
(http://lucene.apache.org/solr/discussion.html)

Note: The Apache Software Foundation uses an extensive mirroring
network for distributing releases. It is possible that the mirror you
are using may not have replicated the release yet. If that is the
case, please try another mirror. This also goes for Maven access.

Mike McCandless
(Continue reading)

Shamik Bandopadhyay | 29 Sep 21:29 2014
Picon

Solr query field (qf) conditional boost

Hi,

  I'm trying to check if it's possible to include a conditional boosting in
Solr qf field. For e.g. I've the following entry in qf parameter.

<str name="qf">text^0.5 title^10.0 ProductLine^5</str>

What I'm looking is to add the productline boosting only for a given Author
field, something in the lines boost ProductLine^5 if Author:"Tom".

I've been using a similar filtering in "appends" section, but not sure how
to do it in "qf or whether it's possible.

<lst name="appends">
<str name="fq">Author:("Tom"  +Solution:"yes") </str>
</lst>

Any pointers will be appreciated.

Thanks,
Shamik

Gmane