Jean-Francois Dockes | 7 Nov 2005 11:58
Picon

submitting a xapian entry to the FreeBSD ports collection


Hello,

Would anyone object to my submitting an entry for xapian-core to the
FreeBSD ports collection ? This would make it even easier for any FreeBSD
user to install a xapian-based application.

By the way, it does not seem quite right that xapian-core 0.9 will install
commands with such generic names as 'copydatabase' or 'simplesearch' to
/usr/local/bin by default. Either a renaming or a configure option to
control the installation for the examples directory would be really nice.

Would you accept a patch to allow configure to disable the building and
installation of the examples directory (ie: --disable-examples) ?

Regards,

J.F. Dockes
Olly Betts | 7 Nov 2005 13:42
Favicon
Gravatar

Re: submitting a xapian entry to the FreeBSD ports collection

On Mon, Nov 07, 2005 at 11:58:17AM +0100, Jean-Francois Dockes wrote:
> Would anyone object to my submitting an entry for xapian-core to the
> FreeBSD ports collection ?

Quite the contrary!

Let me know when you have a URL and I'll add it to the download page.

> By the way, it does not seem quite right that xapian-core 0.9 will install
> commands with such generic names as 'copydatabase' or 'simplesearch' to
> /usr/local/bin by default. Either a renaming or a configure option to
> control the installation for the examples directory would be really nice.

I didn't think we were installing the simple* examples (they're not
really useful as tools, only as example code), but it seems we are.  I
think we should just stop installing them.

> Would you accept a patch to allow configure to disable the building and
> installation of the examples directory (ie: --disable-examples) ?

I think delve and quest are useful to install.  Probably copydatabase too,
since it does allow you to combine databases and copy a database to a
different format (e.g. quartz to flint, or muscat 3.6 to quartz).

I've no objection to renaming copydatabase to xapian-copydatabase or
something else less generic.

Cheers,
    Olly
(Continue reading)

Jean-Francois Dockes | 7 Nov 2005 18:45
Picon

Re: submitting a xapian entry to the FreeBSD ports collection


Ok, then here is what I propose to do:

 - I shall set up the FreeBSD port so that it disables the standard install in
   the examples directory (by trivially patching Makefile.in after
   extracting the source), then install copydatabase, delve and quest with
   a xapian- prefix
 - I shall submit the port to the appropriate FreeBSD queue. I am not a
   FreeBSD committer and I do not know how much time it will take to get
   into the official ports tree.

If/when the example program names are fixed so that they are less likely to
cause trouble / confusion, I'll adjust the port in consequence.

Xapian builds and works perfectly under FreeBSD as far as I know, so the
goal here is to get it into the automated ports/packages system so that
FreeBSD users will be able to use the automated system tools to
download/build/install xapian, as they do for ie, MySQL.

J.F. Dockes

Olly Betts writes:
 > On Mon, Nov 07, 2005 at 11:58:17AM +0100, Jean-Francois Dockes wrote:
 > > Would anyone object to my submitting an entry for xapian-core to the
 > > FreeBSD ports collection ?
 > 
 > Quite the contrary!
 > 
 > Let me know when you have a URL and I'll add it to the download page.
 > 
(Continue reading)

Jean-Francois Dockes | 11 Nov 2005 22:19
Picon

Re: submitting a xapian entry to the FreeBSD ports collection

For your information, xapian-core just went into the FreeBSD ports tree:
http://www.freebsd.org/cgi/query-pr.cgi?pr=88609

Cheers,
JF

Olly Betts writes:
 > On Mon, Nov 07, 2005 at 11:58:17AM +0100, Jean-Francois Dockes wrote:
 > > Would anyone object to my submitting an entry for xapian-core to the
 > > FreeBSD ports collection ?
 > 
 > Quite the contrary!
 > 
 > Let me know when you have a URL and I'll add it to the download page.
Olly Betts | 12 Nov 2005 00:31
Favicon
Gravatar

Re: submitting a xapian entry to the FreeBSD ports collection

On Fri, Nov 11, 2005 at 10:19:38PM +0100, Jean-Francois Dockes wrote:
> For your information, xapian-core just went into the FreeBSD ports tree:
> http://www.freebsd.org/cgi/query-pr.cgi?pr=88609

I've added some information to the download page.  I've not used FreeBSD
ports, but I guess you install them using some utility rather than by
downloading from a URL.

I added a link to here, which seemed more informative than the PR link
above:

http://www.freebsd.org/cgi/ports.cgi?query=xapian&stype=name

If there's something better, let me know.

Cheers,
    Olly
Olly Betts | 12 Nov 2005 00:47
Favicon
Gravatar

Re: submitting a xapian entry to the FreeBSD ports collection

On Mon, Nov 07, 2005 at 06:45:20PM +0100, Jean-Francois Dockes wrote:
>  - I shall set up the FreeBSD port so that it disables the standard install in
>    the examples directory (by trivially patching Makefile.in after
>    extracting the source), then install copydatabase, delve and quest with
>    a xapian- prefix

In the longer term I'd greatly prefer to not have differences between
the "vanilla" version and the various packaged versions, unless there's
a really good reason.  It just makes it harder to write documentation,
scripts, etc.  But in the short term, I've certainly no problem with
this.

> Xapian builds and works perfectly under FreeBSD as far as I know

Yes, I do regular automated builds with testsuite runs on various
releases of FreeBSD (x86 and alpha).

Cheers,
    Olly
arjan holscher | 24 Nov 2005 14:44
Picon
Favicon

Case sensitive search

Hi all,

I´ve been developing a search application using Xapian and Omega. When our visitors search on specific keywords it´s noticable that Omega is case sensitive. It will find results on keyword `Asus´ and it will NOT find results on keyword ´asus´.

I´ve been looking through the documentation to find a CGI param to turn this off. I cannot find it in the omega docs or in the scriptindex docs. Is there such a setting/parameter? If so, where can I find it and how do I use it. If not, what do I need to modify in scriptindex and omega to make this possible?

NOTE: I'm passing in the documents as they are in the database. Meaning that the documents contain words with capital letters. I don't know if this is causing the problem? Do I need to make the input to scriptindex lowercase?

Thx in a dvance,

Arjan Holscher

Yahoo! Music Unlimited - Access over 1 million songs. Try it free.
_______________________________________________
Xapian-devel mailing list
Xapian-devel <at> lists.xapian.org
http://lists.xapian.org/mailman/listinfo/xapian-devel
James Aylett | 24 Nov 2005 15:26

Re: Case sensitive search

On Thu, Nov 24, 2005 at 05:44:16AM -0800, arjan holscher wrote:

> I&#180;ve been developing a search application using Xapian and
> Omega. When our visitors search on specific keywords it&#180;s
> noticable that Omega is case sensitive. It will find results on
> keyword `Asus&#180; and it will NOT find results on keyword
> &#180;asus&#180;.

The way omega works is for all words to be normalised to lower case,
but also to add a 'raw' term for every word that starts with an upper
case letter. Raw terms are prefixed with 'R'.

This should be working properly with scriptindex as well, providing
you're using ``index'' or ``indexpos''.

> NOTE: I'm passing in the documents as they are in the
> database. Meaning that the documents contain words with capital
> letters. I don't know if this is causing the problem? Do I need to
> make the input to scriptindex lowercase?

What does you scriptindex index script look like?

J

--

-- 
/--------------------------------------------------------------------------\
  James Aylett                                                  xapian.org
  james <at> tartarus.org                               uncertaintydivision.org
Olly Betts | 24 Nov 2005 15:44
Favicon
Gravatar

Re: Case sensitive search

On Thu, Nov 24, 2005 at 02:26:12PM +0000, James Aylett wrote:
> On Thu, Nov 24, 2005 at 05:44:16AM -0800, arjan holscher wrote:
> 
> > I&#180;ve been developing a search application using Xapian and
> > Omega. When our visitors search on specific keywords it&#180;s
> > noticable that Omega is case sensitive. It will find results on
> > keyword `Asus&#180; and it will NOT find results on keyword
> > &#180;asus&#180;.
> 
> The way omega works is for all words to be normalised to lower case,
> but also to add a 'raw' term for every word that starts with an upper
> case letter. Raw terms are prefixed with 'R'.

That's right, and searching for a capitalised word searches for the raw
form by default, which is the behaviour Arjan is describing.

You can tell Omega to ignore the R terms and always use the stemmed form
by adding "$set{stem_all,true}" to the top of your omegascript query
template (or templates if you're using more than one).

If you want to eliminate the R terms from the database, you'll have to
delete the code in indextext.cc where add_term or add_posting is called
with "rprefix + term".  That code is used by both omindex and
scriptindex.

> > NOTE: I'm passing in the documents as they are in the
> > database. Meaning that the documents contain words with capital
> > letters. I don't know if this is causing the problem? Do I need to
> > make the input to scriptindex lowercase?

Actually, you could lowercase in the scriptindex script instead of
changing indextext.cc.  Just add "lower" before "index" (or
"indexnopos").  You probably want the "lower" after any "field" though,
unless you want to value in the field also in lowercase (you probably
don't want to lowercase the document title or sample used in the result
display...)

The raw term mechanism is intended to allow searching for propernames
which are sometimes conflated with common words by stemming.  But it
isn't perfect.  This is something which stemming at search time would
help.

Cheers,
    Olly
Diane Kaplan | 27 Nov 2005 09:09

[PHP5] Xapian not compiling

 
_______________________________________________
Xapian-devel mailing list
Xapian-devel <at> lists.xapian.org
http://lists.xapian.org/mailman/listinfo/xapian-devel

Gmane