James Aylett | 1 Aug 10:41 2006

Re: How to search URL field (unique Q key)?

On Mon, Jul 31, 2006 at 09:51:08AM -0800, oscaruser <at> programmer.net wrote:

> If I'm not mistaken, the approach of printing all of the Q terms
> basically dumps the database contents, but does no searching. If the
> URL field was set to unique, I would imagine that there is a method
> of searching against the key. Based on the functionality of
> scriptindex, it determines whether to the record already exists or
> not before updating or adding to the index. This latter
> functionality is the one I want to utilize because based on the
> nature of the datastructure and keys, searching should be very
> fast. I'll check the scriptindex sources to see if I can understand
> what's going on there.

Yes, possibly I solved your problem rather than answered your
question. If you want to know if a particular term exists in the
database or not, use Xapian::Database::term_exists(). That's painful
by comparison with just doing what I suggested to get all the URLs
that have been indexed out (unless you already have a rig for deciding
whether to add a particular document or not).

(scriptindex may just use replace_document(), I can't actually remember.)

James

--

-- 
/--------------------------------------------------------------------------\
  James Aylett                                                  xapian.org
  james <at> tartarus.org                               uncertaintydivision.org
Vizcayno Tamparantan | 3 Aug 06:21 2006
Picon

An error while testing Python 2.5 and xapian in Win32

Hi:
After a succesful installation of .dll xapian using VS2005 and 
swigwin-1.3.29, I attempted to do my first tests, however, I had the next 
problem:

.python
Python 2.5b2 (r25b2:50512, Jul 11 2006, 10:16:14) [MSC v.1310 32 bit 
(Intel)] on
win32
Type "help", "copyright", "credits" or "license" for more information.
>>>
>>>import sys
>>>sys.path.append("C:/Python25/Lib/site-packages/Xapian")
>>>from _xapian import *
>>>from _xapian import new_WritableDatabase, new_Document
>>>w=new_WritableDatabase("db",DB_CREATE_OR_OPEN)
>>>d=new_Document()
>>>del d
swig/python detected a memory leak of type 'Xapian::Document *', no 
destructor found.
>>>
If a try to print d, windows crashes indicating:
Faulting application python.exe, version 0.0.0.0, faulting module ntdll.dll, 
version 5.1.2600.2180, fault address 0x00018fea.

What do yot think is happening?
Thanks!!

_________________________________________________________________
Las mejores tiendas, los precios mas bajos, entregas en todo el mundo, 
(Continue reading)

Charlie Hull | 4 Aug 12:41 2006
Picon

Xapian article in PHP Architect

Hi all,

I've just seen a great article in PHP Architect all about Xapian, 
written by Marco Tabini who set up the BeebleX PHP search site.

"..This level of specialization has allowed the developers of Xapian to 
implement some features that, for a lack of a more technical term, can 
only be defined as "extremely cool":.....'

It's a great introduction to Xapian and how to do basic indexing and 
searching, even if you're not a PHP user. You can get the relevant back 
issue of the magazine here:
https://www.phparch.com/issue.php?mid=59

Cheers

Charlie
www.lemurconsulting.com
Fernando Nemec | 4 Aug 22:45 2006
Picon

Xaping perl binding install problems

Hi,

I'm trying to install Search::Xapian on a FC3 box but I can't figure
out what's going wrong.

I've installed the last xapian version and also downloaded the last
Search::Xapian version.

When I tried to run simplesearch.pl I got the error:

Can't load
'/usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi/auto/Search/Xapian/Xapian.so'
for module Search::Xapian: libxapian.so.11: cannot open shared object
file: No such file or directory at
/usr/lib/perl5/5.8.5/i386-linux-thread-multi/DynaLoader.pm line 230.

I'm using Perl 5.8.5 and gcc 3.4.4.

Thanks in advanced for any help,

Nemec
James Aylett | 5 Aug 01:08 2006

Re: Xaping perl binding install problems

On Fri, Aug 04, 2006 at 05:45:22PM -0300, Fernando Nemec wrote:

> I'm trying to install Search::Xapian on a FC3 box but I can't figure
> out what's going wrong.
> 
> I've installed the last xapian version and also downloaded the last
> Search::Xapian version.
> 
> When I tried to run simplesearch.pl I got the error:
> 
> Can't load
> '/usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi/auto/Search/Xapian/Xapian.so'
> for module Search::Xapian: libxapian.so.11: cannot open shared object
> file: No such file or directory at
> /usr/lib/perl5/5.8.5/i386-linux-thread-multi/DynaLoader.pm line 230.
> 
> I'm using Perl 5.8.5 and gcc 3.4.4.

What were your install/configuration options for xapian? Looks like it
isn't being found on your library path, and there are a couple of ways
to deal with that.

James

--

-- 
/--------------------------------------------------------------------------\
  James Aylett                                                  xapian.org
  james <at> tartarus.org                               uncertaintydivision.org
James Aylett | 6 Aug 23:21 2006

Re: How to use omega to search remote back end?

On Sun, Aug 06, 2006 at 12:19:12PM -0800, oscaruser <at> programmer.net wrote:

> Whoops took a while to figure out, but "database_dir" should be set
> to "/svr/xapian", then in the omega call, i specify it as "./omega
> 'DB=stub'" where 'stub' file lives in /svr/xapian

Yeah - the idea is that you can have a directory full of a combination
of flint db directories and stub files, and switch between them - or
use a combination - easily.

James

--

-- 
/--------------------------------------------------------------------------\
  James Aylett                                                  xapian.org
  james <at> tartarus.org                               uncertaintydivision.org
Jeff Breidenbach | 7 Aug 04:23 2006
Picon

htdig_noindex markup

I'm playing with Xapian Omega for the first time
today, lots of fun. And very smooth.

Does it make sense for Xapian Omega to honor
HtDig's markup for "don't index this region of the
document" ?

http://www.htdig.org/attrs.html#noindex_start

I'm a little biased in the matter as I have lots of
marked up documents from the htdig 3.1 days,
but maybe it just makes sense in general to honor
this sort of thing.

Jeff
oscaruser | 7 Aug 05:46 2006
Picon

Search with symbols causes search time to hemorrhage

Folks,

Searching for terms like with non-alpha numerical symbols causes great delays before search results
appears. I am searching 5 M pages (~76 GB) of shopping site web data for things like "Men's Levi's Low Rise
Boot Cut 527 Jeans - Downtown", which has symbols " ' ", "-". The xapian DB is on a fast SCSI RAID 0, dual Xeon
configuration, but still I see long search times e.g. "Search took 166.606332 seconds". If I remove these
symbols and replace them with space, the search times are good (subsecond). However if there are any weird
symbols in the search string, then it takes a very long time. Is there anything that I can do about this, so
that I still am searching using the special symbols, but the result time is reduced?

Thanks,
-OSC

--

-- 
___________________________________________________
Play 100s of games for FREE! http://games.mail.com/
Jeff Breidenbach | 7 Aug 07:45 2006
Picon

Omega is fast, but not THAT fast

>Search took -125.376129 seconds

I double checked with a handheld stopwatch, and at no
point did the hands spin backwards.

Known problem?
richard | 7 Aug 09:49 2006

Re: Omega is fast, but not THAT fast

On Sun, Aug 06, 2006 at 10:45:09PM -0700, Jeff Breidenbach wrote:
> >Search took -125.376129 seconds
> 
> I double checked with a handheld stopwatch, and at no
> point did the hands spin backwards.
> 
> Known problem?

Not one I have seen.  Do you have a reproducible test case?

--

-- 
Richard

Gmane