Pierre-Louis Dehapiot | 26 Oct 18:55 2010
Picon

Opensource Websearch Engine Project


Hi,

I'm Pierre-Louis Dehapiot from Paris, France. I am studying computing programming at the ECE (a french
school) and this year, the topic of my project is "google and indexing".
To summarize, it deals with creating my own google in only one year :p !
I saw that you made yourself an opensource websearch engine written in C (Xapian).
I already made the php/CSS interface for my own project only in French for the moment but in English soon !
(you can have a look here : http://pti.pl4tipus.com)
As you can see, it's very "google-like" : this is what the topic deals with.
If you have few minutes to answer me, I think I need some tips about "how to make an indexing engine".
I know how it works approximately but i need more details about the difficulties of the project. All the tips
you can give me can be very useful.
Can you help me ?" 
I am glad of your future support.

Pierre-Louis Dehapiot
Charlie Hull | 27 Oct 10:15 2010
Picon

Re: Opensource Websearch Engine Project

On 26/10/2010 17:55, Pierre-Louis Dehapiot wrote:
>
> Hi,
>
> I'm Pierre-Louis Dehapiot from Paris, France. I am studying computing programming at the ECE (a french
school) and this year, the topic of my project is "google and indexing".
> To summarize, it deals with creating my own google in only one year :p !
> I saw that you made yourself an opensource websearch engine written in C (Xapian).
> I already made the php/CSS interface for my own project only in French for the moment but in English soon !
(you can have a look here : http://pti.pl4tipus.com)
> As you can see, it's very "google-like" : this is what the topic deals with.
> If you have few minutes to answer me, I think I need some tips about "how to make an indexing engine".
> I know how it works approximately but i need more details about the difficulties of the project. All the
tips you can give me can be very useful.
> Can you help me ?"
> I am glad of your future support.
>
> Pierre-Louis Dehapiot

Hi Pierre,

(Apologies, I posted this to xapian-discuss by mistake)

You may be interested to know that Xapian was originally created to 
power a web search engine (half a billion web pages or thereabouts).

You've got a pretty steep learning curve to be honest: you're first 
going to need to learn about web crawling (note that Xapian does not 
include a web crawler, although there are plenty of open source ones out 
there - Heretrix is a good example), and how to keep your index clean 
(Continue reading)


Gmane