Vinai | 13 Jan 11:56 2003

couldn't find inv_index file

Hi all,
 I am  a new user to perlfect search. I am facing lots of problem when I tried to use it. I need all ur help to solve the problems.
 
 First,
 I couldn't install search.pl. It is saying that DB_File.pm is not found. I am having SDBM_File.pm. Whether DB_File.pm is mandatory or what? If so how to install it.
 When I tried installing DB_File.pm it is saying that Berkley DB needs to be installed. I downloaded that Berkley DB and built the win32 work space. what should I do after that?
 I am totally lost. Pls. help me to solve the issues as early as possible.
 
rgds,
vinai
**************************Disclaimer**************************************************    

 Information contained in this E-MAIL being proprietary to Wipro Limited is 'privileged' 
and 'confidential' and intended for use only by the individual or entity to which it is 
addressed. You are notified that any use, copying or dissemination of the information 
contained in the E-MAIL in any manner whatsoever is strictly prohibited.

****************************************************************************************

Daniel Naber | 13 Jan 13:40 2003
Picon

Re: couldn't find inv_index file

On Monday 13 January 2003 11:56, Vinai wrote:

>  I couldn't install search.pl. It is saying that DB_File.pm is not
> found. I am having SDBM_File.pm. Whether DB_File.pm is mandatory or
> what?

Yes, here are instructions on how to install it:
http://www.perlfect.com/pipermail/perlfect-search//2000-September/000182.html

regards
 Daniel

--

-- 
http://www.danielnaber.de

_______________________________________________
perlfect-search mailing list
perlfect-search <at> perlfect.com
To unsubscribe, set other personal options or view the list archives please visit:
http://perlfect.com/mailman/listinfo/perlfect-search


Scott Sevener | 13 Jan 17:24 2003

drop-down menu indexing

First of all, many kudos to the creators of this script - we've probably been through a dozen different ones trying to find one that could parse php correctly and it looks like we finally found a winner!  The only problem remaining is this...
 
Our site uses a simple drop-down menu / javascript combination for navigation through our archives and I already know that this script doesn't seem to get along with javascript very well.  However, I tried simply switching the javascript to both perl and php redirection scripts, and perlfect still doesn't want to index any of these files.  Is there anything that can be done to the code itself to 'persuade' it to read into the form tags setup for this type of menu, or are we simply going to have to reorganize these pages to adjust for the script?
 
(And if that's the case, does anybody have any other ideas that would work in this situation other than a simple bulleted-list?)
 
The pages in question start here:
 
Thanks in advance!  Best wishes. 
+-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-+
        Scott Sevener
        (Scott.Sevener <at> JustLaugh.com)
 
        Editor-in-Chief, Just Laugh magazine
        http://www.justlaugh.com
 
        "If we couldn't laugh, we would all go insane..."
                         --Jimmy Buffett
+-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-+
Daniel Naber | 13 Jan 17:43 2003
Picon

Re: drop-down menu indexing

On Monday 13 January 2003 17:24, Scott Sevener wrote:

> Our site uses a simple drop-down menu / javascript combination for
> navigation through our archives and I already know that this script
> doesn't seem to get along with javascript very well.

No search engine can index javascript, because Javascript is mostly for 
interactive elements, and the search robots don't trigger these elements. 
The best solution in this case is to use a common list instead of the drop 
down menu: a list has better usability (you see everything at once without 
clicking in the list) and it works for people who have javascript disabled 
(who currently cannot access these pages either).

If you absolutely want to stick with the menu: add a hidden section with 
links like this: <div style="visibility:hidden"><a 
href="link.html">1</a></div>

Regards
 Daniel

--

-- 
http://www.danielnaber.de

_______________________________________________
perlfect-search mailing list
perlfect-search <at> perlfect.com
To unsubscribe, set other personal options or view the list archives please visit:
http://perlfect.com/mailman/listinfo/perlfect-search


ALAN | 24 Jan 12:22 2003

I have a question about php,please help me

I use perlfect to index the php files via http,but it seem only index the first page(default.php)!
My website is www.power198.com
password is "123"
 
Please test it and tell my why?
 
conf.pl file follow:
 
$DOCUMENT_ROOT = '/var/website/power198/';
 
$BASE_URL = 'http://www.power198.com';
 
$CGIBIN = 'http://www.power198.com/cgi-local/perlfect/search/';

$INSTALL_DIR = '/var/website/cgi-bin/power198/perlfect/search/';
 
<at> EXT = ("htm","html","php");
 
$INDEXER_CGI_PASSWORD = "123";
 
$HTTP_START_URL = 'http://www.power198.com/';
 
$HTTP_MAX_PAGES = 60000;
 
$HTTP_SERVER_ROOT = $DOCUMENT_ROOT;
 
<at> HTTP_LIMIT_URLS = ($HTTP_START_URL);
 
$HTTP_FOLLOW_COMMENT_LINKS = 1;

<at> HTTP_CONTENT_TYPES = ('text/html', 'text/plain');
 
$HTTP_DEBUG = 1;
 
Thank you very much!
 
alan
2003-1-24
Daniel Naber | 24 Jan 14:10 2003
Picon

Re: I have a question about php,please help me

On Friday 24 January 2003 12:22, ALAN wrote:

> I use perlfect to index the php files via http,but it seem only index
> the first page(default.php)! My website is www.power198.com
> start indexer url is http://www.power198.com/index_form.html

I just tried it and 271 files have been indexed, so I don't see a problem.

Regards
 Daniel

--

-- 
http://www.danielnaber.de

_______________________________________________
perlfect-search mailing list
perlfect-search <at> perlfect.com
To unsubscribe, set other personal options or view the list archives please visit:
http://perlfect.com/mailman/listinfo/perlfect-search


ALAN | 25 Jan 02:20 2003

Re:I have a question about php,please help me

Daniel,
Thanks!
But i think it have a problem.
you can go to this page:
http://www.power198.com/getproduct.php?classcode=1&brand=ACER&model=1652 via http://www.power198.com.
but indexer can't index this kind of pages.
 
 
Best regards!
 
alan
 
 
Daniel Naber | 25 Jan 20:09 2003
Picon

Re: Re:I have a question about php,please help me

On Saturday 25 January 2003 02:20, ALAN wrote:

> But i think it have a problem.
> you can go to this page:
> http://www.power198.com/getproduct.php?classcode=1&brand=ACER&model=1652
> via http://www.power198.com. but indexer can't index this kind of pages.

Please have a look at this FAQ item:
http://www.perlfect.com/freescripts/search/faq.shtml#T10

Regards
 Daniel

--

-- 
http://www.danielnaber.de

_______________________________________________
perlfect-search mailing list
perlfect-search <at> perlfect.com
To unsubscribe, set other personal options or view the list archives please visit:
http://perlfect.com/mailman/listinfo/perlfect-search


Jon Ohmann | 30 Jan 16:04 2003

IGNORE TEXT not working

My site consists of dynamically created pages (using JSP).  These pages are all coming through a central shell page that embeds the content appropriately.  We have used the <!--ignore_perlfect_search--><!--/ignore_perlfect_search--> extensively to keep the same URL links from being indexed, but they do not seem to work.  Links within these tags (as well as HTML comment tags) are crawled by the indexer.  (we are initiating the indexer via HTTP).  The issue here is that it ends up nesting too deep for the indexer to complete all pages before running into the HTTP_MAX_PAGES limit (currently set to 800). 
 
To make it clear...the inital page (index2.1.jsp) has a link to dynamic pages (JSPID=12345 and 54321).  To display these pages, the URL is: index2.1.jsp?JSPID=12345 or index2.1.jsp?JSPID=84321.  These pages in turn have dynamic links as well.  The indexer crawls to index2.1.jsp, then to the first dynamic document which is really index2.1.jsp with new links.  It then crawls to these 'sub links' and so on and so on. Soon we are many levels deep when in reality, I don't want it to continue repeating the same initial page links (these are carried through to the next page)
 
We have these repeating links in the ignore text, but the indexer seems to disregard the tag and crawl anyway.
 
What are we doing wrong?
Daniel Naber | 30 Jan 16:31 2003
Picon

Re: IGNORE TEXT not working

On Thursday 30 January 2003 16:04, Jon Ohmann wrote:

> We have used the
> <!--ignore_perlfect_search--><!--/ignore_perlfect_search--> extensively
> to keep the same URL links from being indexed, but they do not seem to
> work.

"Ignore" here means to ignore the terms inside these tags, it does not work 
for links. The solution is to take the code from the start of normalize() 
in indexer.pl and add it to the crawl_http() function in indexer_web.pl.

Regards
 Daniel

--

-- 
http://www.danielnaber.de

_______________________________________________
perlfect-search mailing list
perlfect-search <at> perlfect.com
To unsubscribe, set other personal options or view the list archives please visit:
http://perlfect.com/mailman/listinfo/perlfect-search



Gmane