1 Feb 2007 01:48
Re: Field methods and usage
karl wettin <karl.wettin <at> gmail.com>
2007-02-01 00:48:55 GMT
On 31 Jan 2007, at 12:25, Christoph Pächter wrote:

> I was wondering if there is a table anywhere (similar to Table 1.2,
> "An overview of different field types, their characteristics, and
> their usage", in Lucene in Action) listing the possible methods and
> their usage. Implementations will differ, for example:
>
> Store |TermVector |Index |reasonable |Usage
> YES   |NO         |NO    |1          |URLs,
>       |           |      |           |telephone number

You never have to store anything in the index; perhaps that information is persisted somewhere else?

Whether you use a term vector depends very little on what kind of information you store in the field. It depends on what kinds of analysis you plan to include the documents in. Highlighting? More like this? Neural networks?

Some are more than happy with one large token. Other people might want to tokenize the exact same information: a URL into [protocol://host:port/path], a phone number into country, area, and district parts.

It is really up to each and every implementer to decide what settings is [...]
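
To make the "one large token" versus "tokenize the same information" choice concrete, here is a minimal sketch in plain Java. It does not use the Lucene API at all; the class name FieldTokens, its methods, and the sample URL and phone number are hypothetical and only illustrate the two ways the same value could be turned into tokens. Splitting a phone number on non-digit characters is an assumption too: real numbers would need locale-aware handling to find the country, area, and district parts.

```java
import java.net.URI;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical helper, not part of Lucene: shows two ways to turn one value into tokens.
class FieldTokens {

    // Option 1: keep the whole value as one large token.
    static List<String> asSingleToken(String value) {
        return Collections.singletonList(value);
    }

    // Option 2a: split a URL into protocol, host, port and path tokens.
    static List<String> urlParts(String url) {
        URI uri = URI.create(url);
        List<String> tokens = new ArrayList<>();
        tokens.add(uri.getScheme());
        tokens.add(uri.getHost());
        if (uri.getPort() != -1) {            // -1 means no explicit port in the URL
            tokens.add(String.valueOf(uri.getPort()));
        }
        String path = uri.getPath();
        if (path != null && !path.isEmpty()) {
            tokens.add(path);
        }
        return tokens;
    }

    // Option 2b: split a phone number into its digit groups (assumed to be
    // separated by spaces, dashes or other non-digit characters).
    static List<String> phoneParts(String phone) {
        List<String> tokens = new ArrayList<>();
        for (String part : phone.split("[^0-9]+")) {
            if (!part.isEmpty()) {            // a leading "+" yields an empty first piece
                tokens.add(part);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(asSingleToken("http://example.org:8080/docs/index.html"));
        System.out.println(urlParts("http://example.org:8080/docs/index.html"));
        System.out.println(phoneParts("+46 8 123 456"));
    }
}
```

Which variant is appropriate depends on how the field will be searched: a single token only matches the exact value, while the split form lets a query match on, say, the host or the area code alone.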