Lucene 3.6發(fā)布 Java全文搜索引擎
Lucene是一套用于全文檢索和搜尋的開(kāi)源程式庫(kù),由Apache軟件基金會(huì)支持和提供。Lucene提供了一個(gè)簡(jiǎn)單確強(qiáng)大的應(yīng)用程式接口,能夠做全文索引和搜尋,在Java開(kāi)發(fā)環(huán)境里L(fēng)ucene是一個(gè)成熟的免費(fèi)開(kāi)放源代碼工具;就其本身而論,Lucene是現(xiàn)在并且是這幾年,最受歡迎的免費(fèi)java資訊檢索程式庫(kù)。人們經(jīng)常提到資訊檢索程式庫(kù),就像是搜尋引擎,但是不應(yīng)該將資訊檢索程式庫(kù)與網(wǎng)搜索引擎相混淆。
Lucene 3.6 包含大量的 bug 修復(fù)、優(yōu)化和改進(jìn),主要內(nèi)容有:
* 完全支持 Java 7,要求 JDK 7u1
* TypeTokenFilter filters tokens based on their TypeAttribute.
* Fixed offset bugs in a number of CharFilters, Tokenizers and TokenFilters that could lead to exceptions during highlighting.
* Added phonetic encoders: Metaphone, Soundex, Caverphone, Beider-Morse, etc.
* CJKBigramFilter and CJKWidthFilter replace CJKTokenizer.
* Kuromoji morphological analyzer tokenizes Japanese text, producing both compound words and their segmentation.
* Static index pruning (Carmel pruning) removes postings with low within-document term frequency.
* QueryParser now interprets '*' as an open end for range queries.
* FieldValueFilter excludes documents missing the specified field.
* CheckIndex and IndexUpgrader allow you to specify the specific FSDirectory implementation to use with the new -dir-impl command-line option.
* FSTs can now do reverse lookup (by output) in certain cases and can be packed to reduce their size. There is now a method to retrieve top N shortest paths from a start node in an FST.
* New WFSTCompletionLookup suggester supports finer-grained ranking for suggestions.
* FST based suggesters now use an offline (disk-based) sort, instead of in-memory sort, when pre-sorting the suggestions.
* ToChildBlockJoinQuery joins in the opposite direction (parent down to child documents).
* New query-time joining is more flexible (but less performant) than index-time joins.
* Added HTMLStripCharFilter to strip HTML markup.
* Security fix: Better prevention of virtual machine SIGSEGVs when using MMapDirectory: Code using cloned IndexInputs of already closed indexes could possibly crash VM, allowing DoS attacks to your application.
* Many bug fixes...
下載地址:http://www.apache.org/dyn/closer.cgi/lucene/java/
【編輯推薦】