Lucene 3.6发布 Java全文搜索引擎

开发 后端
Lucene是一套用于全文检索和搜寻的开源程式库,由Apache软件基金会支持和提供。Lucene提供了一个简单确强大的应用程式接口,能够做全文索引和搜寻,在Java开发环境里Lucene是一个成熟的免费开放源代码工具;就其本身而论,Lucene是现在并且是这几年,最受欢迎的免费java资讯检索程式库。

Lucene是一套用于全文检索和搜寻的开源程式库,由Apache软件基金会支持和提供。Lucene提供了一个简单确强大的应用程式接口,能够做全文索引和搜寻,在Java开发环境里Lucene是一个成熟的免费开放源代码工具;就其本身而论,Lucene是现在并且是这几年,最受欢迎的免费java资讯检索程式库。人们经常提到资讯检索程式库,就像是搜寻引擎,但是不应该将资讯检索程式库与网搜索引擎相混淆。

Lucene 3.6 包含大量的 bug 修复、优化和改进,主要内容有:

* 完全支持 Java 7,要求 JDK 7u1
* TypeTokenFilter filters tokens based on their TypeAttribute.
* Fixed offset bugs in a number of CharFilters, Tokenizers and TokenFilters that could lead to exceptions during highlighting.
* Added phonetic encoders: Metaphone, Soundex, Caverphone, Beider-Morse, etc.
* CJKBigramFilter and CJKWidthFilter replace CJKTokenizer.
* Kuromoji morphological analyzer tokenizes Japanese text, producing both compound words and their segmentation.
* Static index pruning (Carmel pruning) removes postings with low within-document term frequency.
* QueryParser now interprets '*' as an open end for range queries.
* FieldValueFilter excludes documents missing the specified field.
* CheckIndex and IndexUpgrader allow you to specify the specific FSDirectory implementation to use with the new -dir-impl command-line option.
* FSTs can now do reverse lookup (by output) in certain cases and can be packed to reduce their size. There is now a method to retrieve top N shortest paths from a start node in an FST.
* New WFSTCompletionLookup suggester supports finer-grained ranking for suggestions.
* FST based suggesters now use an offline (disk-based) sort, instead of in-memory sort, when pre-sorting the suggestions.
* ToChildBlockJoinQuery joins in the opposite direction (parent down to child documents).
* New query-time joining is more flexible (but less performant) than index-time joins.
* Added HTMLStripCharFilter to strip HTML markup.
* Security fix: Better prevention of virtual machine SIGSEGVs when using MMapDirectory: Code using cloned IndexInputs of already closed indexes could possibly crash VM, allowing DoS attacks to your application.
* Many bug fixes...

下载地址:http://www.apache.org/dyn/closer.cgi/lucene/java/

【编辑推荐】

  1. RRiBbit 1.1.0发布 Java的IoC框架
  2. JActor 3.0.0 RC4发布 Actor模式的Java实现
  3. JActor 3.0.0 RC3发布 Actor模式的Java实现
  4. JActor 3.0.0 RC2发布 Actor模式的Java实现
  5. JActor 2.2.0 RC3发布 Actor模式的Java实现
责任编辑:林师授 来源: 51CTO
相关推荐

2011-09-16 15:55:09

Java

2011-12-14 10:06:03

Java

2018-12-28 09:48:11

SolrElasticSear搜索

2017-08-17 16:42:38

Elastic 全文搜索服务器

2021-04-12 10:38:17

ElasticSearSolrJava

2011-06-20 18:23:06

SEO

2022-01-13 20:14:46

Qury元宇宙搜索

2014-11-25 10:09:59

ElasticSear分布式搜索引擎Lucene

2012-03-06 15:34:05

JavaFreeMarker

2020-03-20 10:14:49

搜索引擎倒排索引

2017-08-07 08:15:31

搜索引擎倒排

2009-02-19 09:41:36

搜索引擎搜狐百度

2010-04-20 11:43:46

2023-04-20 14:48:05

微软AIChatGPT

2020-07-31 09:55:27

Linux分布式Elasticsear

2012-09-07 13:22:21

搜索搜狗

2022-10-08 09:13:18

搜索引擎⽹站

2009-09-22 16:23:52

搜索引擎

2012-01-05 10:26:26

JavaApache

2012-03-09 09:47:51

Apache CameJava
点赞
收藏

51CTO技术栈公众号