Logo
Sign in

Apache Lucene is a high-performance Java library for full-text search, indexing, and querying. It supports structured and unstructured data, faceting, spell correction, and vector-based nearest-neighbor search. Lucene is open source and highly scalable.

Vendor

Vendor

The Apache Software Foundation

Company Website

Company Website

search-lucene-issues.gif
lucene-high-level-diagram.svg
Product details

Apache Lucene

Apache Lucene is a high-performance, full-featured search engine library written in Java. It is designed for applications requiring structured and full-text search, faceting, spell correction, query suggestions, and nearest-neighbor search across high-dimensional vectors. Lucene is widely adopted and serves as the core search technology behind platforms like Elasticsearch and Solr.

Features

Lucene offers scalable indexing, capable of processing over 800GB/hour with minimal memory usage. It supports incremental and batch indexing, ranked searching, phrase and wildcard queries, proximity and range queries, and fielded searches. Additional features include typo-tolerant suggesters, flexible faceting, result grouping, highlighting, and pluggable ranking models like BM25 and Vector Space Model.

Capabilities

Lucene enables:

  • Structured and unstructured text search
  • Faceted navigation and filtering
  • Spell correction and query suggestions
  • Nearest-neighbor search for vector data
  • Efficient storage and retrieval using customizable codecs
  • Multi-index searching with merged results
  • Integration with various programming languages via bindings

Benefits

Lucene provides:

  • High-speed indexing and search performance
  • Low memory footprint and efficient storage
  • Flexibility through modular architecture and APIs
  • Proven reliability and scalability in production environments
  • Open-source licensing under Apache 2.0 for commercial and non-commercial use
  • Active community and continuous innovation