Apache Phoenix enables low-latency SQL access over HBase, supporting full ACID transactions, standard JDBC APIs, and dynamic schema management for OLTP and operational analytics in Hadoop environments.

Vendor

The Apache Software Foundation

Company Website

Product details

Apache Phoenix

Apache Phoenix is an open-source relational database layer over Apache HBase. It enables OLTP and operational analytics in Hadoop ecosystems by providing a familiar SQL interface and full ACID transaction support. Phoenix compiles SQL queries into native HBase scans and runs them in parallel, so applications get low-latency results without hand-written HBase client code. It integrates with other Hadoop ecosystem tools such as Spark, Hive, Pig, and Flume.

Features

  • SQL support including SELECT, JOIN, GROUP BY, ORDER BY, UPSERT, DELETE, and more
  • Full ACID transactions with optimistic concurrency control
  • Secondary indexes for efficient query performance
  • Updatable and read-only views
  • Dynamic columns and schema-on-read capabilities
  • JDBC driver for standard connectivity
  • Integration with Hadoop tools and MapReduce
  • Snapshot queries using versioned schema metadata
  • Multi-tenancy support via tenant-specific views
  • Server-side query execution using coprocessors
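As a rough illustration of the SQL and JDBC features listed above, the following minimal sketch creates a table, upserts a row, and runs an aggregate query through the standard `java.sql` API. The connection URL (`jdbc:phoenix:localhost:2181`), the `page_hits` table, and its columns are assumptions made up for this example, and the Phoenix client JAR is assumed to be on the classpath with a reachable cluster.

```java
import java.sql.Connection;
import java.sql.Date;
import java.sql.DriverManager;
import java.sql.PreparedStatement;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;

final class PhoenixJdbcSketch {
    // Assumption: the cluster's ZooKeeper quorum is reachable at localhost:2181.
    static final String URL = "jdbc:phoenix:localhost:2181";

    // Hypothetical table, used only for this illustration.
    static final String CREATE_SQL =
        "CREATE TABLE IF NOT EXISTS page_hits ("
        + " host VARCHAR NOT NULL,"
        + " hit_date DATE NOT NULL,"
        + " hits BIGINT"
        + " CONSTRAINT pk PRIMARY KEY (host, hit_date))";

    // Phoenix uses UPSERT (insert-or-update) rather than separate INSERT and UPDATE.
    static final String UPSERT_SQL =
        "UPSERT INTO page_hits (host, hit_date, hits) VALUES (?, ?, ?)";

    static final String QUERY_SQL =
        "SELECT host, SUM(hits) FROM page_hits GROUP BY host ORDER BY host";

    static void run(Connection conn) throws SQLException {
        try (Statement stmt = conn.createStatement()) {
            stmt.execute(CREATE_SQL);
        }
        try (PreparedStatement ps = conn.prepareStatement(UPSERT_SQL)) {
            ps.setString(1, "example.org");
            ps.setDate(2, new Date(System.currentTimeMillis()));
            ps.setLong(3, 42L);
            ps.executeUpdate();
        }
        // Phoenix connections do not auto-commit by default, so commit the upsert explicitly.
        conn.commit();

        try (Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery(QUERY_SQL)) {
            while (rs.next()) {
                System.out.println(rs.getString(1) + " -> " + rs.getLong(2));
            }
        }
    }

    public static void main(String[] args) throws SQLException {
        try (Connection conn = DriverManager.getConnection(URL)) {
            run(conn);
        }
    }
}
```

Because the sketch only uses `java.sql` interfaces, the same code shape should carry over to BI tools and SQL clients that speak JDBC; only the URL and the table definition are specific to this example.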

Capabilities

  • Transforms SQL into native HBase API calls for efficient execution
  • Supports parallel query execution and server-side filtering
  • Allows schema evolution and dynamic schema definition at query time
  • Enables mapping to existing HBase tables with minimal changes
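  • Caches table metadata on the client, with a configurable refresh interval (the UPDATE_CACHE_FREQUENCY property)
  • Supports salting of row keys (the SALT_BUCKETS table option) to spread writes evenly across region servers
  • Offers tools for bulk data loading (CSV loaders) and performance testing (Pherf)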
  • Compatible with Kerberos-secured clusters
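The row-key salting mentioned in the list above can be pictured with a small standalone sketch: a single leading byte, derived from a hash of the row key modulo the bucket count, is prepended so that sequential keys land in different buckets. This is a simplified illustration of the idea only. The class name, method names, and the use of `Arrays.hashCode` are assumptions for this example and are not Phoenix's actual hash function or API.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

final class SaltingSketch {
    // Derive a bucket number in [0, buckets) from the row key bytes.
    // Assumption: a generic array hash stands in for the real hash function.
    static byte saltByte(byte[] rowKey, int buckets) {
        if (buckets < 1 || buckets > 256) {
            throw new IllegalArgumentException("buckets must be between 1 and 256");
        }
        // floorMod keeps the result non-negative even when the hash is negative.
        return (byte) Math.floorMod(Arrays.hashCode(rowKey), buckets);
    }

    // Prepend the salt byte to the original key; the original bytes are unchanged.
    static byte[] salt(byte[] rowKey, int buckets) {
        byte[] salted = new byte[rowKey.length + 1];
        salted[0] = saltByte(rowKey, buckets);
        System.arraycopy(rowKey, 0, salted, 1, rowKey.length);
        return salted;
    }

    public static void main(String[] args) {
        int buckets = 4;
        int[] counts = new int[buckets];
        // Monotonically increasing keys would otherwise all hit the same region.
        for (int i = 0; i < 1000; i++) {
            byte[] key = String.format("row-%06d", i).getBytes(StandardCharsets.UTF_8);
            counts[salt(key, buckets)[0] & 0xFF]++;
        }
        System.out.println("rows per bucket: " + Arrays.toString(counts));
    }
}
```

The trade-off in any scheme of this shape is that a range scan over the original key order has to consult every bucket, which is why the bucket count is usually kept small relative to the data volume.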

Benefits

  • Simplifies HBase usage with familiar SQL syntax
  • Reduces development time and complexity
  • Enhances performance through optimized query execution
  • Enables scalable, high-throughput data operations
  • Facilitates integration with BI tools and existing SQL-based workflows
  • Supports flexible data modeling and schema management
  • Improves developer productivity and onboarding
  • Provides robust transaction management for critical applications