
Apache Kyuubi is a distributed, multi-tenant gateway that provides serverless SQL access to data lakehouses. It enables high-performance analytics by abstracting complex backend engines like Apache Spark, Flink, and Trino, offering a unified SQL interface for interactive and batch workloads across diverse data sources.

Vendor

The Apache Software Foundation

Company Website

Images: Kyuubi layers diagram (kyuubi_layers.png), Kyuubi architecture diagram (kyuubi_architecture_new.png)

Product details

Apache Kyuubi

Apache Kyuubi is a distributed, multi-tenant gateway designed to provide serverless SQL access to data lakehouses. It abstracts complex backend engines like Apache Spark, Flink, and Trino, offering a unified and secure SQL interface for interactive analytics, batch processing, and data lake exploration. Kyuubi simplifies data access while ensuring high availability, scalability, and resource isolation.

Features

  • End-to-end multi-tenancy with unified authentication and authorization
  • High availability via ZooKeeper-based load balancing
  • ANSI SQL interface for diverse workloads
  • Background engine caching for fast query response
  • Multi-catalog metadata APIs for centralized data views
  • Support for traditional warehouses and modern lakehouses
  • Integration with BI tools through JDBC/ODBC
  • Storage-independent architecture
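
As a rough illustration of the JDBC and ZooKeeper-based entry points listed above, the sketch below builds the two kinds of connection URL a BI tool might be given. It assumes the Hive-compatible `jdbc:hive2://` URL form, a frontend port of 10009, and a ZooKeeper namespace of `kyuubi`; these values, the helper names, and the host names are illustrative assumptions, so check them against your own deployment.

```python
from typing import Sequence


def direct_url(host: str, port: int = 10009, database: str = "default") -> str:
    """Build a URL that points at a single gateway instance.

    The port default is an assumption; use whatever your server listens on.
    """
    return f"jdbc:hive2://{host}:{port}/{database}"


def zookeeper_url(
    quorum: Sequence[str],
    namespace: str = "kyuubi",  # assumed namespace, adjust to your cluster
    database: str = "default",
) -> str:
    """Build a URL that lets the JDBC driver pick a live instance via ZooKeeper.

    `quorum` is a list of "host:port" strings for the ZooKeeper ensemble.
    """
    hosts = ",".join(quorum)
    return (
        f"jdbc:hive2://{hosts}/{database};"
        f"serviceDiscoveryMode=zooKeeper;zooKeeperNamespace={namespace}"
    )


if __name__ == "__main__":
    # Hypothetical host names, for illustration only.
    print(direct_url("kyuubi.example.com"))
    print(zookeeper_url(["zk1.example.com:2181", "zk2.example.com:2181"]))
```

With the second form, clients name the ZooKeeper ensemble instead of a specific server, so an instance can be added or removed without changing the URL handed to BI tools.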

Capabilities

  • Interactive analytics with rapid query execution on big data
  • Batch processing for large-scale ETL operations
  • Query federation across Hive, Iceberg, Hudi, Delta Lake, and more
  • Engine isolation at user or connection level for stability
  • Deployment flexibility across cloud, on-premise, and hybrid environments
  • Compatibility with compute engines such as Apache Spark, Flink, and Trino
  • Centralized access to disparate data sources
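
Engine isolation at user or connection level can be pictured with a small toy model. The sketch below is not Kyuubi's implementation: the `EnginePool` class and its method are hypothetical, and it only shows how the choice of isolation level decides whether two sessions reuse a cached engine or each get their own.

```python
import itertools
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class EnginePool:
    """Toy model: hands out engine ids according to an isolation level."""

    share_level: str  # "USER" or "CONNECTION" (hypothetical labels for this sketch)
    _engines: Dict[str, int] = field(default_factory=dict)
    _ids: itertools.count = field(default_factory=itertools.count)

    def engine_for(self, user: str, connection_id: str) -> int:
        # The cache key determines who shares an engine.
        if self.share_level == "USER":
            key = user  # all of a user's connections share one engine
        elif self.share_level == "CONNECTION":
            key = f"{user}/{connection_id}"  # every connection is isolated
        else:
            raise ValueError(f"unsupported share level: {self.share_level}")

        # Reuse a cached engine if one exists, otherwise "launch" a new one.
        if key not in self._engines:
            self._engines[key] = next(self._ids)
        return self._engines[key]


if __name__ == "__main__":
    per_user = EnginePool("USER")
    # Same user, different connections: the cached engine is reused.
    print(per_user.engine_for("alice", "c1") == per_user.engine_for("alice", "c2"))

    per_conn = EnginePool("CONNECTION")
    # Same user, different connections: each gets its own engine.
    print(per_conn.engine_for("alice", "c1") == per_conn.engine_for("alice", "c2"))
```

User-level sharing trades some isolation for faster responses, since later sessions skip engine startup. Connection-level sharing keeps one misbehaving query from affecting other sessions.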

Benefits

  • Simplifies data access with a single SQL entry point
  • Enhances performance and concurrency for enterprise workloads
  • Reduces operational overhead with serverless architecture
  • Improves data governance through secure access control
  • Accelerates innovation with unified metadata and query capabilities
  • Scales efficiently with growing data and user demands