Logo
Sign in

Apache Kylin is an open-source OLAP engine for big data that delivers sub-second query latency on trillions of records, enabling high-performance, high-concurrency analytics with seamless BI tool integration.

Vendor

Vendor

The Apache Software Foundation

Company Website

Company Website

added_tables-ebc72ef18086cea932a2ed74f3bd7a27.png
deployment_arc-dfb372fda05a08215df7023a3ae4857c.svg
cubes-53047766d2fa1dacb5f45c5a1fa2a773.png
Product details

Apache Kylin

Apache Kylin is a powerful open-source distributed OLAP (Online Analytical Processing) engine designed for big data analytics. It enables sub-second query latency on datasets containing trillions of records, making it ideal for enterprise-scale data warehousing and business intelligence applications.

Features

  • Ultra-Fast Query Performance: Achieves sub-second query latency through advanced pre-computation techniques.
  • High Concurrency: Supports large-scale, high-concurrency analytics with minimal hardware and development costs.
  • Model & Index Recommendation: Automatically generates models and optimizes indexes based on query history or imported SQL.
  • Internal Table Support: Enables flexible query scenarios and lakehouse architecture with native compute engine integration.
  • Streaming-Batch Fusion Analysis: Supports hybrid analysis using both batch and streaming data sources (e.g., Apache Kafka).
  • Native Compute Engine: Integrates Apache Gluten-ClickHouse backend for 2–4x performance improvement over Spark.
  • Brand New Web UI: Simplified modeling interface for defining relationships, dimensions, and measures on a single canvas.

Capabilities

  • Multidimensional Modeling: Builds star or snowflake schemas for efficient large-scale data analysis.
  • Advanced Indexing: Uses aggregate and table indexes (CUBEs) to accelerate query performance.
  • Pre-computation: Aggregates data in advance to reduce runtime computation and improve responsiveness.
  • Streaming Data Integration: Enables real-time analytics by processing data as it arrives.
  • BI Tool Integration: Seamlessly connects with Tableau, Power BI, Excel, and other BI platforms.
  • Metadata Refactoring: Improved transaction performance and system concurrency through redesigned metadata architecture.
  • Flexible Data Loading: Supports batch and streaming data ingestion for dynamic analytics.

Benefits

  • Scalability: Handles petabyte-scale datasets with ease.
  • Speed: Delivers sub-second query responses even on massive data volumes.
  • Cost Efficiency: Reduces hardware and operational costs through optimized computation.
  • Ease of Use: Simplifies model creation with intelligent recommendations and intuitive UI.
  • Real-Time Insights: Enables timely decision-making with streaming data support.
  • Enterprise Readiness: Offers robust capabilities for mission-critical analytics applications.