Logo
Sign in
Product Logo
Starburst GalaxyStarburst

Fully-managed data lakehouse platform enabling fast analytics and AI across multi-cloud environments.

Starburst-image.jpg
Product details

Overview

Starburst Galaxy is a fully-managed data lakehouse platform that empowers organizations to perform fast analytics and fuel AI initiatives across multi-cloud environments. By connecting directly to various data sources, it eliminates the need for upfront data centralization, offering a flexible and cost-efficient data strategy. With Starburst Galaxy, users can unify their data, enabling seamless querying across disparate sources and near real-time data ingestion. The platform is designed to enhance collaboration through an intuitive interface and a comprehensive semantic layer, facilitating self-service analytics while ensuring enterprise-grade governance and security.

Features and Capabilities

  • Effortless Federation: Provides a single point of access to query across data lakes, warehouses, and databases with over 20 native connectors for seamless data access.
  • Faster Insights: Optimized query execution for predictable workloads, including subquery caching and result set caching, ensuring rapid data retrieval.
  • Real-time Analysis: Supports real-time data ingestion for up-to-date insights and continuous file loading to simplify building an Iceberg data lake.
  • Data Products: Allows users to curate, share, and govern data products natively, enabling data teams to create new data products by seamlessly joining data from data lakes and surrounding sources without the hassle of data movement.
  • Data Catalog and Governance: Features a universal data discovery, governance, and sharing layer called Gravity, which indexes relevant objects from data sources, enhancing discoverability. It also offers fine-grained access controls down to the row level, including support for column masks.
  • Interactive Data Lake Analytics: Powered by open-source Trino, Starburst Galaxy is designed for analyzing large and complex datasets in and around cloud data lakes, ranging from gigabyte to petabyte scale.
  • Compute Tailoring: Enables users to tailor compute resources to the individual needs of workloads with enhanced cluster execution modes like Warp Speed and query-level fault tolerance.
  • Multi-Cloud Support: Operates on AWS, Azure, and GCP, with the ability to query across clouds and even on-premise data, maintaining the data lake as the center of gravity.
  • Open Table Format Support: Offers first-class support for all modern table formats, including Apache Iceberg, Delta Lake, and Hudi.
  • Enterprise-Grade Security: Provides built-in security controls for all data, protecting it and meeting stringent requirements with robust operational and security measures.
  • 24/7 Support: Offers round-the-clock support from Trino experts to assist users whenever needed.