
DataHub is a service that is provided by Alibaba Cloud to process streaming data. You can use DataHub to publish and subscribe to streaming data. These features can help you analyze streaming data and build applications.
Vendor
Alibaba Cloud
Company Website



DataHub is a service that is provided by Alibaba Cloud to process streaming data. You can use DataHub to publish and subscribe to streaming data. These features can help you analyze streaming data and build applications.
Benefits
- High Stability DataHub is derived from the real-time transmission system of Alibaba Group. DataHub has been proven stable and reliable during Double 11 over the years.
- High Throughput Up to terabytes of data can be written to a topic per day. Up to hundreds of GB of data can be written to a shard per day.
- Low Cost DataHub is an out-of-the-box solution that helps you transmit data with low cost based on the pay-as-you-go billing method.
- Integrated Ecosystem DataHub is based on the Apsara distributed operating system and is deeply integrated with Alibaba Cloud big data systems. DataHub seamlessly connects with MaxCompute, Realtime Compute for Apache Flink, and Hologres.
Features
Data Import
DataHub supports various SDKs and APIs and provides multiple third-party plug-ins such as Flume and Logstash. You can import data to DataHub in an efficient manner.
Data Delivery
The DataConnector module can synchronize imported data to downstream storage and analysis systems in real time, such as MaxCompute, OSS, and Tablestore. This significantly reduces your workload.
Data Cache
DataHub supports flexible cache schedules, repeated consumption in downstream systems, and automatic backup to ensure high data reliability.
Multiple Interfaces
You can access DataHub by using the web-based console or by calling APIs and SDKs.