Logo
Sign in
Product Logo
Apache DolphinSchedulerThe Apache Software Foundation

Apache DolphinScheduler is a distributed and extensible open-source workflow orchestration platform. It enables users to design, schedule, and monitor complex data workflows using a visual DAG interface. It supports high concurrency, low latency, and integrates with diverse data processing tools and environments.

Vendor

Vendor

The Apache Software Foundation

Company Website

Company Website

home-1-1.png
home-1-2.png
home-1-4.png
Product details

Apache DolphinScheduler

Apache DolphinScheduler is a distributed, open-source workflow orchestration platform designed to manage complex data workflows with high reliability and scalability. It provides a powerful visual DAG interface, supports a wide range of task types, and enables users to define, schedule, and monitor workflows across diverse environments. With its low-code capabilities and extensible architecture, DolphinScheduler is ideal for modern data engineering and automation needs.

Features

  • Visual DAG-based workflow design with drag-and-drop interface
  • Support for over 30 built-in task types including Spark, Flink, Hive, Python, Shell, and more
  • Multi-tenant architecture with isolated environments and worker groups
  • Dynamic parameter passing and output modification between tasks
  • Workflow version control, rollback, and rerun capabilities
  • Batch task execution with intelligent scheduling by date range or list
  • Python, YAML, and OpenAPI support for workflow definition
  • Sub-process task nodes for modular workflow reuse
  • Decentralized multi-master and multi-worker architecture
  • Real-time task queue caching to prevent overload

Capabilities

  • Orchestrates complex workflows across projects and environments
  • Enables high-concurrency, high-throughput, and low-latency task execution
  • Supports dynamic online/offline scaling of master and worker nodes
  • Integrates with cloud-native and big data ecosystems
  • Provides workflow-as-code via PyDolphinScheduler for Python-based automation
  • Allows task dependency management using visual tools or code
  • Offers resource isolation and quota management for tasks
  • Facilitates data backfill and historical reruns without affecting templates
  • Supports both serial and parallel batch task execution
  • Enables real-time monitoring and alerting of workflow status

Benefits

  • Simplifies workflow creation and management with intuitive UI and low-code tools
  • Reduces operational complexity through automation and orchestration
  • Enhances productivity with reusable workflow components and version control
  • Improves system reliability with decentralized architecture and HA support
  • Accelerates data pipeline development and deployment
  • Enables seamless collaboration across teams and departments
  • Scales efficiently with growing data and task volumes
  • Supports hybrid and multi-cloud environments
  • Encourages best practices in workflow governance and observability
  • Backed by a strong open-source community and enterprise adoption