Why you should try something other than Airflow for data pipeline orchestration
Let’s evaluate AWS Step Functions, Google Workflows, and Prefect next to Airflow.
While Airflow has dominated the market in terms of usage and community size as a data pipeline orchestrator, it’s pretty old and wasn’t initially designed to meet some of the needs we have today. Airflow is still a great product, but this article's goal is to raise awareness of the alternatives and of what the perfect orchestration tool would be for your data use case.
So what are the criteria for a good data orchestration tool nowadays?
API-First design ⚙
Since cloud providers are API-first, you want your orchestration tool to be the same. Ideally, you want to be able to do a few things through the API:
- Create/delete workflows
- Serialize and deserialize DAGs easily for non-static/evolving workflows
- Run parameterized workflows
- Handle access management
- Deploy the orchestration tool (if not serverless) through IaC frameworks (Terraform/Pulumi)
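To make the DAG serialization point more concrete, here is a minimal sketch of what a round trip could look like. It is not the format used by Airflow, Prefect, or any cloud service: the JSON layout and the `serialize_dag` / `deserialize_dag` helpers are hypothetical names made up for illustration, and only the Python standard library is used. The idea is that a workflow definition becomes plain data you could send to or fetch from an API, and that it can be validated when loaded back.

```python
import json
from graphlib import TopologicalSorter


def serialize_dag(name, tasks):
    """Turn a workflow into a JSON string (hypothetical format).

    `tasks` maps each task name to the names of its upstream tasks.
    """
    spec = {"name": name, "tasks": {t: sorted(deps) for t, deps in tasks.items()}}
    return json.dumps(spec, sort_keys=True)


def deserialize_dag(payload):
    """Rebuild the workflow from JSON and compute a valid run order.

    Raises graphlib.CycleError if the payload does not describe a DAG.
    """
    spec = json.loads(payload)
    tasks = {t: set(deps) for t, deps in spec["tasks"].items()}
    # static_order() checks for cycles and yields upstream tasks first.
    order = list(TopologicalSorter(tasks).static_order())
    return spec["name"], tasks, order


# Example: a three-step pipeline defined as data rather than static code.
payload = serialize_dag(
    "daily_sales",
    {"extract": [], "transform": ["extract"], "load": ["transform"]},
)
name, tasks, order = deserialize_dag(payload)
print(name, order)
```

Because the definition is just data, an evolving workflow can be generated, stored, and re-submitted programmatically instead of being edited by hand in a static file.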