Sitemap
TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Member-only story

How I Built A Cascading Data Pipeline Based on AWS (Part 2)

10 min readAug 25, 2023

--

Photo by Mehmet Ali Peker on Unsplash

Previously, I shared my experience in developing a data pipeline using AWS CloudFormation technology. It is not an optimal approach, though, because it leaves behind 3 more issues awaiting resolution:

  1. The deployment has to be imposed manually which could increase the chances of errors;
  2. All resources are created in one single stack, without proper boundaries and layers; as the development cycle goes on, the resource stack will be heavier, and managing it will be a disaster;
  3. Many resources are supposed to be sustained and reused in other projects.

In short, we are going to increase the manageability and reusability of this project, in an agile manner.

Solution

AWS enables users to implement 2 types of CloudFormation structural patterns: cross-stack reference and nested stacking. Cross-stack reference stands for a designing style of developing cloud stacks separately, and usually independently, while the resources among all stacks can be interrelated based on the reference relationship. Nested stacking means a CloudFormation stack composed of other stacks. It is achieved by using the AWS::CloudFormation::Stack resource.

--

--

TDS Archive
TDS Archive

Published in TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Memphis Meng
Memphis Meng

Written by Memphis Meng

I write data, sports and more.