MLOps End-to-End Machine Learning Pipeline with CI/CD

Senthil E
Analytics Vidhya
13 min read · Jul 5, 2021


The main objective of this project is to automate the whole machine learning app deployment process. To implement it, you need some understanding of TensorFlow and basic knowledge of Docker and Kubernetes. If you want to know more about Docker, Kubernetes, and Cloud Build, refer to the links in the References. Let's dive in.

Contents:

  1. About the Dataset
  2. Model Development Steps
  3. Model Deployment and CICD Steps

What is MLOps?

According to the Google documentation:

👉🏻 MLOps is a methodology for ML engineering that unifies ML system development (the ML element) with ML system operations (the Ops element). It advocates formalizing and (when beneficial) automating critical steps of ML system construction. MLOps provides a set of standardized processes and technology capabilities for building, deploying, and operationalizing ML systems rapidly and reliably.

MLOps supports ML development and deployment in the way that DevOps and DataOps support application engineering and data engineering (analytics). The difference is that when you deploy a web service, you care about resilience, queries per second, load balancing, and so on. When you deploy an ML model, you also need to worry about changes in the data, changes in the model, users trying to game the system, and so on. This is what MLOps is about.

1. About the Dataset:

This dataset was initially published by analyticsvidhya.com and is also available on Kaggle.

This dataset contains around 25k images of size 150x150, distributed across 6 categories:
{'buildings' -> 0, 'forest' -> 1, 'glacier' -> 2, 'mountain' -> 3, 'sea' -> 4, 'street' -> 5}

The Train, Test, and Prediction data are split into separate zip files. There are around 14k images in Train, 3k in Test, and 7k in Prediction.

2. Model Development Steps

Image credit — TensorFlow documentation

I am not going into detail on model development. I am using TensorFlow for this image classification problem; the objective here is to build the model and then automate the deployment process. A rough sketch of the model code follows the list below.

  • Unstructured data
  • Image classification (multiclass)
  • Use the TensorFlow library
  • Load the dataset into a dataframe
  • Explore the dataset
  • Prepare the data
  • Data augmentation using ImageDataGenerator
  • CNN classifier
  • Multiclass classification: softmax output
  • Loss: categorical_crossentropy
  • Optimizer: Adam
Image by the author
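The notebook itself is not reproduced here. As a rough illustration only, a minimal version of the setup described above could look like the following; the folder path, augmentation parameters, and layer sizes are illustrative assumptions, not the exact choices used in the notebook.

# Minimal sketch of the training setup described above (illustrative hyperparameters)
import tensorflow as tf
from tensorflow.keras.preprocessing.image import ImageDataGenerator

# data augmentation via ImageDataGenerator
train_gen = ImageDataGenerator(rescale=1./255, rotation_range=20,
                               horizontal_flip=True, zoom_range=0.2)
train_data = train_gen.flow_from_directory('train/',              # assumed path: Train images, one folder per class
                                           target_size=(150, 150),
                                           batch_size=32,
                                           class_mode='categorical')

# simple CNN classifier with a 6-way softmax head
model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(32, 3, activation='relu', input_shape=(150, 150, 3)),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(64, 3, activation='relu'),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(128, activation='relu'),
    tf.keras.layers.Dense(6, activation='softmax'),   # 6 scene categories
])

model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])
model.fit(train_data, epochs=10)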

Remember to save the model and test it again by loading it back.

model.save('/content/drive/MyDrive/Files/image_intel/models/', save_format='tf')

and load it back:

model_loaded = tf.keras.models.load_model('/content/drive/MyDrive/Files/image_intel/models/models/')

The saved model folder will look like this:

Image by the author

3. Model Deployment and CICD Steps

Below are the steps we are going to follow to deploy the model on GCP.

What is CI/CD?

According to Google documentation

Continuous Integration (CI) and Continuous Delivery (CD) enable teams to adopt automation in building, testing, and deploying software. A CI/CD pipeline builds and deploys an application to GKE using Container Registry and Cloud Build.

We will be doing the following steps.

  1. GitHub ready: create all the files needed for the automation and keep the GitHub repository ready.
  2. Cloud Build: the build will be done using Google Cloud Build.
  3. Testing: no automated testing in this pipeline.
  4. Deploy: we will deploy to GKE with 2 replicas.

GitHub, Cloud Build, and deployment to GKE:

The detailed steps are:

  1. Create a Streamlit app (Python file).
  2. Create a Dockerfile.
  3. Create the requirements file.
  4. Create the Kubernetes deployment YAML file.
  5. Create the Kubernetes service YAML file.
  6. Create the Cloud Build YAML file.
  7. Create a GitHub repository in GitHub Desktop.
  8. Add and organize the files in GitHub Desktop.
  9. Push the files from the desktop to GitHub.
  10. Link Cloud Build to GitHub and to the GCP project.
  11. Create a trigger in GCP, based on changes to the GitHub code.
  12. Now the build is triggered and the app is deployed on the Kubernetes Engine.

Let's go into the details of the above steps.

Now you have the CNN image classification model saved and ready to be deployed. We are going to deploy the model on Google Kubernetes Engine.

According to the Google Cloud documentation: Google Kubernetes Engine (GKE) provides a managed environment for deploying, managing, and scaling your containerized applications using Google infrastructure. The GKE environment consists of multiple machines (specifically, Compute Engine instances) grouped together to form a cluster.

Some of the advantages of using Kubernetes on GKE are:

  • Load balancing
  • Automatic scaling
  • Automatic upgrades
  • Node auto-repair
  • Logging and monitoring

If you don't have a GCP account, you can create one and use the $300 free credit that GCP offers new users. The details:

New customers also get $300 in free credits to fully explore and conduct an assessment of the Google Cloud Platform. You won’t be charged until you choose to upgrade.

Let's see all the steps in detail.

Streamlit:

Streamlit is an open-source app framework for Machine Learning and Data Science teams. Create beautiful data apps in hours, not weeks. All in pure Python​.

Image credit — Streamlit documentation

For Streamlit examples, check out the Streamlit gallery link in the References.

The Python file for the Streamlit app is below.
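The original file was embedded from a gist. As a hedged sketch only, a minimal Streamlit app of this kind could look like the following; the file name myapp.py matches the CMD in the Dockerfile below, while the model path and the preprocessing (rescaling to [0, 1]) are assumptions.

# myapp.py - minimal sketch of the Streamlit image-classification app (assumed names and paths)
import numpy as np
import streamlit as st
import tensorflow as tf
from PIL import Image

CLASS_NAMES = ['buildings', 'forest', 'glacier', 'mountain', 'sea', 'street']

@st.cache(allow_output_mutation=True)        # load the SavedModel once and reuse it across reruns
def load_model():
    return tf.keras.models.load_model('models/')   # assumed path of the SavedModel inside the container

st.title('Intel Image Classification')
uploaded = st.file_uploader('Upload an image', type=['jpg', 'jpeg', 'png'])

if uploaded is not None:
    image = Image.open(uploaded).convert('RGB').resize((150, 150))   # dataset images are 150x150
    st.image(image, caption='Uploaded image')
    batch = np.expand_dims(np.array(image) / 255.0, axis=0)          # assumes the model was trained on rescaled inputs
    preds = load_model().predict(batch)
    st.write('Prediction:', CLASS_NAMES[int(np.argmax(preds))])      # softmax output -> class label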

Docker:

A Dockerfile is a text document that contains all the commands a user could call on the command line to assemble an image.

Image credit-Docker Documentation

To know more about Docker, and about the difference between a VM and Docker, check out the Docker documentation.

Before creating the Dockerfile, let's create the requirements.txt file, which will be used in the Dockerfile.

The requirements file in our case is below.

Image by the author

The requirements file lists all the packages the application needs. In our case, that means libraries such as TensorFlow, Streamlit, pandas, and matplotlib; a sketch of such a file follows.
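As an illustration only, a requirements.txt along these lines would cover the libraries named above (versions omitted; pin the ones that match your training environment; Pillow is an assumed extra for image handling in the Streamlit app):

tensorflow
streamlit
pandas
matplotlib
Pillow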

Why do we need the requirements file in the Dockerfile? Because the Docker image must contain every package the application depends on, and the Dockerfile installs them from requirements.txt at build time.

Our Dockerfile is below.

Image by the author

The Dockerfile contains the following (a consolidated sketch follows these notes):

  • Docker images can be inherited from other images. Therefore, instead of creating our own base image, we use the official Python image, which already has all the tools and packages we need to run a Python application. We are using Python 3.7. Why slim? The slim image is a pared-down version of the full image that installs only the minimal packages needed to run your particular tool. By leaving out lesser-used tools, the image is smaller. Use it if you have space constraints and do not need the full version, but be sure to test thoroughly; if you run into unexplained errors, try switching to the full image and see if that resolves them.

COPY . .

  • Create the working directory, set it as a variable, and copy all the local files into it. This COPY command takes all the files located in the current directory and copies them into the image.

RUN pip3 install -r requirements.txt

  • After the copy, run pip to install all the packages listed in requirements.txt. This works exactly the same as running pip3 install locally, but this time the modules are installed into the image.
  • Now we have Python installed along with all the dependencies.
  • Finally, we tell Docker what command to run when the image is executed inside a container, using the CMD instruction. We want to execute the Streamlit app, disabling CORS protection by setting the --server.enableCORS flag to false:

CMD [ "streamlit", "run", "--server.enableCORS", "false", "myapp.py" ]

Kubernetes:

We have already created the Dockerfile, so why do we need Kubernetes? According to the Kubernetes documentation:

Kubernetes, also known as K8s, is an open-source system for automating deployment, scaling, and management of containerized applications.

Image credit-Kubernetes Documentation

Kubernetes is open-source container management software originally developed at Google. It helps you manage containerized applications across physical, virtual, and cloud environments.

Kubernetes simplifies the deployment and configuration of complex containerized applications and helps with concerns like scaling and load balancing. Kubernetes was created at Google and later donated to the Cloud Native Computing Foundation (CNCF); it is now maintained by the CNCF and has strong community support and users around the globe. Google itself runs billions of containers to serve its users. Managed Kubernetes is available on the major cloud platforms, such as Google Cloud Platform's Google Kubernetes Engine (GKE), Amazon's Elastic Kubernetes Service (EKS), and Microsoft's Azure Kubernetes Service (AKS). The CLI tool used to interact with Kubernetes objects is kubectl.

The alternatives to Kubernetes are:

  1. Amazon ECS
  2. Red Hat OpenShift
  3. Docker Swarm
  4. Nomad
  5. AWS Fargate

All the config files are written in YAML.

We will be creating 2 YAML files:

  • Deployment YAML file
  • Service YAML file

To learn more about Deployment and Service files, please check out the Kubernetes documentation linked in the References.

The deployment file is below.

Image by the author
  • apiVersion - Which version of the Kubernetes API you're using to create this object
  • kind - What kind of object you want to create
  • metadata - Data that helps uniquely identify the object, including a name string, UID, and optional namespace
  • spec - What state you desire for the object
  • The important point to note is the container image used to build the Pod: image: gcr.io/my-vision-project-283816/myapp:v1. This is the image built from the Dockerfile and registered in the GCP Container Registry.
  • The container port is 8501, since Streamlit serves on port 8501.
  • The Deployment creates two replicated Pods, indicated by the .spec.replicas field. If you want to scale up further, increase the replicas to a higher number. A sketch of the file is below.
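As a sketch, a deployment file consistent with the points above could look like this; the names imageclassifier and myapp are assumptions taken from the service description and the image tag.

# k8s/deployment.yaml - sketch of the Deployment described above (assumed object and label names)
apiVersion: apps/v1
kind: Deployment
metadata:
  name: imageclassifier
spec:
  replicas: 2                       # two replicated Pods, as noted above
  selector:
    matchLabels:
      app: imageclassifier
  template:
    metadata:
      labels:
        app: imageclassifier
    spec:
      containers:
      - name: myapp
        image: gcr.io/my-vision-project-283816/myapp:v1   # image built and pushed by Cloud Build
        ports:
        - containerPort: 8501       # Streamlit's port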

The service yaml file is below

Image by the author
  • The kind is Service.
  • The app name is imageclassifier, the same name used in the deployment file.
  • This specification creates a new Service object named "imageclassifier", which targets TCP port 8501 on any Pod with the app=imageclassifier label. A sketch is below.
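A sketch of the corresponding Service file; the LoadBalancer type is an assumption, consistent with the external endpoint shown later in the article.

# k8s/service.yaml - sketch of the Service described above (LoadBalancer type assumed)
apiVersion: v1
kind: Service
metadata:
  name: imageclassifier
spec:
  type: LoadBalancer            # assumed: exposes an external endpoint for the app
  selector:
    app: imageclassifier        # matches the Pods created by the Deployment
  ports:
  - protocol: TCP
    port: 8501                  # port exposed by the Service
    targetPort: 8501            # Streamlit's container port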
Image by the author — source vmware

Check out the difference between Kubernetes and Docker.

We are in the final stages of the automatic deployment.

Google Cloud Build:

According to Google documentation

Cloud Build is a service that executes your builds on Google Cloud Platform infrastructure. Cloud Build can import source code from Cloud Storage, Cloud Source Repositories, GitHub, or Bitbucket, execute a build to your specifications, and produce artifacts such as Docker containers or Java archives.

Cloud Build executes your build as a series of build steps, where each build step is run in a Docker container. A build step can do anything that can be done from a container irrespective of the environment. To perform your tasks, you can either use the supported build steps provided by Cloud Build or write your own build steps.

Other products similar to Google Cloud Build are:

  • AWS CodePipeline
  • CircleCI
  • Jenkins
  • GitHub Actions
  • Postman
  • GitLab
  • CloudBees CI
  • Amazon Elastic Container Service (Amazon ECS)
Image credit -Google Cloud Documentation

We need to create a YAML file for Cloud Build. The cloudbuild YAML file is below.

Image by the author
  • The first step is to build the Docker image.
  • Make sure to give the registry location.
  • The second step pushes the Docker image built in step one to Container Registry.
  • The third step deploys the app to the Kubernetes cluster. The k8s folder contains the deployment YAML file and the service YAML file.
  • Also mention the name and location (zone) of the Kubernetes cluster you created.
  • We make use of the ${PROJECT_ID} substitution variable. A sketch of the file follows.
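As a sketch, a cloudbuild.yaml consistent with those three steps could look like the following; the image name myapp, the k8s folder, and the cluster name/zone (taken from the cluster-creation command further below) are assumptions you would adjust to your own project.

# cloudbuild.yaml - sketch of the three build steps described above (assumed image name, folder, cluster)
steps:
# Step 1: build the Docker image
- name: 'gcr.io/cloud-builders/docker'
  args: ['build', '-t', 'gcr.io/${PROJECT_ID}/myapp:v1', '.']
# Step 2: push the image to Container Registry
- name: 'gcr.io/cloud-builders/docker'
  args: ['push', 'gcr.io/${PROJECT_ID}/myapp:v1']
# Step 3: apply the deployment and service YAML files to the GKE cluster
- name: 'gcr.io/cloud-builders/kubectl'
  args: ['apply', '-f', 'k8s/']
  env:
  - 'CLOUDSDK_COMPUTE_ZONE=us-west1-b'        # cluster zone
  - 'CLOUDSDK_CONTAINER_CLUSTER=mykube'       # cluster name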

Please check out the References to learn more about YAML.

So far we have created the following:

  • Model Jupyter notebook
  • Streamlit Python file
  • Requirements text file
  • Dockerfile
  • k8s deployment YAML file
  • k8s service YAML file
  • Cloud Build YAML file

We need to do some additional manual steps before setting the trigger.

Create the GCP Project:

The steps below are from the Google documentation:

  • Open the Google Cloud Console.
  • Next to "Google Cloud Platform," click the down arrow. A dialog listing current projects appears.
  • Click New Project. The New Project screen appears.
  • In the Project Name field, enter a descriptive name for your project. If you're executing a quickstart, use "Quickstart."
  • To edit the Project ID, click Edit. The project ID can't be changed after the project is created, so choose an ID that meets your needs for the lifetime of the project.
  • Click Organization and select your organization. In the Location field, click Browse to display potential locations for your project. Click a location and click Select.
  • Click Create. The console navigates to the Dashboard page, and your project is created within a few minutes.

Activate the APIs:

In GCP you need to activate the following APIs:

  • Google Kubernetes Engine
  • Google Cloud Build
  • Google Container Registry

Create the K8s Cluster:

We need to create the Kubernetes cluster in GCP.

We can create the cluster using the command-line interface:

Image by the author
gcloud container clusters create mykube --zone "us-west1-b" --machine-type "n1-standard-1" --num-nodes "1" --service-account my-vision-project-283816@appspot.gserviceaccount.com

Name of the cluster: mykube

Image by the author
  • Number of nodes: 1 (a basic cluster)

Make sure to give the cluster name correctly in the Cloud Build YAML file.

GitHub Desktop:

  • Create a new repository.
  • Organize the files.
  • Push it to GitHub.
Image by the author

Create the Cloud Build Trigger:

We are in the last step of the automation.

  1. Connect the GitHub repository and Cloud Build: have your source code ready in a GitHub repository.

Check out the Cloud Build documentation, which contains the steps to connect the GitHub repository to Cloud Build.

2. After connecting Cloud Build and the GitHub repository, create the trigger. Check out the documentation on how to create a trigger.

The trigger is now created.

Image by the author

Testing the CI/CD Pipeline:

The Cloud Build trigger fires whenever we push to the repository. Just make some changes to the README file and push it. Now you can see that the Cloud Build trigger has been triggered.

Image by the author

It takes around 5-6 minutes to complete. You can check the log: it shows that in step one the Docker image is built and then pushed to the Container Registry. Notice that you can see the output for each of the build steps defined in our Cloud Build YAML file.

Image by the author

Image by the author
Image by the author

Once the build is complete, you can see the status.

Image by the author

If it fails, check the log and fix the error. My build failed once because I didn't give the folder name correctly, and another time because I gave the GKE cluster name incorrectly. So if there are any errors, check the log, fix them, and run the trigger again.

After a successful build, you can see the pods running. Since we set replicas to 2 in the deployment file, you see 2 pods created.
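If you have kubectl configured against the cluster, a quick way to check (the deployment name imageclassifier is the one assumed in the sketches above):

kubectl get deployments imageclassifier   # should show 2/2 replicas ready
kubectl get pods                          # lists the two Pods created by the Deployment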

Image by the author

The endpoint is created

Image by the author

Now you can test the app

Image by the author
Image by the author

Cleanup:

Please make sure to delete the resources after you are done with the project. Delete the following:

  • Pods, services, and endpoints created.
  • Kubernetes cluster
  • Container registry images
  • Storage buckets
  • Cloud build trigger

Just make sure to delete all the objects created so you are not charged; try to stay within the $300 credit provided by GCP.
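For reference, cleanup along these lines can be done from the command line; the cluster name and zone are the ones used above, while TRIGGER_NAME and BUCKET_NAME are placeholders you would substitute with your own values.

# delete the GKE cluster (this removes the pods, services, and endpoints it runs)
gcloud container clusters delete mykube --zone us-west1-b

# delete the container image pushed to Container Registry
gcloud container images delete gcr.io/my-vision-project-283816/myapp:v1 --force-delete-tags

# delete the Cloud Build trigger (replace TRIGGER_NAME with your trigger's name)
gcloud beta builds triggers delete TRIGGER_NAME

# remove any storage buckets created during the project (replace BUCKET_NAME)
gsutil rm -r gs://BUCKET_NAME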

Conclusion:

There are many ways to deploy an app and create a CI/CD pipeline. Here I used Google Kubernetes Engine and Cloud Build. Maybe try to do it on AWS or Azure. Please feel free to connect with me on LinkedIn.

References:

  1. MLOps: Continuous delivery and automation pipelines in machine learning: https://cloud.google.com/architecture/mlops-continuous-delivery-and-automation-pipelines-in-machine-learning
  2. AI Engineering, MLOps playlist: https://www.youtube.com/watch?v=K6CWjg09fAQ&list=PL3N9eeOlCrP5a6OA473MA4KnOXWnUyV_J
  3. CI/CD pipeline: https://tanzu.vmware.com/cicd
  4. Kubernetes Deployments and Services: https://kubernetes.io/docs/concepts/workloads/controllers/deployment/
  5. Google Cloud Build: https://cloud.google.com/build
  6. Streamlit gallery: https://streamlit.io/gallery?type=apps&category=computer-vision-images
