Analytics Vidhya
Published in

Analytics Vidhya

How I Passed The GCP Professional Data Engineer Certification(In 1st attempt)

I passed the Google Cloud Professional Data Engineer exam on the first attempt on 27th September. Let me share my preparation with you all for your reference.

Originally published at

Table of Contents

  • My Previous Experience with GCP
  • Current work with GCP
  • My Preparation For Exam
  • Practice Test
  • One important Tip

Youtube Video :
If you prefer video format :

My Previous Experience with GCP

I started my GCP journey in 2018 when I was working on Kafka and Kubernetes. I was so happy to see my first distributed software installed and communicating with each other. (It was a pleasure when you deploy distributed software like Kafka in a cluster environment)

During that time I learned about streaming technology, GCP Compute Engine, Docker, Kubernetes.

Current Work with GCP

My current work is to build a data platform on GCP. I explored various products like BigQuery, Data Studio, Cloud Spanner, Cloud BigTable, Google cloud storage, Cloud Composer, KMS, Dataflow, and apache beam. I tried POC and tried to understand each product and finalize the design for the business requirements.

My Preparation for Exam

Although I worked with many data analytics products on GCP, there were some products I didn't explore like PubSub, Dataproc, Data fusion, Data prep, and Machine Learning products like Kubeflow, Machine Learning Platform.

Hence I needed some online courses which could give me a basic understanding of the product.

I watch many video courses on Pluralsight and Coursera

1: Video Courses I Completed

List of courses I completed on Pluralsight:

Data Storage

Data Processing

Pipeline Orchestration

Machine Learning

Overall Data Platform

Above courses were very helpful to set the base for understanding of data engineering on GCP.

2: Product Documentation

Next step is took was to go through the documentation of each product. This was helpful in understanding detail concepts . Going through entire documentation looked too much initially but this is very important in order to not miss out on certain important concepts. I decided to use 1 hour of my day to go through the documentation.

3: GCP Qwiklabs

I practiced data engineering quest on GCP Qwiklabs which is good way to perform hands-on.

4: Final Steps

  • I referred this blog very useful during last week preparation in order to avoid if i am missing any concepts.
  • This course from GCP on Pluralsight was also good for last week preparation.

Practice Test

One important Tip

Understanding open source alternatives to each GCP product would be very helpful in exam, since questions include comparison with open source alternatives.

For example, I studied GCP PubSub, but I also learnt about Apache Kafka . This is important because In the exam you would often ask about questions on comparison and alternatives to choose from. Hence If learning about Google Dataflow is mandatory to pass the exam but understanding Apache Spark and Flink is also important to become data engineer.

Thanks for your time!

Liked this blog ? Find more @ :
Please subscribe to my youtube channel for tech related contents.

Analytics Vidhya is a community of Analytics and Data Science professionals. We are building the next-gen data science ecosystem

Recommended from Medium

Orion Money bounty contest

A #100DaysOfCode Timeboxed Front-End Development Curriculum

Airdrop: Aurix Exchange

GoLang discussion series —  The beginning

M3O Latest Updates — February 2022

Staff Augmentation vs Managed Services

Create notifications in minutes with Amazon SNS and Python

Learn Factory Design Pattern

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store


Google Cloud Certified Professional Data Engineer. Find more blogs:

More from Medium

Trust me I’m an Engineer — Data Scientist’s perspective on GCP Data Engineer Exam

How Data Virtualization Can Help Your Big Data Project

Warehousing with Google’s Big Query

Which is the better way of saving Cloud Logging for Python in Cloud Functions?