Open in app

Sign in

Write

Sign in

Keven Pinto
Keven Pinto

74 Followers

Home

About

Published in

CTS Google Cloud Tech Blog

·Pinned

Dataproc Serverless & Airflow 2 Powered Event Driven Pipelines

In my previous post, I demonstrated how one can get a Dataproc Serverless pipeline up and running from the CLI. In this post, we’ll look at how Dataproc Serverless integrates seamlessly with Cloud Composer and how one can combine the two to create a simple event-driven pipeline. For those new…

Dataproc Serverless

7 min read

Dataproc Serverless & Airflow 2 Powered Event Driven Pipelines
Dataproc Serverless & Airflow 2 Powered Event Driven Pipelines
Dataproc Serverless

7 min read


Published in

CTS Google Cloud Tech Blog

·Pinned

Running pyspark jobs on Google Cloud using Serverless Dataproc

Run Spark batch workloads without having to bother with the provisioning and management of clusters!. If you are interested in running a simple pyspark pipeline in Serverless mode on the Google Cloud Platform then read on.. The ability to run spark jobs in serverless mode is a great idea, however…

Dataproc

6 min read

Running pyspark jobs on Google Cloud using Serverless Dataproc
Running pyspark jobs on Google Cloud using Serverless Dataproc
Dataproc

6 min read


Published in

CTS Google Cloud Tech Blog

·Mar 23

A review of the Google Datastream Service - Cloud SQL (PostgreSQL) to BigQuery

Datastream introduced the ability to seamlessly replicate data from Cloud SQL to BigQuery late in 2022. Given that (near) real time replication of operational databases is one of the most common uses cases we observe as part of our client requirement, we decided to review this service and put it…

Datastream

8 min read

A review of the Google Datastream Service - Cloud SQL (PostgreSQL) to BigQuery
A review of the Google Datastream Service - Cloud SQL (PostgreSQL) to BigQuery
Datastream

8 min read


Published in

CTS Google Cloud Tech Blog

·Mar 14

Using Datastream as a Data Sync and DR tool

In my last blog we looked at how near real time BigQuery data sync across two regions could not be achieved using the dataset copy service provided by Google and how we had to abandon the approach. …

Change Data Capture

5 min read

Using Datastream as a Data Sync and DR tool
Using Datastream as a Data Sync and DR tool
Change Data Capture

5 min read


Published in

CTS Google Cloud Tech Blog

·Jan 26

BigQuery Cross-Region Replication Gotchas

2022 was the fifth warmest year on record. The July heatwave pushing temperatures to 40°C in parts of the UK was so brutal, that it was a rare occasion that my camera was switched on during office meetings. There were times, I felt that if I stayed out too long…

Bigquery

7 min read

BigQuery Cross-Region Replication Gotchas
BigQuery Cross-Region Replication Gotchas
Bigquery

7 min read


Published in

CTS Google Cloud Tech Blog

·Oct 27, 2022

A Centralised Approach to CICD of DAGs on Google Cloud Composer with Google Cloud Build — Part 2

In Part 1 of this blog we setup the infrastructure and other building blocks of our CICD pipeline — in this part we will show how we promote code within our managed environments via our CICD pipeline. A managed environment is any Cloud Composer environment managed by a CICD service…

Directed Acyclic Graph

5 min read

A Centralised Approach to CICD of DAGs on Google Cloud Composer with Google Cloud Build — Part 2
A Centralised Approach to CICD of DAGs on Google Cloud Composer with Google Cloud Build — Part 2
Directed Acyclic Graph

5 min read


Published in

CTS Google Cloud Tech Blog

·Oct 20, 2022

A Centralised Approach to CICD of DAGs on Google Cloud Composer with Google Cloud Build — Part 1

In this 2-part Blog, we shall look at how we implemented CICD for Directed Acyclic Graphs (DAGs) on Google Cloud Composer using Google Cloud Build as our CICD platform. In Part 1 (this part), we set the basic building blocks in place to allow us to do a multi project…

Google Cloud

15 min read

A Centralised Approach to CICD of DAGs on Google Cloud Composer with Google Cloud Build — Part 1
A Centralised Approach to CICD of DAGs on Google Cloud Composer with Google Cloud Build — Part 1
Google Cloud

15 min read


Published in

CTS Google Cloud Tech Blog

·Aug 11, 2022

Build, Test and Deploy Dataflow Flex Templates in Google Cloud

Dataflow Flex Templates are a great way to package and distribute your DataFlow Pipeline. That being said, many engineers trying to implement flex templates for the first time, face quite a few challenges. …

Dataflow

7 min read

Build, Test and Deploy Dataflow Flex Templates in Google Cloud
Build, Test and Deploy Dataflow Flex Templates in Google Cloud
Dataflow

7 min read


Published in

CTS Google Cloud Tech Blog

·Jun 19, 2022

Hive to BigQuery: Orchestrating the House Move of the Warehouse

Hive to BigQuery is a common data migration path and there are plenty of ways to get there. This article describes the approach we took with one of our clients. …

Data

6 min read

Hive to BigQuery: Orchestrating the House Move of the Warehouse
Hive to BigQuery: Orchestrating the House Move of the Warehouse
Data

6 min read

Keven Pinto

Keven Pinto

74 Followers

Traveller | Eco warrior | Data Engineer | Curious Fella | Foodie | Father

Following
  • Alistair Grew

    Alistair Grew

  • Cooper Thornton

    Cooper Thornton

  • Lace Chantelle Rogers

    Lace Chantelle Rogers

  • Lee Doolan

    Lee Doolan

  • Jasbirs

    Jasbirs

See all (22)

Help

Status

About

Careers

Blog

Privacy

Terms

Text to speech

Teams