Manan Kshatriya in Searce
Modifying Rowkey (Schema) in Bigtable using Dataflow
Cloud Bigtable is a petabyte-scale, fully managed NoSQL database service in GCP for large analytical and operational workloads. It…
Oct 20, 2019
Manan Kshatriya in Searce
Running Spark on Cloud Dataproc and loading results to BigQuery using Apache Airflow
Apache Airflow is a popular open-source orchestration tool with many connectors to popular services and all major clouds. This blog…
Oct 1, 2019
Manan Kshatriya in Searce
Convert CSV to Parquet using Hive on Cloud Dataproc
We were recently working with a leading international voice carrier firm headquartered in the US, which wanted to build a Data Warehouse on…
Sep 25, 2018