Key Concepts in Dataplex (Part 1)

Diksha Chourasiya
Aug 4, 2023

Dataplex

Before you get hands-on with Dataplex, let’s get familiar with a few important terms you should know while working with it.

🔺Key Concepts in Google Cloud Dataplex

✅Dataproc Metastore

✅Lake

✅Zones

✅Assets

1️⃣What is Dataproc Metastore ❓

Dataproc Metastore is a managed Apache Hive Metastore service that offers 100% OSS compatibility when accessing database and table metadata stored in the service. In Dataplex, it is used for managing the metadata of your data.

2️⃣What is a Lake ❓

Dataplex organizes data stored in Google Cloud Storage and BigQuery into “lakes” and “zones”. A Dataplex lake most commonly maps to a Data Mesh domain.

3️⃣What is a Zone ❓

Within a lake, Dataplex organizes structured and unstructured data into “zones”. Every zone you create in a lake is one of two types:

• Raw zone

• Curated zone
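To make the lake and zone relationship concrete, here is a minimal Python sketch that models it with plain data classes. It does not call the Dataplex API; the class names (`Lake`, `Zone`, `ZoneType`) and the IDs (`sales-lake`, `landing`, `reporting`) are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List


class ZoneType(Enum):
    """The two zone types described above."""
    RAW = "RAW"
    CURATED = "CURATED"


@dataclass
class Zone:
    zone_id: str
    zone_type: ZoneType


@dataclass
class Lake:
    lake_id: str
    zones: List[Zone] = field(default_factory=list)

    def add_zone(self, zone_id: str, zone_type: ZoneType) -> Zone:
        """Create a zone of the given type inside this lake."""
        zone = Zone(zone_id, zone_type)
        self.zones.append(zone)
        return zone

    def zones_of_type(self, zone_type: ZoneType) -> List[str]:
        """Return the IDs of all zones in this lake with the given type."""
        return [z.zone_id for z in self.zones if z.zone_type is zone_type]


# Hypothetical lake for a "sales" domain with one zone of each type.
sales = Lake("sales-lake")
sales.add_zone("landing", ZoneType.RAW)
sales.add_zone("reporting", ZoneType.CURATED)

print(sales.zones_of_type(ZoneType.RAW))
print(sales.zones_of_type(ZoneType.CURATED))
```

The point of the sketch is only the shape of the hierarchy: a lake owns its zones, and each zone carries exactly one type.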

4️⃣What are Assets ❓

An asset maps to data stored in either Cloud Storage or BigQuery. You attach a Cloud Storage bucket or a BigQuery dataset to a zone as an asset, and the BigQuery tables or sets of files within a folder that it contains become part of that zone.
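Because an asset sits inside a zone, which in turn sits inside a lake, its full resource name spells out that whole path. The sketch below builds such a name with a small helper. The function `asset_resource_name` and every ID passed to it are hypothetical; the path layout follows the usual Google Cloud resource-name pattern, so check it against the official documentation before relying on it.

```python
def asset_resource_name(project: str, location: str, lake: str,
                        zone: str, asset: str) -> str:
    """Build a hierarchical resource name: project > location > lake > zone > asset."""
    return (
        f"projects/{project}/locations/{location}"
        f"/lakes/{lake}/zones/{zone}/assets/{asset}"
    )


# Hypothetical IDs, used only to show the nesting.
name = asset_resource_name(
    project="my-project",
    location="us-central1",
    lake="sales-lake",
    zone="landing",
    asset="raw-orders",
)
print(name)
```

Reading the name from left to right gives you the same order in which you would create the resources: the lake first, then a zone in it, then an asset in that zone.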

👉Want to explore more about Cloud Dataplex? 🤔 💭💭

Here is the official documentation link 👇

🔗 https://cloud.google.com/dataplex

Happy Learning . . . . 📖

— — — — — — — — — — — — — — — — — — -

📌 Follow me on LinkedIn: Diksha Chourasiya

Diksha Chourasiya

Hi, I am Diksha Chourasiya, working at Fractal as a Data Engineer. I completed my Master's at Birla Institute of Technology.