Harun Raseed Basheer | Securing Your Data in Azure — Best Practices and Tools | Securing your data in Azure is a critical aspect of data management, particularly for businesses that need to protect sensitive… | May 15, 2023
Harun Raseed Basheer | Storage Event Trigger in Databricks | Traditionally, when we schedule a job in Databricks, it triggers at the scheduled time. But if we want to process the data in real time… | Mar 6, 2023
Harun Raseed Basheer | Child Pipeline Return value customization in Azure Data Factory/Azure Synapse Pipeline | When building complex Pipelines in the cloud with Azure Data Factory and Azure Synapse Pipelines, a very common pattern is to separate… | Feb 17, 2023
Harun Raseed Basheer | Table Clone in Databricks | We can create a copy of an existing Delta Lake table on Databricks at a specific version using the clone command. Clones can be either deep… | Nov 7, 2022
Harun Raseed Basheer | Spark Logical And Physical Plans | We write our Spark program and execute it to get the desired output, but Spark does a lot of background work to process our data… | Mar 11, 2022
Harun Raseed Basheer | Performance Enhancement in Delta Lake | To improve query speed, Delta Lake on Azure Databricks supports the ability to optimize the layout of data stored in cloud storage. | Mar 3, 2022
Harun Raseed Basheer | VACUUM in Delta Lake | In Delta Lake, whenever we overwrite or delete records from a Delta table, they are not permanently deleted from the underlying file… | Feb 28, 2022
Harun Raseed Basheer | What is Delta Lake? | Delta Lake is an open-source architecture for building a Lakehouse, by creating a structured layer for all types of data (including… | Feb 26, 2022