Gergely SotiindatamindedbeHow to Access Key Vaults from Azure Batch JobsThe cheapest and simplest way of running computational jobs on Azure is by using Azure Batch. This service enables you to launch managed…Apr 2, 20212Apr 2, 20212
Gergely SotiindatamindedbeRunning Spark 3 on AKS with Azure AD integrationDo you want to run Spark 3 on AKS in pro mode? Meaning no more “just copy-paste the storage account access key into the source code, and…Nov 2, 20201Nov 2, 20201
Gergely SotiindatamindedbeOrganize your data lake using LighthouseLighthouse is an open source library (using Apache Spark and Scala) that we developed at Data Minded, with the aim of providing a way to…Oct 31, 2019Oct 31, 2019
Gergely SotiindatamindedbeLittle known Spark DataFrame join typesProbably most of you know the basic join types from SQL: left, right, inner and outer. Since these are supported by most of the…Nov 29, 20181Nov 29, 20181
Gergely SotiindatamindedbeJoining Spark DatasetsEver wanted to do better than joins on Apache Spark DataFrames? Now you can!Nov 16, 20183Nov 16, 20183