Quickly Enable Unity on your Azure Workspace
Author: Jason Drew
Intro:
As you probably already know, Unity is the future of Databricks. It gives you a consistent governance method across all your assets in your Delta Lakehouse as well as enhanced search, lineage, auditing and sharing capabilities. Not only that, but many upcoming features such as Serverless Workflows will require Unity.
This isn’t meant to be an all inclusive Production implementation guide, rather a quick start to show how to enable Unity in your workspace in the simplest way possible. Also, while the concepts maybe similar in other clouds, this is specific to Azure.
Personally I’m a very visual learner, so while these steps are in the documentation, I thought it would be useful to actually show each step visually as best I could. To help facilitate this, I spun up a fresh Azure Workspace from scratch so everything is a clean and easy to follow.
Prerequisites:
- You must have Databricks Account Admin access. At first, only your AD Global Administrator role will have access your Databricks Account Admin interface, but they can grant other users the Account admin role.
- You need an ADLS Gen 2 storage account with Hierarchical namespace Enabled within the same Region as your Workspace for your “Unity Metastore”. I put this in quotes because this isn’t actually where metadata is stored, rather just the default location for MANAGED tables. Also, while multiple Workspaces within an account can share the same Metastore, this is limited to Workspaces within the same Region.
- You need an Access Connector for Azure Databricks that is assigned the Storage Blob Data Contributor role to a Scope that includes the Storage Account mentioned above. You can also use this as the Storage Credential for other External Locations if you like.
Enabling Unity:
Wrap Up:
Congratulations, you have enabled Unity for your Workspace! You are now on your way to better Datalake management and set for the future. If you’re interested in how to migrate existing External Hive tables to Unity you can check out my post Migrate External Hive Streaming tables to Unity in an Azure Workspace.