Data Governance Meetup Recap
Last week we organized the first Data Governance Meetup. This session was the first in a series of meetups on technology for Data Governance. The presentation as well as the discussion were focussed on challenges in implementing data governance tasks. The session consisted of a presentation on Data Governance 101 followed by a discussion.
The main takeaways from the presentation were:
- Data Compliance, Privacy & Security is a journey
- Data governance is hard because:
- There is too much data.
- There is too much complexity in data infrastructure systems.
- There is not enough context on data usage.
3. Start with simple questions such as:
- Where is sensitive data?
- Who has access to data?
- How is the data used?
4. Automate data governance tasks. Examples are:
- PIICatcher & Data Lineage for cataloging data sets.
- Analyze access to datasets in AWS Glue Catalog.
- Storing Snowflake or AWS Redshift query history for detecting behavioral patterns.
- ProxySQL (MySQL) and pgPool (Postgres) to capture query history on production databases.
Discussion topics in the BoF session were:
- How do you assign a dollar amount to the cost & benefit of data governance?
- Challenges in supporting “Right To Delete”.
- Challenges in Data Classification considering differences in locales.
- Cultural norms in classifying data.
Revisit the discussions in the meetup:
We will organize more sessions on various topics in Data Governance such as
- Governing unstructured data
- Data Governance in cloud data platforms
- Case Studies
Follow Data Governance Meetups landing page for updates on the next meetup and topics and join the public telegram group to interact with the group https://t.me/datagovernanceindia