The Gretel Epoch #6
May 11, 2022
At Gretel, we are scientists, engineers, and developers like you. As builders, we are focused on sharing tools and techniques for creating better data using synthetic data generation technology. Here’s what’s new this month in content and our community.
NEW BLOGS
Tutorials and technical deep dives from our applied research team scientists and product engineers.
- Transforms and Multi-Table Relational Databases — a how-to on de-identifying a relational database for demo or pre-production testing environments while keeping the referential integrity of primary and foreign keys intact.
- Transforms and Synthetics on Relational Databases — a walkthrough of our new multi-table transform and multi-table synthetics notebooks, which can be used independently or simultaneously.
- ML Models 101 — a quick refresher or an intro to the fascinating world of machine learning models.
GRETEL IN THE WILD
Here are some recent interviews, articles, and public events that Gretel’s staff participated in.
- Making Data Work — on a new Greymatter podcast, Gretel CEO Ali Golshan discussed the importance of building a developer toolkit that anyone can use to create, share and collaborate with high-quality synthetic data.
- The Data Engineering Podcast — Gretel co-founder and CTO John Myers explains how we are building tools for data engineers and analysts to incorporate privacy engineering techniques into their workflows and validate the safety of their data against re-identification attacks.
- Career Roadmap: ML Scientist — an InfoWorld article, as told by Gretel principal ML scientist Amy Steier, covering a day in the life of a new role that applies a data scientist’s expertise to designing and implementing state-of-the-art algorithms.
- Synthetic Data Startups Pick Up More Real Cash — Crunchbase covered the growing investments in the synthetic data market and asked Gretel for industry insights on why finance and healthcare are so eager to embrace the new technology.
- Emerging Startups 2022: Top Big Data Analytics Startups — research firm Tracxn provides a curated list of the most promising startups in the industry. Of the more than 2,800 companies on their radar, they featured Gretel as a “Soonicorn” (a soon-to-be unicorn).
- ODSC East 2022 — Gretel Senior Applied Scientist Lipika Ramaswamy hosted not one, but two sessions at this year’s Open Data Science Conference on how to use our open-source tools to democratize access to sensitive datasets. Her workshop is now available to watch here.
- PyCon US 2022 — Gretel lead developer advocate Mason Egger was on the scene for the exciting in-person return of the largest annual conference dedicated to Python.
GRETEL COMMUNITY
Highlights from our users.
- In a newly published study, researchers used Gretel’s synthetic data API to augment a limited EEG dataset. The result: their ML model’s accuracy improved by an astounding average of 14%!
- At PyCon, Mason was not the only one talking about Gretel Synthetics. Other developers also spoke about its value, specifically in helping manage the test data nightmare.
If you have questions or just want to chat, you can connect with us directly, either through our Slack community or by email at hi@gretel.ai.
Cheers,
Team Gretel