Karen Zhang in Data Engineer Things · The Inheritance Schema Design Pattern for MongoDB Data Modelling · In the world of NoSQL databases, particularly MongoDB, designing an efficient data model is crucial for optimal application performance… · 6 min read · May 12, 2024
Karen Zhang in Data Engineer Things · Navigating Data Lake Challenges: Unveiling SPL as the Open-Source Solution · Introducing SPL — the open data lake computing engine to solve the three main challenges for Data Lake: Balancing High-fidelity Data… · 11 min read · Jan 26, 2024
Karen Zhang in Data Engineer Things · An Intro to DuckDB: The SQLite for Analytics · When, Why, and How You Should Consider Using DuckDB · 6 min read · Nov 12, 2023
Karen Zhang in Data Engineer Things · 10 Things I Learned from Reading Fundamentals of Data Engineering · After two enriching years as a Data Engineer, I finally had the chance to dive into Fundamentals of Data Engineering written by the… · 10 min read · Aug 2, 2023
Karen Zhang in Dev Genius · Automate Data Engineering Deployment with GitHub Actions · Using GitHub Actions to deploy ARM Template to Microsoft Azure Data Factory as an example. · 8 min read · May 13, 2023
Karen Zhang in Dev Genius · HuggingGPT — One step closer to AGI 🤗 · HuggingGPT: Using LLMs as Controllers to Coordinate Multiple AI Models for Complex Tasks with Language as a Generic Interface · 6 min read · Apr 5, 2023
Karen Zhang in Dev Genius · Data Quality Testing with dbt-expectations · Ensuring Accurate and Reliable Data Pipelines with dbt-expectations · 6 min read · Feb 22, 2023
Karen Zhang in Dev Genius · Workflow Orchestration — An Introduction to Prefect · A step-by-step guide to set up Prefect and deploy your pipelines. · 7 min read · Feb 6, 2023