Karen ZhanginData Engineer ThingsThe Inheritance Schema Design Pattern for MongoDB Data ModellingIn the world of NoSQL databases, particularly MongoDB, designing an efficient data model is crucial for optimal application performance…May 12May 12
Karen ZhanginData Engineer ThingsNavigating Data Lake Challenges: Unveiling SPL as the Open-Source SolutionIntroducing SPL — the open data lake computing engine to solve the three main challenges for Data Lake: Balancing High-fidelity Data…Jan 262Jan 262
Karen ZhanginData Engineer ThingsAn Intro to DuckDB: The SQLite for AnalyticsWhen, Why, and How You Should Consider Using DuckDBNov 12, 20231Nov 12, 20231
Karen ZhanginData Engineer Things10 Things I Learned from Reading Fundamentals of Data EngineeringAfter two enriching years as a Data Engineer, I finally had the chance to dive into Fundamentals of Data Engineering written by the…Aug 2, 202316Aug 2, 202316
Karen ZhanginDev GeniusAutomate Data Engineering Deployment with GitHub ActionsUsing GitHub Actions to deploy ARM Template to Microsoft Azure Data Factory as an example.May 13, 2023May 13, 2023
Karen ZhanginDev GeniusHuggingGPT — One step closer to AGI 🤗HuggingGPT: Using LLMs as Controllers to Coordinate Multiple AI Models for Complex Tasks with Language as a Generic InterfaceApr 5, 20231Apr 5, 20231
Karen ZhanginDev GeniusData Quality Testing with dbt-expectationsEnsuring Accurate and Reliable Data Pipelines with dbt-expectationsFeb 22, 20231Feb 22, 20231
Karen ZhanginDev GeniusWorkflow Orchestration — An Introduction to PrefectA step-by-step guide to set up Prefect and deploy your pipelines.Feb 6, 2023Feb 6, 2023