PinnedSeals_XYZ10 Tips For Presto Query Performance Optimization1. Filter by partition columnSep 4, 2021Sep 4, 2021
Seals_XYZSQL Optimization for big data — dataset preparationSQL Optimization: the process of improving both the performance (less time )and efficiency (less resource)of SQLJul 23, 20231Jul 23, 20231
Seals_XYZMacBook Setup 101 For Software Engineer New HireIn this post we will cover the ultimate guide on how to setup powerful terminal using iTerm2 & Oh My Zsh. The whole process can be divided…Dec 28, 2022Dec 28, 2022
Seals_XYZCommon Table Expression (CTE) Support in Spark SqlIn this post we will talk about the CTE support in spark 2.4 and spark 3.xMay 22, 20221May 22, 20221
Seals_XYZEfficient IntelliJ IDEA Debugging: BreakpointsDo you know know there are actually 4 types of breakpoints available in IntelliJ IDEA? You are not the only one who only use the Line…Mar 7, 2022Mar 7, 2022
Seals_XYZData Engineer RoadMap Series II (Job Scheduler: Airflow)In this post I will cover below items to help you write you first data pipeline.Feb 13, 2022Feb 13, 2022
Seals_XYZCode Review Guide For The DeveloperThis guide includes best practices for developers going through code review. For details, please check this repo.Oct 4, 2021Oct 4, 2021
Seals_XYZCode Review Guide For The ReviewerThis guide includes recommendations on the best way to do code reviews. For details, please check this repo.Oct 3, 2021Oct 3, 2021
Seals_XYZData Engineer RoadMap Series II (Fundamentals)In this post I will cover fundamentals which can help you ramp up quickly.Oct 3, 2021Oct 3, 2021
Seals_XYZData Engineer RoadMap Series I (Overview)Recently, I keep thinking on how to help people who have great interests in becoming a data engineer and then I decide to write a series of…Oct 3, 2021Oct 3, 2021