A Python project template (pinned): Creating a Python project involves tiresome work. There are numerous repeated steps, such as creating setup.py, configuring MyPy, Pytest, and… (Jan 14, 2022)
The price paid for lack of configuration schema validation: My last blog post was about configuration schema and validation. The very next day, I ran into a nasty configuration issue at work. (Apr 14, 2022)
The configuration schema using Pydantic: As software over-engineers, we want to extract parameters into configuration so we can make our lives easier. (Apr 9, 2022)
Writing documentation is not a waste of time: Here's a riddle. What is it? When you write it, no one uses it, but when you don’t write it, everyone desires it. (Mar 18, 2022)
Expected value as evaluation metric in Machine Learning: In this blog, we are going to explore the expected value as an evaluation metric for a binary classifier. (Feb 28, 2022)
Runtime control of data shuffling in Apache Flink (published in SelectFrom): One of every data engineer's old nemeses is data shuffling. It may be problematic when application logic changes at runtime. (Feb 16, 2022)
Hidden obstacles when choosing tech: Working on the bleeding edge might sound cool and tempting, but it is not always the best choice. (Feb 3, 2022)
Dynamic filtering in Flink (published in SelectFrom): Stream processing use case in Apache Flink: How to Change Filtering Rules at Runtime without Restarting the Application (Jan 27, 2022)
Who wrote this sh*tty code? A CTO and a dev are looking at the legacy codebase to fix a bug. "Who wrote this shit?" asked the CTO. In the end, the answer was: he did. (Jan 20, 2022)
The evaluation metrics and error analysis in ML projects: This blog covers how we can improve the machine learning journey by integrating an optimization metric and error analysis into our research. (Jan 6, 2022)