Edwin TaninPython in Plain EnglishPySpark DataFrame Comparison: A Must-Know Skill for Data Scientists & EngineersHow to create custom .equals() method in Pyspark for comparing DataFrames·6 min read·Sep 6, 2023--1--1
Edwin TaninTowards AIBeyond Tutorials: Learning Data Analysis with LangChain’s Pandas AgentHow to leverage on LangChain's Pandas Agent as your co-pilot.·10 min read·Sep 3, 2023--2--2
Edwin TaninTowards Data ScienceHow to Detect Drift in Machine Learning ModelsThis might be the reason why your model performance degrades in production.·8 min read·Feb 6, 2023--2--2
Edwin TaninTowards AIHow to Anonymize Faces in Images with Deep Learning and Computer VisionBuild an image processing pipeline to detect and blur faces with RetinaFace and OpenCV·7 min read·Jan 17, 2023----
Edwin TaninTowards Data ScienceProductionize Machine Learning Models with Serverless Container ServicesHow to create serverless containerized inference endpoint for your machine learning models with Azure Container App·7 min read·Jan 9, 2023----
Edwin TaninTowards Data ScienceHow to Test PySpark ETL Data PipelineValidate big data pipeline with Great Expectations·6 min read·Dec 6, 2022--1--1
Edwin TaninTowards Data ScienceHow to Prepare Scikit-Learn Models for ProductionServe scikit-learn models with FastAPI and Docker·7 min read·Sep 27, 2022--2--2
Edwin TaninTowards Data ScienceHow to Test Pandas ETL Data PipelineTest your Pandas ETL data pipeline with Great Expectations·7 min read·Sep 19, 2022--2--2
Edwin TaninTowards Data Science10 VSCode Productivity Hacks for Data Scientists10x your productivity with these VSCode extensions·6 min read·Sep 12, 2022--1--1
Edwin TaninTowards Data ScienceHow to Generate Docstrings for Data Science ProjectsGenerate clear and well formatted python docstrings in seconds·5 min read·Aug 31, 2022--1--1