Shuyi Yang in Towards Data Science. Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit. Tutorial for GenAI beginners: let’s build a very simple RAG (Retrieval Augmented Generation) system locally, step-by-step. Mar 13.
Shuyi Yang in Towards Data Science. Low Quality Image Detection. How to detect low-quality images using Machine Learning and Deep Learning. Jan 5.
Shuyi Yang in Towards Data Science. Data Anonymization with Autoencoders. How to anonymize data while preserving its predictive power. Nov 28, 2020.
Shuyi Yang in Towards Data Science. Keyword Extraction: from TF-IDF to BERT. How to perform keyword extraction in Python with TF-IDF, TextRank, TopicRank, YAKE!, and KeyBERT. Nov 25, 2020.
Shuyi Yang in Analytics Vidhya. Spatial AutoRegressive (SAR) Models Estimation. How to estimate the coefficients of a Spatial AutoRegressive model with Maximum Likelihood or Bayesian Estimation. Nov 13, 2020.
Shuyi Yang in Towards Data Science. Data Scientist: from zero to hero. How to become a great data scientist and stay up to date with the latest algorithms and technologies. Nov 9, 2020.
Shuyi Yang in Towards Data Science. Apache Kafka: Docker Container and examples in Python. How to install Kafka using Docker and produce/consume messages in Python. Aug 19, 2020.
Shuyi Yang in Towards Data Science. Basic Linux Console Commands Every Data Scientist Should Know. A list of the most useful terminal commands for a data scientist: file management, system administration, and text file manipulation. Jul 27, 2020.
Shuyi Yang in Towards Data Science. How to write and render LaTeX math formulas on Medium. In this article we list some methods to write and visualize math formulas on Medium: LaTeX to image, Unicode conversion, browser addons… Jul 19, 2020.
Shuyi Yang in Towards Data Science. How to query Neo4j from Python. Using Neo4j in Python: a quick and dirty introduction to the Neo4j Python Driver and the Cypher Query Language. Jul 14, 2020.