Jiahui WanginTowards Data ScienceIntroduction of Four Types of Item Similarity MeasuresCovers how to choose the similarity measure when item embeddings are availableFeb 17, 2023Feb 17, 2023
Jiahui WanginTowards Data ScienceAn Example of Sequence Modelling with TransformerSome concepts about transformer and an example on how to build a sequence model with transformerFeb 10, 2023Feb 10, 2023
Jiahui WangStress Testing on Credit RiskCovers the background and terminologies involved in stress testing on credit riskFeb 7, 2023Feb 7, 2023
Jiahui WanginTowards Data ScienceIntroduction to NLP Deep Learning TheoriesA Summary of what I learnt from Natural Language Processing with Deep Learning coursesSep 10, 2021Sep 10, 2021
Jiahui WanginTowards Data Science3. A Case Study Of Spark Performance Optimization On Large DataframesA detailed case study of how to optimize the spark performance involving billions of recordsJan 6, 20211Jan 6, 20211
Jiahui WanginTowards Data Science2. Understanding Apache Spark Resource And Task Management With Apache YARNHow to monitor Spark resource and task management with YarnNov 24, 20201Nov 24, 20201
Jiahui WanginTowards Data Science1. Introduction To Apache SparkA kickoff post to start a new series on Exploration of Spark Performance OptimizationNov 12, 2020Nov 12, 2020
Jiahui WanginTowards Data ScienceFour Things Programmers Need To Know About Python Classes and LibrariesI never had a chance to learn Python in the classroom. Instead, I picked up the programming language by self-learning. A disadvantage of…Apr 9, 20204Apr 9, 20204
Jiahui WanginTowards Data ScienceHow To Model Time Series Data With Linear RegressionTime Series Modeling With Python CodeApr 8, 20203Apr 8, 20203
Jiahui WanginTowards Data ScienceHow To Analyse Multiple Time Series VariableTime Series Modeling With Python CodeApr 6, 2020Apr 6, 2020