Lili Jiang in Towards Data Science
How GPT works: A Metaphoric Explanation of Key, Value, Query in Attention, using a Tale of Potion
GPT, explained simply, in a metaphor of potion.
10 min read · Jun 17, 2023

Lili Jiang in Towards Data Science
A Visual Explanation of Gradient Descent Methods (Momentum, AdaGrad, RMSProp, Adam)
Why can AdaGrad escape saddle point? Why is Adam usually better? In a race down different terrains, which will win?
9 min read · Jun 7, 2020

Lili Jiang in Towards Data Science
An Intuitive Explanation of Kernels in Support Vector Machine (SVM)
We will walk through a simple example with basic arithmetics to demystify the concept of kernel.
4 min read · Apr 5, 2020

Lili Jiang in Towards Data Science
Intuitive Explanation of Cross Entropy
I will draw a coin from a bag. Your goal is to guess the color with the fewest questions.
5 min read · Jan 18, 2019