PinnedHenry WuIntro to Reinforcement Learning: Monte Carlo to Policy GradientThis post is an intro to reinforcement learning, in particular, Monte Carlo methods, Temporal Difference Learning, Deep Q-learning, Policy…Feb 151Feb 151
Henry WuTrust Region Policy Optimization ExplainedThis post explains Trust Region Policy Optimization (TRPO) and shows how it addresses common issues in vanilla policy gradient methods.Mar 9Mar 9
Henry WuExploring Vector Databases with MilvusThis post introduces vector databases with Milvus, an open-source vector data management system to efficiently store and search large-scale…Feb 6Feb 6
Henry WuUnderstanding Word Embeddings with KerasIn this post, we will cover word embeddings, an approach in NLP for representing text as real value vectors that capture complex…Jan 30Jan 30
Henry WuUnderstanding Byte-Pair EncodingThis post explores the process of Byte-Pair Encoding, from handling raw training text and pre-tokenization to constructing vocabularies and…Jan 26Jan 26
Henry WuOptimizing N-Body Simulation with Barnes-Hut Algorithm and CUDAThis post examines N-body simulations using the Barnes-Hut algorithm and its parallel implementation in CUDA. We will first cover the…Jan 241Jan 241
Henry WuDeep Dive into Raft: Consensus Algorithms in Distributed SystemsIn this post, we take a deep dive into the Raft consensus algorithm, essential for distributed systems. We explore key mechanisms like…Jan 21Jan 21