Shenggang LiinLevel Up CodingExploration of Reinforcement Learning in LLMsUnveiling the Power of RL in LLMs for Interactive Dialogues, Intelligent Retrieval, and Discoveries·14 min read·19 hours ago--
Stephanie SheninTowards Data ScienceDeep Reinforcement Learning: Toward Integrated and Unified AICan AI provide a lens on human intelligence?·15 min read·6 days ago--1
Vyacheslav EfimovinTowards Data ScienceReinforcement Learning, Part 4: Monte Carlo ControlHarnessing Monte Carlo algorithms to discover the best strategies13 min read·Jun 11, 2024--1--1
Yuki MinaiMuZero: Model-based RL (part1)In previous posts, I introduced various Reinforcement Learning (RL) methods such as Q-learning, Deep Q-learning, and Actor-Critic. These…8 min read·3 hours ago----
From Narrow To General AIThinking is an act of imposing one’s will onto truth, not passive predictionAddressing Nietzsche’s riddle of the unseen causes of thoughts15 min read·May 25, 2024--10--10
Shenggang LiinLevel Up CodingExploration of Reinforcement Learning in LLMsUnveiling the Power of RL in LLMs for Interactive Dialogues, Intelligent Retrieval, and Discoveries·14 min read·19 hours ago--
Stephanie SheninTowards Data ScienceDeep Reinforcement Learning: Toward Integrated and Unified AICan AI provide a lens on human intelligence?·15 min read·6 days ago--1
Vyacheslav EfimovinTowards Data ScienceReinforcement Learning, Part 4: Monte Carlo ControlHarnessing Monte Carlo algorithms to discover the best strategies13 min read·Jun 11, 2024--1
Yuki MinaiMuZero: Model-based RL (part1)In previous posts, I introduced various Reinforcement Learning (RL) methods such as Q-learning, Deep Q-learning, and Actor-Critic. These…8 min read·3 hours ago--
From Narrow To General AIThinking is an act of imposing one’s will onto truth, not passive predictionAddressing Nietzsche’s riddle of the unseen causes of thoughts15 min read·May 25, 2024--10
Vyacheslav EfimovinTowards Data ScienceReinforcement Learning, Part 1: Introduction and Main ConceptsMaking the first step into the world of reinforcement learning11 min read·Apr 9, 2024--1
Hanho RyuSummary of DeepMind x UCL RL Lecture (2021): Introduction [1]This article is based on “DeepMind & UCL RL Lecture Series (2021)”. This article includes the contents of the lecture along with my own…5 min read·17 hours ago--
Vyacheslav EfimovinTowards Data ScienceReinforcement Learning, Part 3: Monte Carlo MethodsFrom casinos to AI: unveiling the power of Monte Carlo methods in complex environments12 min read·May 23, 2024--