InData Science in Your PocketbyMehul GuptaAdvantage Actor-Critic (A2C) algorithm in Reinforcement Learning with Codes and Examples using…Combining DQNs and REINFORCE algorithm for training agentsApr 14, 2023Apr 14, 2023
InTDS ArchivebyHennie de HarderSolving Multi-Armed Bandit ProblemsA powerful and easy way to apply reinforcement learning.Nov 4, 20225Nov 4, 20225
InNetflix TechBlogbyNetflix Technology BlogReinforcement Learning for Budget Constrained Recommendationsby Ehtsham Elahi with James McInerney, Nathan Kallus, Dario Garcia Garcia and Justin BasilicoAug 15, 20222Aug 15, 20222
InTDS ArchivebySamuel FlenderThe Joy of A/B Testing, Part II: Advanced TopicsCookies and privacy, interleaving experiments, clean dial-ups, and test metricsAug 13, 2022Aug 13, 2022
Edoardo ContiOffline Policy Evaluation: Run fewer, better A/B testsHow offline policy evaluation works, examples on how to use it, and lessons learned from building OPE at FacebookJun 10, 20211Jun 10, 20211
InAnalytics VidhyabyShishir KumarSlateQ: A scalable algorithm for slate recommendation problemsI was recently introduced to the wonderful world of Reinforcement Learning (RL) and wanted to explore its applications in recommender…Jul 31, 20202Jul 31, 20202