DeepMind and OpenAI Ideas to Incorporate Human Feedback in Reinforcement Learning Agents
A paper from two years ago introduces some clever ideas to fine-tune reward functions in reinforcement learning agents.
