Less Is More: How TRM Outperforms Larger Reasoning Models
Tiny Recursive Model Paper Explained
Oct 25

Continuous Thought Machines (CTMs) Explained
Is This The AI Era Beyond Transformers?
Jun 4

Perception Language Models (PLMs) by Meta Explained
A Fully Open SOTA VLM With Detailed Visual Understanding
May 3

GRPO Reinforcement Learning Explained (DeepSeekMath Paper)
Understanding The Fundamental Component Behind DeepSeek-R1
Apr 16

Cheating LLMs & How (Not) To Stop Them | OpenAI Paper Explained
Reward Hacking In LLMs — Should We Stop That?
Mar 14