Haitham Bou Ammar – Medium

Haitham Bou Ammar

Pinned

Haitham Bou Ammar
in
Becoming Human: Artificial Intelligence Magazine

Safe Reinforcement Learning — Part I

Many practitioners using reinforcement learning (RL) are often concerned about the safety of SOTA deep RL techniques. With that in mind…

Nov 14, 2022

Safe Reinforcement Learning — Part I

Nov 14, 2022

Haitham Bou Ammar

From MCTS to Alpha-Zero with PyTorch — Part I (Building a Tic-Tac-Toe’r)

AlphaZero is a deep reinforcement learning algorithm developed by DeepMind that has achieved superhuman performance in games like Chess…

Sep 9

From MCTS to Alpha-Zero with PyTorch — Part I (Building a Tic-Tac-Toe’r)

Sep 9

Haitham Bou Ammar

Short Circuit — Let AI Design your Chips

Introduction

Aug 28

Short Circuit — Let AI Design your Chips

Aug 28

Haitham Bou Ammar

New Grounds in Theorem Proving with DeepSeek-Prover-V1.5

DeepSeek-Prover-V1.5 represents a significant leap forward from its predecessor, DeepSeek-Prover-V1. This new iteration is designed for…

Aug 18

New Grounds in Theorem Proving with DeepSeek-Prover-V1.5

Aug 18

Haitham Bou Ammar

Deriving DPO’s Loss

Direct preference optimisation has become critical for aligning LLMs with human preferences. I have been talking to many people about it…

Aug 15

Deriving DPO’s Loss

Aug 15

Haitham Bou Ammar

Auto-DS (I): The Data Interpreter

Introduction

Aug 13

Auto-DS (I): The Data Interpreter

Aug 13

Haitham Bou Ammar

Pluralistic Alignment of LLMs: Fix your Algorithm not just your data

Interjection: Recent studies have found that large language models (LLMs) are biased, with many articles demonstrating these biases and…

Jul 22

Pluralistic Alignment of LLMs: Fix your Algorithm not just your data

Jul 22

Haitham Bou Ammar

A Leap Towards Human-Like AI: Recreating Human Memory in LLMs

In artificial intelligence, large language models (LLMs) have demonstrated remarkable capabilities in understanding and generating…

Jul 20

A Leap Towards Human-Like AI: Recreating Human Memory in LLMs

Jul 20

Haitham Bou Ammar

Transforming Robot Programming with Language AI

Intuitive robot programming enables non-experts to interact with and control robotic systems effectively. Our ROS-LLM framework allows you…

Jul 19

Transforming Robot Programming with Language AI

Jul 19

Haitham Bou Ammar

Optimal Control from Natural Language

Generating model predictive control without domain expertise via large language models!

Feb 13

Optimal Control from Natural Language

Feb 13

Haitham Bou Ammar

Haitham Bou Ammar

RL team leader @Huawei R&D UK H. Assistant Professor @UCL Researcher in #RL, #MachineLearning, #BayesianOpt Concisely (densely) derives #ML Math & Tricks

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams