Tingsong Ou – Medium

Tingsong Ou

Pinned

Tingsong Ou
in
Towards Data Science

A comparison of Temporal-Difference(0) and Constant-α Monte Carlo methods on the Random Walk Task

This post discusses the difference between the constant-a MC method and TD(0) methods and compared their performance on the Random Walk…

Aug 24, 2023

A comparison of Temporal-Difference(0) and Constant-α Monte Carlo methods on the Random Walk Task

Aug 24, 2023

Tingsong Ou
in
Towards Data Science

Solving Reinforcement Learning Racetrack Exercise — Building the Environment

The story discusses the solution to the racetrack exercise in the Reinforcement Learning book together.

Jul 23, 2023

Solving Reinforcement Learning Racetrack Exercise — Building the Environment

Jul 23, 2023

Tingsong Ou

[RL Notes] K-armed Bandits, Part 1

This post series includes my notes and code implementations of the knowledge and example of the book Reinforcement Learning 2nd Edition by…

May 25, 2023

[RL Notes] K-armed Bandits, Part 1

May 25, 2023

Tingsong Ou

Variational AutoEncoder, and a bit KL Divergence, with PyTorch

I. Introduction

Dec 31, 2022

Variational AutoEncoder, and a bit KL Divergence, with PyTorch

Dec 31, 2022

Tingsong Ou

A Simple AutoEncoder and Latent Space Visualization with PyTorch

I. Introduction

Dec 26, 2022

A Simple AutoEncoder and Latent Space Visualization with PyTorch

Dec 26, 2022

Tingsong Ou

Tingsong Ou

MSCD, Carnegie Mellon University. Interested in Deep Learning and Reinforcement Learning

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams