Blog

2021

Notes on Soft Actor-Critic

4 minute read

A short blog post explaining how SAC works. We also implement SAC from scratch and train it on a few tasks.

2019

Notes on DDPG

4 minute read

A short blog post explaining how DDPG works. We also implement DDPG from scratch and train it on a few tasks.

2017

Random Walk

1 minute read

We compare different policy evaluation approaches on the random walk problem.

Grid World

less than 1 minute read

We find the shortest path on a grid using RL.