Blog

2021

Notes on Soft Actor-Critic

4 minute read

A short blog post explaining how SAC works. I also implement SAC from scratch and train on a few tasks.

2019

Notes on DDPG

4 minute read

A short blog post explaining how DDPG works. I also implement DDPG from scratch and train on a few tasks.

2017

Random Walk

1 minute read

I compare different policy evaluation approaches on the random walk problem.

Grid World

less than 1 minute read

Find the shortest path on a grid using RL.