Blog

2021

4 minute read

A short blog post explaining how Soft Actor-Critic (SAC) works. I also implement SAC from scratch and train it on a few tasks.

4 minute read

A short blog post explaining how Deep Deterministic Policy Gradient (DDPG) works. I also implement DDPG from scratch and train it on a few tasks.

1 minute read

I compare different policy evaluation approaches on the random walk problem.

less than 1 minute read

Find the shortest path on a grid using reinforcement learning (RL).