Notes on Soft Actor-Critic
A short blog post explaining how SAC works. I also implement SAC from scratch and train it on a few tasks.
A short blog post explaining how DDPG works. I also implement DDPG from scratch and train it on a few tasks.
I compare different policy evaluation approaches on the random walk problem.
Find the shortest path on a grid using RL.