Notes on Soft Actor-Critic
A short blog post explaining how SAC works. We also implement SAC from scratch and train on a few tasks.
A short blog post explaining how DDPG works. We also implement DDPG from scratch and train on a few tasks.
We compare different policy evaluation approaches on the random walk problem.
We find the shortest path on a grid using reinforcement learning.