PPO + Vanilla
Pre-release
Pre-release
PPO
and Vanilla
release!
- Add PPO, one of the most popular modern RL algorithms.
- Add
Vanilla
series agents: "vanilla" implementations of actor-critic, sarsa, q-learning, and REINFORCE. These algorithms are all prefixed with the letter "v" in theagents
folder.