Mava-0.1.0
Pre-release
Pre-release
Highlights
Mava Core
-
Components
- Architectures
- Added Centralised, Decentralised, Networked and State Based Architectures.
- Modules
- Added Broadcast Communication, Epsilon Decay Scheduling, Additive and Monotonic Mixing and Fingerprint Stabilization.
- Networks
- Added Additive and Monotic Mixing Networks, Hypernetworks, Communication Networks, Epsilon Greedy and DiscreteValued head.
- Architectures
-
Environment Loops
- Added Parallel and Sequential Environment Loops.
-
Adders
- Added Parallel versions of Transition, Sequential and Episode Adders.
Systems
- Added feedforward training for maddpg, mad4pg, madqn, mappo, vdn and qmix.
- Added recurrent training for madqn, dial, maddpg and mad4pg.
- Added continuous network heads for maddpg, mad4pg and mappo.
- Added decentralised architecture training for maddpg, mad4pg, madqn, mappo, dial, vdn and qmix.
- Added centralised architecture training for maddpg, mad4pg and mappo.
- Added state based architecture training for maddpg and mad4pg.
- Added networked architecture training for maddpg.
Environments/ Environment Wrappers
- Added PettingZoo, SMAC, RoboCup, OpenSpiel, Flatland, Debug Simple Spread, Debug Switch environment and Debug Two-Step game.
Examples
- Added quickstart notebook.
- Added basic examples for sample systems and environments.