Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.
MAZero requires python3 (>=3.8) and pytorch (>=1.12) with the development headers.
conda create -n mazero python=3.8
conda activate mazero
conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 cudatoolkit=11.3 -c pytorch
pip install ray tensorboard matplotlib gymnasium wandb seaborn scipy opencv-python==4.5.1.48 cython==0.29.23
For SMAC installation, please refer to https://github.com/oxwhirl/smac.
Before starting training, you need to build the c++/cython style external packages. (GCC version 7.5+ is required.)
cd core/mcts/ctree
bash make.sh
conda activate mazero
python test/test_train_sync_parallel.py
We provide train_smac.sh
for training.
- With 1 GPUs (A100 80G):
bash train_smac.sh
If you find this repo useful, please cite our paper:
@inproceedings{liu2023efficient,
title={Efficient Multi-agent Reinforcement Learning by Planning},
author={Liu, Qihan and Ye, Jianing and Ma, Xiaoteng and Yang, Jun and Liang, Bin and Zhang, Chongjie},
booktitle={The Twelfth International Conference on Learning Representations},
year={2023}
}
We appreciate the following github repos a lot for their valuable code base implementations:
https://github.com/YeWR/EfficientZero