Dramatic performance difference between JaxMARL SMAX vs. Mava SMAX? #1144
-
Hi there! I was wondering if you guys had any ideas as to why you're seeing such a big performance difference on SMAX with your implementations vs. the JaxMARL implementations? https://github.com/instadeepai/Mava?tab=readme-ov-file#smax |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
Hi @Chulabhaya. Thank you for the question. We are in the process of reworking Mava's Readme, so the results should be updated soon as well. There are a few potential reasons for the performance differences:
Please let us know if you have any further questions, and please keep checking back since we will have a newly overhauled Readme live soon highlighting all our most recent developments. |
Beta Was this translation helpful? Give feedback.
Hi @Chulabhaya. Thank you for the question.
We are in the process of reworking Mava's Readme, so the results should be updated soon as well.
There are a few potential reasons for the performance differences:
There are two model architecture differences between Mava and JaxMARL's implementations.
a. In Mava we do not use a shared actor-critic network with two separate heads that is optimised by a single optimiser with a shared loss. Instead, we have completely independent actor and critic networks which get optimised with their own losses and optimisers. We did this based on the recommendations from The Surpirising Effectiveness of PPO in Cooperative Multi-Agent Games. As well as the imp…