Skip to content

Dramatic performance difference between JaxMARL SMAX vs. Mava SMAX? #1144

Answered by RuanJohn
Chulabhaya asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @Chulabhaya. Thank you for the question.

We are in the process of reworking Mava's Readme, so the results should be updated soon as well.

There are a few potential reasons for the performance differences:

  1. There are two model architecture differences between Mava and JaxMARL's implementations.
    a. In Mava we do not use a shared actor-critic network with two separate heads that is optimised by a single optimiser with a shared loss. Instead, we have completely independent actor and critic networks which get optimised with their own losses and optimisers. We did this based on the recommendations from The Surpirising Effectiveness of PPO in Cooperative Multi-Agent Games. As well as the imp…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by RuanJohn
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants