[BUG]Jumanji RWARE reward shape mismatch #1110

lbeyers · 2024-10-24T13:33:23Z

We found that in the 4ag RWARE task, env.reward_spec().generate_value() gives something of shape (), but the reward that environment outputs is actually of shape (num_agents).

This is curious because I believe the scenario should output a reward of shape ().

Maybe it's a wrapper thing?

sash-a · 2024-11-05T12:35:56Z

Thanks Louise looks like this is a bug, we duplicated the reward and discount but never update the specs, my vote is to just not duplicate it and add a dummy agent dim so everything behaves the same.

lbeyers added the bug Something isn't working label Oct 24, 2024

sash-a added the good first issue Good for newcomers label Nov 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG]Jumanji RWARE reward shape mismatch #1110

[BUG]Jumanji RWARE reward shape mismatch #1110

lbeyers commented Oct 24, 2024 •

edited by sash-a

Loading

sash-a commented Nov 5, 2024

[BUG]Jumanji RWARE reward shape mismatch #1110

[BUG]Jumanji RWARE reward shape mismatch #1110

Comments

lbeyers commented Oct 24, 2024 • edited by sash-a Loading

sash-a commented Nov 5, 2024

lbeyers commented Oct 24, 2024 •

edited by sash-a

Loading