[BugFix] action_spec_unbatched whenever necessary #2592

vmoens · 2024-11-20T11:49:46Z

Stack from ghstack (oldest at bottom):

cc @matteobettini we now have a more generic single_action_spec that is somewhat similar to unbached_action_spec (name is borrowed from gymnasium)
There is something a bit fishy about unbatched_action_spec though which is that it's the unbatched_full_action_spec, and it doesn't (always) match action spec in its sturcture (it will always be composite).

LMK if these changes make sense (implemented these changes to fix tutorials/sphinx-tutorials/multiagent_competitive_ddpg.py which was broken)

[ghstack-poisoned]

pytorch-bot · 2024-11-20T11:49:49Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2592

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

[DomainsOnly] Jobs fail with GLIBC version not found

❌ 3 New Failures, 1 Cancelled Job, 15 Unrelated Failures

As of commit 98a3b39 with merge base a47b32c ():

NEW FAILURES - The following jobs have failed:

Continuous Benchmark (PR) / CPU Pytest benchmark (gh)
FAILED ../../../../../../tmp/test_objectives_benchmarks.py::test_iql_speed[True-None] - torch._dynamo.exc.Unsupported: Graph break under GenericContextWrappingVariable
Continuous Benchmark (PR) / GPU Pytest benchmark (gh)
FAILED ../../../../tmp/test_objectives_benchmarks.py::test_iql_speed[True-None] - torch._dynamo.exc.Unsupported: Graph break under GenericContextWrappingVariable
SOTA Tests on Linux / tests (3.9, 12.1) / linux-job (gh)
RuntimeError: Command docker exec -t 45d00801dc209b9d4eeb8b68c6125c8092b0db32e3985f2ef76dbd5d324d273b /exec failed with exit code 1

CANCELLED JOB - The following job was cancelled. Please retry:

Generate documentation / build-docs (3.10, 12.1) / linux-job (gh)
##[error]The operation was canceled.

FLAKY - The following job failed but was likely due to flakiness present on trunk:

Unit-tests on Linux / tests-cpu-oldget (3.12) / linux-job (gh) (similar failure)
test/test_trainer.py::TestRecorder::test_recorder

BROKEN TRUNK - The following jobs failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Build Linux Wheels / pytorch/rl (pytorch/rl, test/smoke_test.py, torchrl) / upload / manywheel-py3_9-rocm6_1 (gh) (trunk failure)
##[error]Unable to find an artifact with the name: pytorch_rl__3.9_rocm6.1_x86_64
Build Linux Wheels / pytorch/rl (pytorch/rl, test/smoke_test.py, torchrl) / upload / manywheel-py3_9-rocm6_2 (gh) (trunk failure)
##[error]Unable to find an artifact with the name: pytorch_rl__3.9_rocm6.2_x86_64
Build Windows Wheels / pytorch/rl / upload / wheel-py3_9-cpu (gh) (trunk failure)
##[error]Unable to find any artifacts for the associated workflow
Build Windows Wheels / pytorch/rl / upload / wheel-py3_9-cuda11_8 (gh) (trunk failure)
##[error]Unable to find any artifacts for the associated workflow
Build Windows Wheels / pytorch/rl / upload / wheel-py3_9-cuda12_4 (gh) (trunk failure)
##[error]Unable to find any artifacts for the associated workflow
Habitat Tests on Linux / tests (3.9, 12.1) / linux-job (gh) (trunk failure)
AttributeError: _ARRAY_API not found
Libs Tests on Linux / unittests-jumanji (3.9, 12.1) / linux-job (gh) (trunk failure)
test/test_libs.py::TestJumanji::test_jumanji_rendering[batch_size1-RubiksCube-v0]
Unit-tests on Linux / tests-cpu (3.10) / linux-job (gh) (trunk failure)
test/test_trainer.py::TestRecorder::test_recorder
Unit-tests on Linux / tests-cpu (3.11) / linux-job (gh) (trunk failure)
test/test_trainer.py::TestRecorder::test_recorder
Unit-tests on Linux / tests-cpu (3.12) / linux-job (gh) (trunk failure)
test/test_trainer.py::TestRecorder::test_recorder
Unit-tests on Linux / tests-cpu (3.9) / linux-job (gh) (trunk failure)
test/test_trainer.py::TestRecorder::test_recorder
Unit-tests on Linux / tests-gpu (3.11, 12.1) / linux-job (gh) (trunk failure)
test/test_trainer.py::TestRecorder::test_recorder
Unit-tests on Linux / tests-stable-gpu (3.10, 11.8) / linux-job (gh) (trunk failure)
test/test_trainer.py::TestRecorder::test_recorder
Unit-tests on Windows / unittests-cpu / windows-job (gh) (trunk failure)
##[error]Process completed with exit code 1.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: a6748fa882a41fdd50795b46b261e6e214af2c0e Pull Request resolved: #2592

matteobettini

Got it.

Love that there is a way to get the unbatched specs easily now. This is super helpful.

I guess the prefix single_ has already been decided but I will give my 2 cents on this anyway as I think there are some important points.

I think single_ is a really confusing name. In gymnasium I guess it can work as the library is not multi-agent/multi-task. but in torchrl it sounds super confusing to me:

it could be meaning single agent
or single spec (as opposed to full spec). this is why single_full_spec sounds oximoric to me
it does not correlate with the concept of a batch_size

To me the prefix unbatched_ (or variations of it) drives the concept home better without any of the confusions above

matteobettini · 2024-11-20T12:11:25Z

sota-implementations/multiagent/iql.py

@@ -91,7 +91,7 @@ def train(cfg: "DictConfig"):  # noqa: F821
            ("agents", "action_value"),
            ("agents", "chosen_action_value"),
        ],
-        spec=env.unbatched_action_spec,
+        spec=env.single_action_spec,


Suggested change

spec=env.single_action_spec,

spec=env.single_full_action_spec,

Wherever unbatched_action_spec is used, it should be changed to single_full_action_spec

This is valid for all the places here. As the components that were receiving unbatched_ are expecting a composite

tutorials/sphinx-tutorials/multiagent_competitive_ddpg.py

tutorials/sphinx-tutorials/multiagent_ppo.py

vmoens · 2024-11-20T12:27:28Z

We can name it unbatched but it will clash with VMAS unbatched (which are somewhat different).
It's a nightly feature IIRC so we can do it

matteobettini · 2024-11-20T12:31:44Z

We can name it unbatched but it will clash with VMAS unbatched (which are somewhat different). It's a nightly feature IIRC so we can do it

Yeah, the vmas ones were there for the absence of this feature. Now that this exists the vmas ones could be removed.

Your are right that the vmas name implied full so there would be a clash there.

We could think about how to best deal with this but i think torchrl should not be prevented to use the better name cause of this

matteobettini · 2024-11-20T12:34:22Z

so the difference between full and normal is just present when there is only one agent group.

What we could do is that for a period of time we could warn vmas users that request unbatched_spec in envs with just one group that this is now returning the non-composite version and if they want the old composite one there is unbatched_full_spec

vmoens · 2024-11-20T13:07:50Z

That's bc breaking, we need to warn them the behaviour will change and they can make the warning disappear by using the full version

vmoens · 2024-11-20T13:35:35Z

Intermediate solution: single_action_spec becomes action_spec_unbatched
that way there is no conflict, and we can just raise a warning in VMAS to let people know about the new API. Wdyt?

matteobettini · 2024-11-20T13:42:45Z

Intermediate solution: single_action_spec becomes action_spec_unbatched that way there is no conflict, and we can just raise a warning in VMAS to let people know about the new API. Wdyt?

Genius! Love it

[ghstack-poisoned]

ghstack-source-id: 4168c6c8b6b5febd8db4fd43e71e46e9bfeb10cc Pull Request resolved: #2592

vmoens · 2024-11-20T14:04:07Z

Ok I also simplified the mock envs that had unbatched specs, LMK if that makes sense!

matteobettini · 2024-11-20T14:11:15Z

I think that makes sense. My comments from the first review still hold tho. I would just make this a refactoring and no logic change

so unbatched_spec in vmas should be translated to full_spec_unbatched

vmoens · 2024-11-20T14:34:17Z

I would just make this a refactoring and no logic change

There is not logic change, see my comment above

[ghstack-poisoned]

ghstack-source-id: f346c47cd2d87a9306059e3ca56affcc68a7ff9c Pull Request resolved: #2592

[ghstack-poisoned]

ghstack-source-id: c235e024bc208155e0c74d08c67a581b6a7cbc79 Pull Request resolved: #2592

vmoens · 2024-11-20T14:53:51Z

@matteobettini I'll wait until the tests pass but it should be good.
I did not make any deprecation in VMAS, not sure how you want to procede there.

matteobettini · 2024-11-20T14:55:38Z

Maybe we can put a warning to use the new feature?

[ghstack-poisoned]

ghstack-source-id: ec87794dabaf5023dac85cfc898a7c000e93331d Pull Request resolved: #2592

Update

c40a365

[ghstack-poisoned]

vmoens mentioned this pull request Nov 20, 2024

[BugFix] make buffers zero-dim in exploration modules #2591

Merged

vmoens added a commit that referenced this pull request Nov 20, 2024

[BugFix] Use single_action_spec whenever necessary

c5e83e6

ghstack-source-id: a6748fa882a41fdd50795b46b261e6e214af2c0e Pull Request resolved: #2592

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 20, 2024

vmoens added the bug Something isn't working label Nov 20, 2024

matteobettini reviewed Nov 20, 2024

View reviewed changes

Update

b59d5de

[ghstack-poisoned]

vmoens added a commit that referenced this pull request Nov 20, 2024

[BugFix] action_spec_unbatched whenever necessary

6dae0cf

ghstack-source-id: 4168c6c8b6b5febd8db4fd43e71e46e9bfeb10cc Pull Request resolved: #2592

vmoens changed the title ~~[BugFix] Use single_action_spec whenever necessary~~ [BugFix] action_spec_unbatched whenever necessary Nov 20, 2024

Update

f9c4e00

[ghstack-poisoned]

vmoens added a commit that referenced this pull request Nov 20, 2024

[BugFix] action_spec_unbatched whenever necessary

acd00a1

ghstack-source-id: f346c47cd2d87a9306059e3ca56affcc68a7ff9c Pull Request resolved: #2592

Update

17a6f57

[ghstack-poisoned]

vmoens added a commit that referenced this pull request Nov 20, 2024

[BugFix] action_spec_unbatched whenever necessary

c2e27b1

ghstack-source-id: c235e024bc208155e0c74d08c67a581b6a7cbc79 Pull Request resolved: #2592

vmoens added the Environments Adds or modifies an environment wrapper label Nov 20, 2024

vmoens mentioned this pull request Nov 20, 2024

[Refactor] Use <spec>_unbatched in VMAS #2593

Merged

vmoens added 3 commits November 20, 2024 15:40

Update

23279b4

[ghstack-poisoned]

Update

73f7a46

[ghstack-poisoned]

Update

98a3b39

[ghstack-poisoned]

vmoens merged commit 98a3b39 into gh/vmoens/44/base Nov 20, 2024
55 of 73 checks passed

vmoens added a commit that referenced this pull request Nov 20, 2024

[BugFix] action_spec_unbatched whenever necessary

d30599e

ghstack-source-id: ec87794dabaf5023dac85cfc898a7c000e93331d Pull Request resolved: #2592

vmoens deleted the gh/vmoens/44/head branch November 20, 2024 21:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BugFix] action_spec_unbatched whenever necessary #2592

[BugFix] action_spec_unbatched whenever necessary #2592

vmoens commented Nov 20, 2024 •

edited

Loading

pytorch-bot bot commented Nov 20, 2024 •

edited

Loading

matteobettini left a comment

matteobettini Nov 20, 2024

vmoens commented Nov 20, 2024

matteobettini commented Nov 20, 2024

matteobettini commented Nov 20, 2024 •

edited

Loading

vmoens commented Nov 20, 2024

vmoens commented Nov 20, 2024

matteobettini commented Nov 20, 2024

vmoens commented Nov 20, 2024

matteobettini commented Nov 20, 2024

vmoens commented Nov 20, 2024

vmoens commented Nov 20, 2024

matteobettini commented Nov 20, 2024

	spec=env.single_action_spec,
	spec=env.single_full_action_spec,

[BugFix] action_spec_unbatched whenever necessary #2592

[BugFix] action_spec_unbatched whenever necessary #2592

Conversation

vmoens commented Nov 20, 2024 • edited Loading

pytorch-bot bot commented Nov 20, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2592

❗ 1 Active SEVs

❌ 3 New Failures, 1 Cancelled Job, 15 Unrelated Failures

matteobettini left a comment

Choose a reason for hiding this comment

matteobettini Nov 20, 2024

Choose a reason for hiding this comment

vmoens commented Nov 20, 2024

matteobettini commented Nov 20, 2024

matteobettini commented Nov 20, 2024 • edited Loading

vmoens commented Nov 20, 2024

vmoens commented Nov 20, 2024

matteobettini commented Nov 20, 2024

vmoens commented Nov 20, 2024

matteobettini commented Nov 20, 2024

vmoens commented Nov 20, 2024

vmoens commented Nov 20, 2024

matteobettini commented Nov 20, 2024

vmoens commented Nov 20, 2024 •

edited

Loading

pytorch-bot bot commented Nov 20, 2024 •

edited

Loading

matteobettini commented Nov 20, 2024 •

edited

Loading