[Algorithm] RLHF end-to-end, clean #1597

Merged
vmoens merged 52 commits into main from rlhf-example-refactor
Oct 5, 2023


@vmoens (Contributor) commented Oct 3, 2023

No description provided.

apbard and others added 30 commits June 27, 2023 14:43
Co-authored-by: Alessandro Pietro Bardelli <[email protected]>
# Conflicts:
#	test/test_rlhf.py
#	torchrl/data/rlhf/utils.py
#	torchrl/modules/tensordict_module/actors.py
#	torchrl/modules/tensordict_module/common.py
@facebook-github-bot added the CLA Signed label Oct 3, 2023
@vmoens added the new algo label Oct 4, 2023
@vmoens merged commit fe19cf5 into main Oct 5, 2023
48 of 55 checks passed
@vmoens deleted the rlhf-example-refactor branch October 5, 2023 15:48
vmoens added a commit to hyerra/rl that referenced this pull request Oct 10, 2023
Co-authored-by: Alessandro Pietro Bardelli <[email protected]>
Co-authored-by: Tom Begley <[email protected]>
Labels
CLA Signed: This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
new algo: New algorithm request or PR

4 participants