[QUESTION] Calculations regarding calculate_per_token_loss parameter #1100
Unanswered
clarence-lee-sheng
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
In line 231-233 in megatron/core/pipeline_parallel/schedules.py (megatron/core/pipeline_parallel/schedules.py), I have two questions:
Beta Was this translation helpful? Give feedback.
All reactions