Replies: 2 comments
-
I did some preliminary experiments with the same -
|
Beta Was this translation helpful? Give feedback.
0 replies
-
Marking as stale. No activity in 60 days. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Incorrect Dataset Shuffling
gpt_dataset.py
, the dataset is being globally shuffled across epochs rather than within epoch shuffling which is the standard.Question
Has this been done on purpose? Is there any reason to prefer global shuffling over per-epoch shuffling?
Solution
Shuffle data per epoch instead of shuffling the full data. Implementation is straightforward. However, we need to fix both document and shuffle index to fix the overall problem.
Beta Was this translation helpful? Give feedback.
All reactions