Skip to content

Commit

Permalink
Merge branch 'main' into sync_with_torchtitan
Browse files Browse the repository at this point in the history
  • Loading branch information
philippguevorguian authored Aug 20, 2024
2 parents 81c555f + dad421f commit 57c3400
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 0 deletions.
Binary file added assets/images/TorchTitan_logo_main.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
2 changes: 2 additions & 0 deletions torchtitan/datasets/hf_datasets.py
Original file line number Diff line number Diff line change
Expand Up @@ -13,6 +13,7 @@

from torchdata.stateful_dataloader import StatefulDataLoader


from torchtitan.datasets.tokenizer import Tokenizer
from torchtitan.logging import logger

Expand Down Expand Up @@ -141,6 +142,7 @@ def _get_data_iter(self):
if isinstance(self._data, Dataset) and self._sample_idx == len(self._data):
return iter([])


return iter(self._data.skip(self._sample_idx))

def load_state_dict(self, state_dict):
Expand Down

0 comments on commit 57c3400

Please sign in to comment.