Add sampling fields to dataset descriptor (#3782)
Summary:
Pull Request resolved: #3782

The fields sampling_column and sampling_rate are added to the dataset descriptor to support sampling in dataset readers.

Reviewed By: satymish

Differential Revision: D61569067

fbshipit-source-id: e5db9957538b033bbef4b7662154411b9044d1f8
kuarora authored and facebook-github-bot committed Aug 22, 2024
1 parent a43afd6 commit 5c87f13
Showing 1 changed file with 5 additions and 0 deletions.
benchs/bench_fw/descriptors.py: 5 additions, 0 deletions
@@ -80,6 +80,11 @@ class DatasetDescriptor:

    embedding_column: Optional[str] = None

    sampling_rate: Optional[float] = None

    # sampling column for xdb
    sampling_column: Optional[str] = None

    def __hash__(self):
        return hash(self.get_filename())
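
The commit adds only the two fields and does not show how dataset readers consume them. As a rough illustration, the sketch below shows one way a reader might use them. Everything apart from the field names sampling_rate and sampling_column is an assumption: SketchDescriptor is a hypothetical stand-in for DatasetDescriptor, sample_rows is a hypothetical helper, and the hash-based keep/drop rule is an assumed semantics, not the behavior of the actual readers.

```python
import hashlib
from dataclasses import dataclass
from typing import Any, Dict, Iterable, List, Optional


@dataclass
class SketchDescriptor:
    """Hypothetical stand-in for DatasetDescriptor with only the new fields."""

    sampling_rate: Optional[float] = None
    sampling_column: Optional[str] = None


def _unit_hash(value: Any) -> float:
    """Map a value deterministically to a float in [0, 1)."""
    digest = hashlib.sha256(str(value).encode("utf-8")).digest()
    # 48 bits fit exactly in a double, so the result is always below 1.0.
    return int.from_bytes(digest[:6], "big") / 2**48


def sample_rows(
    desc: SketchDescriptor, rows: Iterable[Dict[str, Any]]
) -> List[Dict[str, Any]]:
    """Assumed semantics: keep roughly sampling_rate of the rows.

    If sampling_column is set, rows are keyed by that column's value, so
    rows sharing a value are kept or dropped together. Otherwise the row
    index is used as the key.
    """
    if desc.sampling_rate is None:
        # No sampling requested: return every row.
        return list(rows)
    kept = []
    for index, row in enumerate(rows):
        key = row[desc.sampling_column] if desc.sampling_column is not None else index
        if _unit_hash(key) < desc.sampling_rate:
            kept.append(row)
    return kept
```

Hashing the key rather than drawing random numbers keeps the sample reproducible across runs, which matters for a benchmark framework, but the real readers may well sample differently.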
