Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Whitelist generates empty part files sometimes #52

Open
jterry64 opened this issue Dec 31, 2019 · 1 comment
Open

Whitelist generates empty part files sometimes #52

jterry64 opened this issue Dec 31, 2019 · 1 comment

Comments

@jterry64
Copy link
Member

Whitelist is occasionally generating empty part files. When trying to upload to GFW API, this causes an empty file error that might put the dataset into a failed state. Easy to workaround, but seems better to just stop it from happening in the first place.

@jterry64
Copy link
Member Author

Actually, seems like this can just happen occasionally when repartitioning with Spark. There are ways to get rid of empty partitions, but seems like it can happen randomly, so I think I'll just make the datapump more resilient to this happening.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant