Skip to content

Performance degradation when repeatedly writing the same data. #1476

Answered by wjones127
llggdd asked this question in Q&A
Discussion options

You must be logged in to vote

Instead of using a path every time, after the first write_deltalake, get a DeltaTable object and pass that in instead. That way it only has to load the incremental updates.

there is also a function to create a checkpoint which you can run every ten or so commits. This function is a method on DeltaTable called create_checkpoint..

Replies: 1 comment 7 replies

Comment options

You must be logged in to vote
7 replies
@llggdd
Comment options

@wjones127
Comment options

Answer selected by llggdd
@llggdd
Comment options

@wjones127
Comment options

@llggdd
Comment options

@wjones127
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants