Is there a possibility to create a checkpoint file at custom location? #2899
Major Question: Explanation: Ideally, we would save the checkpoint files in each individual party's own AWS S3 bucket.
Replies: 1 comment 7 replies
The protocol states this:
https://github.com/delta-io/delta/blob/master/PROTOCOL.md#checkpoints
Checkpoints have a different goal than the one you are expecting to use them for. They speed up determining the state of the table by storing the add/remove/metadata/protocol actions in a more efficient form.
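To illustrate why a checkpoint only speeds up state reconstruction, here is a minimal sketch in plain Python. The `LOG` structure, the file names, and the `replay` helper are hypothetical simplifications for illustration; this is not the delta-rs API or the real log format, which stores JSON actions with many more fields.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical, simplified commit log: version -> list of (action, file path).
LOG: Dict[int, List[Tuple[str, str]]] = {
    0: [("add", "part-0.parquet")],
    1: [("add", "part-1.parquet")],
    2: [("remove", "part-0.parquet"), ("add", "part-2.parquet")],
    3: [("add", "part-3.parquet")],
}

def replay(start_files: Set[str], first: int, last: int) -> Set[str]:
    """Apply add/remove actions for versions first..last on top of start_files."""
    files = set(start_files)
    for version in range(first, last + 1):
        for action, path in LOG[version]:
            if action == "add":
                files.add(path)
            else:
                files.discard(path)
    return files

# Without a checkpoint: replay every commit from version 0.
full_state = replay(set(), 0, 3)

# With a checkpoint at version 2: start from the stored state and
# replay only the commits after it.
checkpoint_v2 = replay(set(), 0, 2)  # stands in for what a writer persisted
fast_state = replay(checkpoint_v2, 3, 3)
```

Both paths end in the same set of active files; the checkpoint only shortens the replay. It says nothing about how far any particular reader has consumed the table.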
No reader should create a checkpoint; only a writer should. If you want to track what your reader has already read, just maintain a read_last_version elsewhere.
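A rough sketch of keeping `read_last_version` outside the table, assuming a small JSON state file owned by the reader. The file name `reader_state.json` and both helper functions are made up for this example; the same idea could apply to an object in each party's own bucket, but that part is not shown here.

```python
import json
import tempfile
from pathlib import Path
from typing import Optional

def load_last_version(state_file: Path) -> Optional[int]:
    """Return the last source version this reader processed, or None on first run."""
    if not state_file.exists():
        return None
    return json.loads(state_file.read_text())["read_last_version"]

def save_last_version(state_file: Path, version: int) -> None:
    """Persist the last processed source version for the next run."""
    state_file.write_text(json.dumps({"read_last_version": version}))

# Usage: a temporary directory stands in for the reader's own storage.
with tempfile.TemporaryDirectory() as tmp:
    state_file = Path(tmp) / "reader_state.json"
    first_run = load_last_version(state_file)  # nothing stored yet
    save_last_version(state_file, 41)          # after processing version 41
    resumed = load_last_version(state_file)    # next run resumes from here
```

Because the state lives with the reader, each reader can keep its own position without touching the table's log.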
An approach I have used in the past is to add a "version" column to the target table, so that for every record you write to the target you know which source version it came from. By reading the max version of the target, you can derive where to start reading the source from. But…
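The "version" column approach can be sketched with plain Python data structures. The `source` dictionary, the row layout, and the helper names below are assumptions for illustration only; in practice the source would be a Delta table read version by version and the target would be a real table.

```python
from typing import Dict, List

def next_version_to_read(target: List[dict]) -> int:
    """Derive the starting source version from the max version in the target."""
    if not target:
        return 0
    return max(row["version"] for row in target) + 1

def sync(source: Dict[int, List[str]], target: List[dict]) -> None:
    """Copy unseen source versions into the target, tagging each row with its version."""
    start = next_version_to_read(target)
    for version in sorted(v for v in source if v >= start):
        for value in source[version]:
            target.append({"value": value, "version": version})

# Hypothetical source: version -> rows committed in that version.
source = {0: ["a", "b"], 1: ["c"], 2: ["d", "e"]}
target: List[dict] = []

sync(source, target)       # first run copies versions 0..2
sync(source, target)       # second run finds nothing new, so no duplicates
source[3] = ["f"]
sync(source, target)       # only version 3 is read
```

One caveat of this sketch: a source version that produces no target rows leaves no trace in the target, so it would be re-read on the next run.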