Batch inserts #68
Conversation
@@ -170,14 +174,31 @@ def optail
    if tail_from.is_a? Time
      tail_from = tailer.most_recent_position(tail_from)
    end

    last_batch_insert = Time.now
    tailer.tail(:from => tail_from)
    until @done
      tailer.stream(1000) do |op|
I haven't done the digging to confirm this, but I'm pretty sure the contract on `Tailer` by default is that once your block returns, the op is considered to have been handled, and the timestamp may be persisted to postgres. However, with batched inserts, we haven't actually processed the op until we've flushed the inserts, so this could result in data loss if we save a timestamp before flushing the inserts.

mongoriver does have a `batch` mode, which allows you to explicitly mark batches and tell mongoriver when you're done with a batch. Unfortunately I've forgotten the details, so you'll probably have to source-dive :(
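To make the hazard concrete, here is a hypothetical sketch of the invariant being described: the resume timestamp may only advance past ops whose buffered inserts have actually been flushed. The `Checkpoint` class and its method names are invented for illustration; they are not mongoriver's actual API.

```ruby
# Hypothetical sketch: track which buffered ops have been flushed, and
# only persist the newest timestamp with no unflushed op before it.
# Names (Checkpoint, buffer, flushed_up_to) are illustrative only.
class Checkpoint
  attr_reader :persisted

  def initialize
    @persisted = nil   # last timestamp safe to resume from
    @pending   = []    # [timestamp, flushed?] pairs, in oplog order
  end

  # An op was handed to the batch buffer but not yet written out.
  def buffer(ts)
    @pending << [ts, false]
  end

  # The batch covering ops up to ts was written to postgres.
  def flushed_up_to(ts)
    @pending.each { |p| p[1] = true if p[0] <= ts }
    advance
  end

  private

  # Advance the persisted timestamp only across flushed ops.
  def advance
    while !@pending.empty? && @pending.first[1]
      @persisted = @pending.shift[0]
    end
  end
end
```

If the process dies before a flush, `persisted` still points at the last fully flushed position, so replaying from it re-reads the unflushed ops instead of losing them.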
Good point!

My original assumption was that if the process is told to stop, it would flush due to the signal handler. However, in hindsight you're right: if something catastrophic happens, the data would not get flushed, resulting in data loss.
Yeah, we can't assume we'll get to shut down gracefully -- we need to handle the case where the machine dies, the program gets killed via SIGKILL, whatever.
Modulo the concerns around making sure we don't update timestamps too early, I think this lgtm.
Did you figure out how to address the concerns around timestamps? We really need this optimization in our environment.
This pull request adds support for batching sequential INSERTs when tailing, speeding up tailing under certain conditions while never being slower than the current behavior. See also issue #47.
r? @nelhage
cc @snoble
Basic strategy is to batch consecutive inserts together per namespace. The batch gets saved whenever:
- an update or delete is done to the same namespace as the insert,
- after streaming (up to) 1000 updates from the oplog, the time since the last batch save is larger than 5 seconds,
- more than a threshold of updates have happened in this namespace, or
- the program is exiting/streaming stops.
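As a rough illustration of those flush conditions, here is a hypothetical sketch of a per-namespace insert buffer. `BatchBuffer` and all of its method names are invented for illustration; they are not the actual code in this PR.

```ruby
# Hypothetical sketch: buffer consecutive inserts per namespace and flush
# on the conditions listed above. Illustrative names, not MoSQL's API.
class BatchBuffer
  def initialize(max_size: 500, max_age: 5.0, &flush)
    @max_size   = max_size   # threshold of buffered inserts per namespace
    @max_age    = max_age    # seconds since last flush before forcing one
    @buffers    = Hash.new { |h, k| h[k] = [] }
    @last_flush = Time.now
    @flush      = flush      # called with (namespace, rows)
  end

  def insert(ns, row)
    @buffers[ns] << row
    flush!(ns) if @buffers[ns].size >= @max_size  # size threshold hit
  end

  # An update/delete to a namespace must flush its pending inserts first,
  # so operations are applied in oplog order within that namespace.
  def before_mutation(ns)
    flush!(ns)
  end

  # Called after each stream(1000) round: flush everything if too old.
  def maybe_flush_all
    flush_all! if Time.now - @last_flush > @max_age
  end

  # Called on shutdown (and from maybe_flush_all).
  def flush_all!
    @buffers.keys.each { |ns| flush!(ns) }
    @last_flush = Time.now
  end

  private

  def flush!(ns)
    rows = @buffers.delete(ns)
    @flush.call(ns, rows) if rows && !rows.empty?
  end
end
```

The per-namespace split matters because a multi-row INSERT can only target one table, and ordering only has to be preserved relative to other ops on the same namespace.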
Some handwavy measurements for tailing 20000 oplog entries:
Notes on potential future work (that I may or may not be working on soonish):

The next "low-hanging" performance fruit to work on after this would be optimizing updates, though this wouldn't have as large an effect.
Some ideas on how this could be done:

`$set` entries in the oplog can directly be translated into postgres queries updating only the columns mentioned. Updates without `$set` can replace the current row in postgres with the data in the oplog entry. The tricky part here is figuring out if/how this applies to tokumx even after mongoriver does oplog entry translation (if they support any other `$` operations) and `$unset`.

Another performance improvement would be to have multiple tailers in either separate threads or processes, separated by namespace. This would however require keeping multiple tailing states in the database (one per namespace), and I'm not quite sure what the performance implications are for mongo of querying the same oplog (with filters?) from multiple processes.
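As a toy illustration of the `$set` translation idea: a hypothetical sketch that turns a `$set` document into a column-wise UPDATE. The `set_to_update` helper, the column mapping, and the quoting are all invented and simplified; a real implementation would go through MoSQL's schema map and use parameterized queries rather than string interpolation.

```ruby
# Hypothetical sketch: translate a $set oplog update into a postgres
# UPDATE touching only the columns mentioned. Simplified quoting;
# illustration only, not safe against SQL injection.
def set_to_update(table, selector, set_fields)
  assignments = set_fields.map { |col, val| "#{col} = #{quote(val)}" }.join(", ")
  "UPDATE #{table} SET #{assignments} WHERE _id = #{quote(selector['_id'])};"
end

def quote(val)
  # Numbers pass through; everything else is quoted with '' escaping.
  val.is_a?(Numeric) ? val.to_s : "'#{val.to_s.gsub("'", "''")}'"
end
```

Updates without `$set` would instead map to an UPDATE (or DELETE + INSERT) replacing the whole row with the document from the oplog entry.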