Reduce the frequency of reconcile so that the operator is not overloaded and the reconcile queue does not grow long when there are lots of CR instances.
Implement smart reconcile to reduce the frequency in certain scenarios.
For example, if the error status conditions/messages haven't changed in the last 2 reconciles, then increase the reconcile interval (e.g. from 5 seconds to 10 seconds). If the error status hasn't changed in the last 4 reconciles, then increase the interval from 10 seconds to 20 seconds, and so on, up to perhaps 4 minutes.
Adjust the reconcile interval for successful status in a similar way, but perhaps cap the max at 2 minutes.
Parameterize the base reconcile interval (15 seconds by default) and the increase percentage (100% - meaning 15 seconds to 30 seconds), so they can be set in the Operator's ConfigMap by users if needed.
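A minimal sketch of the interval calculation described above, under one reading of the proposal: the interval grows by the configured percentage once for every 2 consecutive reconciles in which the status did not change, and is capped at a maximum. The function name, parameter names, and the `reconciles_per_step` knob are hypothetical and are not taken from the operator's code.

```python
def next_interval(unchanged_reconciles, base_seconds=15.0,
                  increase_percent=100.0, max_seconds=240.0,
                  reconciles_per_step=2):
    """Return the next reconcile interval in seconds.

    unchanged_reconciles: consecutive reconciles with an unchanged status.
    base_seconds / increase_percent: the two user-tunable settings
    (15 seconds and 100% are the defaults proposed in this issue).
    max_seconds: cap, e.g. 240 for error status or 120 for successful status.
    reconciles_per_step: assumed number of unchanged reconciles per increase.
    """
    if base_seconds <= 0 or increase_percent < 0 or reconciles_per_step < 1:
        raise ValueError("invalid smart reconcile settings")
    if increase_percent == 0:
        return min(base_seconds, max_seconds)

    steps = unchanged_reconciles // reconciles_per_step
    factor = 1.0 + increase_percent / 100.0
    interval = base_seconds
    # Multiply step by step and stop at the cap, so a very large
    # unchanged count cannot overflow.
    for _ in range(steps):
        if interval >= max_seconds:
            break
        interval *= factor
    return min(interval, max_seconds)


# The 5 -> 10 -> 20 second example from the description:
for count in (0, 2, 4):
    print(count, next_interval(count, base_seconds=5.0))
```

A status change would reset `unchanged_reconciles` to 0, which drops the interval back to the base value.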
leochr changed the title from "Reduce the frequency of reconcile and parameterize the reconcile intervals" to "Reduce the frequency of reconcile and parameterize the reconcile settings" on Oct 11, 2024
Meeting summary - Friday, November 1, 2024 (Leo, Melissa):
Smart reconcile improved the overall reconcile and ready time (90+ instances became ready in 3 minutes; new instance (101st) became ready in 31 seconds). Controller was run locally.
Next steps:
Investigate if there is anything to do for the 'Warning' status condition
Handle invalid user config for smart reconcile in the Operator ConfigMap: output a warning and use the default values
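The second next step (warn and fall back to defaults on invalid config) could look roughly like the sketch below. The key names `reconcileIntervalSeconds` and `reconcileIntervalIncreasePercent` are hypothetical placeholders, not the operator's actual ConfigMap keys; the defaults are the ones proposed in the issue description. ConfigMap values are strings, so each one is parsed and range-checked.

```python
import logging
import math

logger = logging.getLogger("smart_reconcile")

# Hypothetical key names; defaults taken from the issue description.
INTERVAL_KEY = "reconcileIntervalSeconds"
PERCENT_KEY = "reconcileIntervalIncreasePercent"
DEFAULTS = {INTERVAL_KEY: 15.0, PERCENT_KEY: 100.0}


def load_settings(config_data):
    """Parse smart reconcile settings from ConfigMap-style string data.

    Missing keys silently use the default. Values that are not finite
    numbers, or are out of range, log a warning and use the default.
    """
    settings = dict(DEFAULTS)
    for key, default in DEFAULTS.items():
        raw = config_data.get(key)
        if raw is None:
            continue
        try:
            value = float(raw)
        except (TypeError, ValueError):
            value = math.nan
        # The interval must be positive; the percentage may be zero.
        in_range = value > 0 if key == INTERVAL_KEY else value >= 0
        if math.isfinite(value) and in_range:
            settings[key] = value
        else:
            logger.warning("Invalid value %r for %s; using default %s",
                           raw, key, default)
    return settings
```

Falling back per key keeps one bad value from discarding a valid setting next to it.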
It took 10m to fully reconcile 100 instances, each in its own namespace.
It took 2m17s to fully reconcile 1 new instance, with 100 working instances.
It took 2m29s to fully reconcile 1 wrongly configured instance after it was corrected, with 101 working instances and 9 non-working instances running.