feat: flag to enable gauge replacement #77

mcfedr · 2023-09-01T15:15:00Z

Enable different modes for feature flags

fixes #65

mplachter · 2023-09-01T15:27:45Z

@mcfedr Thank you for the MR.

We have had a ton of feedback on the gauges.
We need to do something different... 🤔

Currently (to your point), we add the gauges up (which does not make sense to do)

This approach of just having the new gauge as the value, I think, can be misleading, as well as having a randomly high or low value as it's not an aggregate from all the containers\functions\pods sending metrics into the gateway. I think this is better than currently summing the gauge, as it makes no sense to do so...

My ideal situation would be to make a running average or median of the metric gauge over a given interval. I do think this will take some time to implement I do have a draft MR #57 to add this but to keep it performant I had to duplicate the memory footprint which isn't ideal (we may have to rewrite the structs and move away from slices of metrics)

We could change this behavior based on a CLI flag so we can still implement your desired change but make it so it doesn't change the current behavior unless a CLI flag is passed.

@djeebus thoughts?

djeebus · 2023-09-01T15:30:11Z

Yeah, I agree w/ all your points. A flag, something like --gauge-behavior=sum|replace, would probably be a good start, which leaves us an opening to do something like median, max, etc.

mcfedr · 2023-09-01T15:30:20Z

Yes, in my use case the value is an calculated value from some data, so no matter which process (in my case lambda functions) sends it, its the latest value - so i certainly wouldnt want any manipulation of the value

but i also thought about a cli flag, might be helpful to have different options

SpangleLabs · 2023-09-01T15:32:17Z

People keep suggesting this: #65 #71 and it seems like if you want the gauge to replace, then you should push it to a prometheus push gateway, not an aggregation gateway.
It feels like an aggregation gateway should aggregate.

I think there are cases where aggregating the gauges is exactly what's wanted, especially in summaries for example.

It seems weird to add a CLI flag that turns the aggregation gateway into a push gateway though. I can understand wanting some metrics to be aggregated and some to be latest, in which case, push some to an aggregation gateway, and some to a push gateway?

The idea of the aggregation gateway showing the average gauge value of the last N pushes or M minutes also seems odd? It would need to be synced to the fetching or the pushing, and liable to go out of sync with either. If you want a running total, that seems like something an aggregated summary could do, or a prometheus query over a latest value gauge?

mcfedr · 2023-09-01T15:42:51Z

@SpangleLabs Gauge are different from Counters though, thats clearly why this is happening

I could use both push gateway and aggregation gateway - but it does mean my app has to make two http requests to send all the metrics, and thats a big disadvantage

SpangleLabs · 2023-09-01T16:14:45Z

Yeah, I'm just quite hesitant on the idea that an aggregation gateway should also do the job of a push gateway?

I guess if you need to save on web requests then it would be helpful to have a gateway that can serve the purposes of both.. But even then that would seem best configured at a metric or job level, rather than a gateway level? Though I've no idea what syntax could enable that. Tying the operation of your application and monitoring, to the configuration of your aggregation gateway seems a weird choice, and doesn't seem to properly separate concerns.

I feel it boils down to an idea of "do one thing well"

(And certainly, making a breaking change to the behaviour seems unwise)

In fairness though, I've been digging through to find my concrete use-cases for aggregating gauges, and most of them are in places where we were rolling our own summaries, as the aggregation gateway did not support Summary metrics at the time. I might be suffering some anchoring based on that

mcfedr · 2023-09-01T16:27:09Z

So what about something like this? I'm not 100% sure its the best way to pass the flag though, but its functional like this.

pablote · 2023-09-06T23:06:31Z

It'd be great if this or something very similar got merged. Having two push gateways, one that aggregates and one that doesn't does not make sense.

SpangleLabs · 2023-09-07T11:31:38Z

Honestly, at this point, I'm growing kind of unsure when one would want a push gateway at all, alongside or instead of an aggregation gateway.

After much thought (and overthought), I think this solution looks great. Having a flag to allow backwards compatibility, but moving towards the better usage of the thing.

(Even the stuff I said before about summaries being simulated with a counter and a gauge is entirely wrong. A summary is 2 counters)

mcfedr · 2023-09-12T15:53:32Z

@mplachter maybe I can bump this for a review?

mplachter · 2023-09-12T16:06:32Z

@mcfedr This looks good if we're able to add a unit_test for this in aggregate_test.go we can get this merged in, If we add this functionality, I want to make sure we do not undo it without the proper knowledge + documentation during a future release :)

hxnir · 2024-01-01T15:09:00Z

Hey, it looks really good and would really help one of my usecases.
Any updates on the progress? If the missing unit test is the problem I am more than happy to add it myself.

Signed-off-by: Fred Cox <[email protected]>

mcfedr · 2024-01-04T10:15:45Z

@hxnir thanks for bumping this, it kept slipping down my list of things to do. I've added a couple of tests to check the new behavoir, @mplachter hopefully good for a merge now.

also with a rebase on main

JDLK7 · 2024-03-21T13:08:26Z

Hey, do you plan on merging this anytime soon?
Thanks!

xstephen95x · 2024-05-15T22:34:23Z

Also curious if this will be picked up.

KepptnKool · 2024-07-02T09:27:38Z

@djeebus @mplachter What are the chances of this PR getting merged? Do you need any support? I guess there are quite some users who would like this feature.

mplachter requested review from djeebus and mplachter September 1, 2023 15:28

mcfedr force-pushed the gauge-replace branch 2 times, most recently from a759536 to 1141a76 Compare September 1, 2023 16:17

mcfedr changed the title ~~fix: replace gauge values with the latest~~ feat: flag to enable gauge replacement Sep 1, 2023

mcfedr force-pushed the gauge-replace branch 2 times, most recently from e9abc2a to 3fcc9c1 Compare September 1, 2023 16:46

feat: flag to enable gauge replacement

9626541

Signed-off-by: Fred Cox <[email protected]>

mcfedr force-pushed the gauge-replace branch from 3fcc9c1 to 9626541 Compare January 4, 2024 10:13

tomj74 mentioned this pull request Oct 22, 2024

[Feature] Support replacing rather than summing gauges #65

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: flag to enable gauge replacement #77

feat: flag to enable gauge replacement #77

mcfedr commented Sep 1, 2023 •

edited

Loading

mplachter commented Sep 1, 2023 •

edited

Loading

djeebus commented Sep 1, 2023

mcfedr commented Sep 1, 2023

SpangleLabs commented Sep 1, 2023

mcfedr commented Sep 1, 2023

SpangleLabs commented Sep 1, 2023

mcfedr commented Sep 1, 2023

pablote commented Sep 6, 2023

SpangleLabs commented Sep 7, 2023

mcfedr commented Sep 12, 2023

mplachter commented Sep 12, 2023 •

edited

Loading

hxnir commented Jan 1, 2024

mcfedr commented Jan 4, 2024

JDLK7 commented Mar 21, 2024

xstephen95x commented May 15, 2024

KepptnKool commented Jul 2, 2024

feat: flag to enable gauge replacement #77

Are you sure you want to change the base?

feat: flag to enable gauge replacement #77

Conversation

mcfedr commented Sep 1, 2023 • edited Loading

mplachter commented Sep 1, 2023 • edited Loading

djeebus commented Sep 1, 2023

mcfedr commented Sep 1, 2023

SpangleLabs commented Sep 1, 2023

mcfedr commented Sep 1, 2023

SpangleLabs commented Sep 1, 2023

mcfedr commented Sep 1, 2023

pablote commented Sep 6, 2023

SpangleLabs commented Sep 7, 2023

mcfedr commented Sep 12, 2023

mplachter commented Sep 12, 2023 • edited Loading

hxnir commented Jan 1, 2024

mcfedr commented Jan 4, 2024

JDLK7 commented Mar 21, 2024

xstephen95x commented May 15, 2024

KepptnKool commented Jul 2, 2024

mcfedr commented Sep 1, 2023 •

edited

Loading

mplachter commented Sep 1, 2023 •

edited

Loading

mplachter commented Sep 12, 2023 •

edited

Loading