Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

EncodedBodySize overflow #1407

Open
kruskall opened this issue Aug 3, 2023 · 2 comments
Open

EncodedBodySize overflow #1407

kruskall opened this issue Aug 3, 2023 · 2 comments
Labels

Comments

@kruskall
Copy link
Member

kruskall commented Aug 3, 2023

According to the specification, EncodedBodySize must be a number.

As part of the ops KPI review we've been investigating the following errors in ecs logs:

decode error: data read error: v2.spanRoot.Span: v2.span.Context: v2.spanContext.HTTP: v2.spanContextHTTP.Response: v2.spanContextHTTPResponse.EncodedBodySize: readUint64: overflow, error found in #10 byte of ...|":184467440737095520|..., bigger context ...|:{"transfer_size":299,"encoded_body_size":18446744073709552000,"decoded_body_size":0}},"destination"|..
decode error: data read error: v2.transactionRoot.Transaction: v2.transaction.Context: v2.context.Response: v2.contextResponse.EncodedBodySize: readUint64: overflow, error found in #10 byte of ...|":184467440737095520|..., bigger context ...|:{"transfer_size":299,"encoded_body_size":18446744073709552000,"decoded_body_size":0}},"marks":{"age|...

It seems 18446744073709552000 is bigger than the maximum int64, thus not a valid value for the encoded body size, making the ingestion fail in apm-server.

@devcorpio
Copy link
Contributor

Hi @kruskall,

Thanks for raising this!

I have a few questions:

  • Do we know the number of occurrences of this error?
  • Is this related to any SDH?

I'm asking those questions to know the level of priority we should assign to it

Cheers,
Alberto

@kruskall
Copy link
Member Author

kruskall commented Aug 3, 2023

Thanks for the quick response!

Do we know the number of occurrences of this error?

~300k hit last week, happening on several clusters

Is this related to any SDH?

Not that I'm aware

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants