[logs] Aggregates long lines when tailing in a k8s env - part #1 by prognant · Pull Request #6265 · DataDog/datadog-agent

prognant · 2020-08-20T09:30:51Z

What does this PR do?

It isolates the parsing logic in the log pipeline (when tailing from file):
It changes

[...] -> decoder -> line_handler -> [...]

to

[...] -> decoder -> line_parser -> line_handler -> [...]
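As a rough illustration of the new layout, here is a minimal sketch of a parsing stage sitting between a decoder and a line handler. All types and names here (`Message`, `Parser`, `LineHandler`, `LineParser`, `levelParser`, `decode`) are hypothetical stand-ins written for this example and are not the agent's actual definitions.

```go
package main

import (
	"fmt"
	"strings"
)

// Message is a hypothetical value handed to the line handler.
type Message struct {
	Content    string
	Status     string
	RawDataLen int
}

// Parser extracts content and status from a raw line (assumed interface).
type Parser interface {
	Parse(line string) (content, status string, err error)
}

// LineHandler is the stage that receives parsed messages (assumed interface).
type LineHandler interface {
	Handle(msg *Message)
}

// LineParser is the isolated parsing stage: decoder -> LineParser -> LineHandler.
type LineParser struct {
	parser  Parser
	handler LineHandler
}

// Handle parses one raw line and forwards the result, even on a parse error.
func (p *LineParser) Handle(raw string, rawDataLen int) {
	content, status, err := p.parser.Parse(raw)
	if err != nil {
		fmt.Println("parse error:", err)
	}
	p.handler.Handle(&Message{Content: content, Status: status, RawDataLen: rawDataLen})
}

// levelParser is a toy parser that treats the first word as the status.
type levelParser struct{}

func (levelParser) Parse(line string) (string, string, error) {
	parts := strings.SplitN(line, " ", 2)
	if len(parts) < 2 {
		return line, "info", fmt.Errorf("no status prefix in %q", line)
	}
	return parts[1], parts[0], nil
}

// collector is a toy line handler that stores what it receives.
type collector struct{ msgs []*Message }

func (c *collector) Handle(m *Message) { c.msgs = append(c.msgs, m) }

// decode plays the decoder role: it splits a chunk into lines and passes each
// one on, together with the number of raw bytes it consumed (newline included).
func decode(chunk string, next *LineParser) {
	for _, line := range strings.SplitAfter(chunk, "\n") {
		if line == "" {
			continue
		}
		next.Handle(strings.TrimSuffix(line, "\n"), len(line))
	}
}

func main() {
	out := &collector{}
	decode("error disk full\nwarn retrying\n", &LineParser{parser: levelParser{}, handler: out})
	for _, m := range out.msgs {
		fmt.Printf("%s | %s | %d\n", m.Status, m.Content, m.RawDataLen)
	}
}
```

The point of the extra stage is that the handler no longer needs to know how a line is parsed, which leaves room for buffering logic to be added in the parsing stage later.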

Motivation

Make room for additional parsing and buffering logic to support split lines (use case: log tailing in a k8s environment with explicit tailing from file, where lines are split into 16k chunks) while keeping other features intact.
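To make the split-line use case concrete, here is a minimal sketch of the kind of buffering this refactoring makes room for. It assumes each chunk arrives with a flag saying whether it is a partial piece of a longer line; the `chunk` and `aggregator` types are hypothetical and only illustrate the idea, not the code in this PR or in #6266.

```go
package main

import (
	"fmt"
	"strings"
)

// chunk is a hypothetical piece of a log line; partial is true when more
// pieces of the same line are expected to follow.
type chunk struct {
	content string
	partial bool
}

// aggregator buffers partial chunks until the final chunk of a line arrives.
type aggregator struct {
	buf strings.Builder
}

// add appends the chunk to the buffer. It returns the reassembled line and
// true once a non-partial chunk closes the line, otherwise "" and false.
func (a *aggregator) add(c chunk) (string, bool) {
	a.buf.WriteString(c.content)
	if c.partial {
		return "", false
	}
	line := a.buf.String()
	a.buf.Reset()
	return line, true
}

func main() {
	a := &aggregator{}
	chunks := []chunk{
		{content: "first half, ", partial: true},
		{content: "second half", partial: false},
		{content: "short line", partial: false},
	}
	for _, c := range chunks {
		if line, ok := a.add(c); ok {
			fmt.Println(line)
		}
	}
}
```

A real implementation would also need a cap on the buffer size and a policy for a partial chunk that is never completed; both are left out of this sketch.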

Additional Notes

It will add a slight overhead.
See subsequent PR #6266 for the release note.

Describe your test plan

Existing unit tests adjusted.
New unit tests added.
Manual tests in a real environment.

ogaca-dd

LGTM. I am not familiar with this code and so requesting another review might be a good idea. In particular, it is hard to track the logic of rawDataLen.

Note: Test failed.

prognant · 2020-09-02T12:37:49Z

> LGTM. I am not familiar with this code and so requesting another review might be a good idea. In particular, it is hard to track the logic of rawDataLen.
>
> Note: Test failed.

rawDataLen keeps track of the current position while tailing, so that the agent can be restarted and resume tailing at the exact place where it stopped. I rebased and double-checked the PR; tests are now 🟢.
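A minimal sketch of that idea, under the assumption that every message carries the number of raw bytes it consumed and that a stored offset is advanced by that amount. The `message` type and `readLines` helper are hypothetical and written only for this illustration.

```go
package main

import (
	"fmt"
	"strings"
)

// message is a hypothetical parsed line plus the raw byte count it consumed.
type message struct {
	content    string
	rawDataLen int
}

// readLines returns every complete line found in data from offset onwards.
// The content is trimmed, so its length can differ from rawDataLen, which is
// why the raw length has to be tracked separately.
func readLines(data string, offset int) []message {
	var msgs []message
	pos := offset
	for {
		i := strings.IndexByte(data[pos:], '\n')
		if i < 0 {
			return msgs // incomplete trailing line: leave it for the next read
		}
		raw := data[pos : pos+i+1]
		msgs = append(msgs, message{content: strings.TrimSpace(raw), rawDataLen: len(raw)})
		pos += len(raw)
	}
}

func main() {
	file := "  first  \nsecond\nthi"
	offset := 0
	for _, m := range readLines(file, offset) {
		offset += m.rawDataLen // advance by raw bytes, not by content length
	}
	fmt.Println("stored offset:", offset)

	// Simulate a restart after the file has grown: resume from the stored offset.
	file += "rd\n"
	for _, m := range readLines(file, offset) {
		fmt.Println("after restart:", m.content)
	}
}
```

Advancing by the content length instead would drift as soon as a parser strips whitespace, a prefix, or a newline, and the tailer would resume in the middle of a line.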

truthbk

Looks good to me! Just a couple of questions. A before-and-after benchmark of the pipeline changes would also be nice, to get a sense of the impact of the new allocations.

truthbk · 2020-09-08T18:00:30Z

+	}
+
+	if parser.SupportsPartialLine() {
+		lineParser = NewSingleLineParser(parser, lineHandler)


Am I missing something? These two lines in the if and else blocks are identical?

They are; the subsequent PR adds an alternate path, cf. #6266.

truthbk · 2020-09-08T18:47:56Z

+	if err != nil {
+		log.Debug(err)
+	}
+	p.lineHandler.Handle(NewMessage(content, status, input.rawDataLen, timestamp))


I'm not entirely sure, but I feel like the new pipeline (which is way cleaner by the way) might be a little heavier on the allocations side. Any chance we could benchmark or profile the two approaches - just to get a sense of potential performance/memory impact?

I totally agree that it is likely to be a bit slower (I will try to find out how much and share numbers). Initially I thought about having some kind of fast path that drops unnecessary steps (like the no-op parser) under certain conditions. However, in the current code base it would have ended up either as something a bit hacky or, IMHO, as too big a change for a single PR. After thinking about the second option, I tried to write the current PR as a step toward a more generic pipeline that would ultimately look like:
[decoder] -(*Message)-> step #1 -(*Message)-> [...] -(*Message)-> step #n -(*Message)-> [output/forwarder]
Only the mandatory blocks for a given source would then be instantiated, with a single type circulating between blocks, so we would not get useless allocations for blocks that are never instantiated. To that extent we should be able, for base cases (like flat file tailing, no-op parser, no multiline logs), to keep the same performance as we have today.
One "new" block we could implement once that's done: there have been (surprisingly) a high number of requests to support UTF-16 encoded logs. I think if we rework the pipeline, we could then have an "encoding" block easily usable for all log sources, only enabled based on a to-be-defined config knob value.
This is really open for discussion. I was about to start an RFC on streamlining the logs processing pipeline, to describe and discuss what's written above.
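As a rough illustration of the generic pipeline idea above (a single message type flowing through only the steps a source needs), here is a minimal sketch. The `Message` and `Step` types and the `build`, `trim`, and `dropEmpty` functions are hypothetical; this is not a proposal for the actual design, which is left to the RFC.

```go
package main

import (
	"fmt"
	"strings"
)

// Message is the single hypothetical type circulating between steps.
type Message struct {
	Content string
}

// Step transforms a message in place; returning nil drops it.
type Step func(*Message) *Message

// build chains only the steps that were requested. With no steps, the
// message passes through untouched and nothing extra is allocated per message.
func build(steps ...Step) Step {
	return func(m *Message) *Message {
		for _, s := range steps {
			if m = s(m); m == nil {
				return nil
			}
		}
		return m
	}
}

// trim is an example step that strips surrounding whitespace.
func trim(m *Message) *Message {
	m.Content = strings.TrimSpace(m.Content)
	return m
}

// dropEmpty is an example step that discards empty messages.
func dropEmpty(m *Message) *Message {
	if m.Content == "" {
		return nil
	}
	return m
}

func main() {
	base := build()                 // base case: no optional step instantiated
	full := build(trim, dropEmpty)  // a source that needs both steps

	fmt.Printf("%q\n", base(&Message{Content: "  raw  "}).Content)
	fmt.Printf("%q\n", full(&Message{Content: "  raw  "}).Content)
	fmt.Println(full(&Message{Content: "   "}) == nil)
}
```

Because every step shares one signature, an additional block such as an encoding step could be slotted in per source from configuration without touching the other steps.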

Interested in this RFC!

prognant · 2020-09-15T09:22:01Z

Small benchmark (EC2 t2.medium, single file on a ramdisk, offset set to 0 in the registry to tail it from the beginning). Metrics emitted with this PR are on the left; metrics emitted with the master branch are on the right:

Overall, the number of logs processed per second seems stable, and the total memory footprint is stable. As expected, there are slightly more allocations (around 5-7% from what I can tell).

prognant added the do-not-merge/WIP and [deprecated] team/agent-core labels Aug 20, 2020

prognant added this to the 7.23.0 milestone Aug 20, 2020

prognant requested review from a team as code owners August 20, 2020 09:30

prognant mentioned this pull request Aug 20, 2020

[logs] Aggregates long lines when tailing in a k8s env - part #2 #6266

Merged

prognant added the changelog/no-changelog label Aug 20, 2020

prognant changed the title ~~[WIP][logs] Aggregates long lines k8s file tailing - part #1~~ [WIP][logs] Aggregates long lines when tailing in a k8s env - part #1 Aug 21, 2020

prognant mentioned this pull request Aug 21, 2020

[WIP][logs] Docker tailer: split line support #6268

Closed

prognant force-pushed the prognant/aggregates-long-lines-k8s-file-tailing branch from 5164e89 to 262d561 August 25, 2020 15:28

prognant changed the title ~~[WIP][logs] Aggregates long lines when tailing in a k8s env - part #1~~ [logs] Aggregates long lines when tailing in a k8s env - part #1 Aug 25, 2020

prognant mentioned this pull request Aug 28, 2020

[docker log] Adjust matcher for corner cases + UT #6296

Merged

prognant removed the do-not-merge/WIP label Aug 28, 2020

prognant force-pushed the prognant/aggregates-long-lines-k8s-file-tailing branch from 262d561 to 10b9ae7 August 28, 2020 15:34

ogaca-dd approved these changes Sep 1, 2020

Comment thread pkg/logs/decoder/decoder.go

prognant force-pushed the prognant/aggregates-long-lines-k8s-file-tailing branch from 10b9ae7 to 19d6b97 September 2, 2020 09:29

truthbk reviewed Sep 8, 2020

prognant requested a review from truthbk September 15, 2020 10:16

blemale approved these changes Sep 15, 2020

Comment thread pkg/logs/decoder/decoder.go Outdated

prognant modified the milestones: 7.23.0, 7.24.0 Sep 18, 2020

truthbk modified the milestones: 7.24.0, 7.23.0 Sep 18, 2020

truthbk approved these changes Sep 18, 2020

truthbk force-pushed the prognant/aggregates-long-lines-k8s-file-tailing branch from 6ff1b7b to 08291d4 September 18, 2020 19:14

prognant requested a review from a team as a code owner September 18, 2020 19:25

truthbk force-pushed the prognant/aggregates-long-lines-k8s-file-tailing branch from 5fcedcf to e41caf9 September 18, 2020 19:57

Add partial flags when tailing k8s container logs

eb5e63c

prognant added 10 commits September 18, 2020 16:14

parser interface compliance

9604da0

Adjust UT

92e810f

[logs] Extract parsing from lineHandler logic

b592cbf

[logs] Adjust UT, minor fixes

21b85c7

[logs] UT covering previous changes

cabaf26

[logs] fix async test

482ca9d

Docker parser handles partial itself

0c2c06d

Rebase

93b0c1e

Address reviews

ea4f8d2

Address reviews

f637f06

truthbk force-pushed the prognant/aggregates-long-lines-k8s-file-tailing branch from e41caf9 to 9d94385 September 18, 2020 20:22

[logs] rebase collateral fixes

93b8256

truthbk force-pushed the prognant/aggregates-long-lines-k8s-file-tailing branch from 9d94385 to 93b8256 September 18, 2020 20:38

truthbk merged commit ea7e888 into master Sep 18, 2020

truthbk deleted the prognant/aggregates-long-lines-k8s-file-tailing branch September 18, 2020 22:15

prognant mentioned this pull request Sep 28, 2020

[logs] missing method on parser.DecodingParser #6464

Merged

truthbk merged 12 commits into master from prognant/aggregates-long-lines-k8s-file-tailing

5 participants
