Tracez Backend Benchmarking by jajanet · Pull Request #19 · kmanghat/opentelemetry-cpp

jajanet · 2020-08-20T16:15:07Z

Code for benchmarking backend components that are used in Tracez. Friend classes are used to access private functions and/or modify parts of the benchmarked classes, reducing noise in the tests. Data aggregator periodically updates are stopped, and we do benchmarking with and without a different background thread to call GetAggregatedTracezData.

Processor results:

Load Average: 0.27, 0.21, 0.19
-------------------------------------------------------------------------------------------------------------------------
Benchmark                                                                               Time             CPU   Iterations
-------------------------------------------------------------------------------------------------------------------------
TracezProcessor/BM_MakeRunning/10                                                    1474 ns         1473 ns        94675
TracezProcessor/BM_MakeRunning/1000                                                 54392 ns        54367 ns         2578
TracezProcessor/BM_GetSpans/10                                                        473 ns          473 ns       296279
TracezProcessor/BM_GetSpans/1000                                                    46988 ns        46986 ns         2986
TracezProcessor/BM_MakeRunningMakeComplete/10/process_time/real_time                37332 ns        48653 ns         3641
TracezProcessor/BM_MakeRunningMakeComplete/1000/process_time/real_time             514363 ns       904488 ns          224
TracezProcessor/BM_MakeRunningGetSpans/10/process_time/real_time                    38095 ns        48823 ns         3756
TracezProcessor/BM_MakeRunningGetSpans/1000/process_time/real_time               32439828 ns     33694986 ns            4
TracezProcessor/BM_GetSpansMakeComplete/10/process_time/real_time                   47408 ns        68470 ns         3106
TracezProcessor/BM_GetSpansMakeComplete/1000/process_time/real_time                770039 ns      1041735 ns          186
TracezProcessor/BM_MakeRunningGetSpansMakeComplete/10/process_time/real_time        59006 ns        91096 ns         2444
TracezProcessor/BM_MakeRunningGetSpansMakeComplete/1000/process_time/real_time   71411371 ns     73876469 ns            2

Aggregator results:

Load Average: 0.40, 0.16, 0.11
-----------------------------------------------------------------------------------------------
Benchmark                                                     Time             CPU   Iterations
-----------------------------------------------------------------------------------------------
TracezAggregator/BM_SingleBucketSingleName/10              3669 ns         3674 ns        38418
TracezAggregator/BM_SingleBucketSingleName/1000          279355 ns       279482 ns          501
TracezAggregatorFetch/BM_SingleBucketSingleName/10         4107 ns         4088 ns        35268
TracezAggregatorFetch/BM_SingleBucketSingleName/1000     288939 ns       288882 ns          489
TracezAggregator/BM_SingleBucketManyNames/10               3597 ns         3610 ns        39089
TracezAggregator/BM_SingleBucketManyNames/1000           279545 ns       279641 ns          501
TracezAggregatorFetch/BM_SingleBucketManyNames/10          4119 ns         4099 ns        34889
TracezAggregatorFetch/BM_SingleBucketManyNames/1000      280687 ns       280694 ns          499
TracezAggregator/BM_ManyBucketsSingleName/10             826830 ns       827120 ns         1000
TracezAggregator/BM_ManyBucketsSingleName/1000         13750523 ns     13750447 ns           79
TracezAggregatorFetch/BM_ManyBucketsSingleName/10        943028 ns       942948 ns         1000
TracezAggregatorFetch/BM_ManyBucketsSingleName/1000    11751015 ns     11749654 ns           67
TracezAggregator/BM_ManyBucketsManyNames/10             1077521 ns      1077855 ns          872
TracezAggregator/BM_ManyBucketsManyNames/1000          14186373 ns     14185956 ns           53
TracezAggregatorFetch/BM_ManyBucketsManyNames/10        1090309 ns      1089959 ns         1000
TracezAggregatorFetch/BM_ManyBucketsManyNames/1000     14621460 ns     14131165 ns           52

Related Links:

liadavid

Overall looks good, added few comments.

…en-telemetry#293)

…y#288)

kmanghat · 2020-08-26T16:21:07Z

Nice job solving the periodic update problem using friend classes!

liadavid

Nice!

kmanghat

I think we should also test the performance for sparse vs dense spans, I think the amount of information in a span would affect the performance.

kmanghat · 2020-08-26T17:32:05Z

+
+/*
+ * Aggregator handing many spans with the same name, who end instantly. This
+ * checks the scenario where there's only one Tracez name and minimal sorting


Could you explain what you mean by sorting of latencies?

i reclarified in the docstring, and it's supposed to be explaining the aggregator performance may be affected by needing to have computer whether a span is in error, running, and latency (including within bands)

Why do you say that the aggregator performance might be affected by needing to compute the type of span? Running spans are received in a separate container altogether and distinguishing between error and latency span is done with a single if statement. Are you seeing a significant variance in performance when you test with the same number of running, completed and error spans? I am just curious and I am sure you understand these tests better so feel free to push this through.

thanks for pointing that out! yes there's that minimal effect, and there's also memory considerations which i forgot to mention. all 11 buckets filled for each span name means there's more overhead compared to only 1 latency bucket

Thanks for the clarification, I forgot the tests measured memory as well.

…#282)

… in Records For Prometheus (open-telemetry#298)

jajanet added 4 commits August 20, 2020 16:04

Add initial benchmark code

0ca6905

Add working tracez processor benchmark

7a59600

Add aggregator benchmarks, use processor

44c9c42

Move latency bound to separate var

9088fd6

jajanet changed the title ~~Tracez Processor Benchmarking~~ Tracez Backend Benchmarking Aug 22, 2020

jajanet added 9 commits August 22, 2020 19:46

Merge folder restructure

94cbff3

Use args in benchmarks

cfba657

Update threading

99a8bcf

Add run complete snap benchmarks

6fca653

Remove unneeded code

be60db8

Update code comments

e1f36c7

Pass tracer by ref, test getting aggregations too

00d31a1

Add more descriptive comments

0da246d

Remove finished spans

01ae6bf

jajanet changed the base branch from master to ext-folder-restructure August 24, 2020 15:14

jajanet added 5 commits August 24, 2020 15:30

Fix typo, edit comments

6083ce2

Update comment

5c2f295

Update comments

ad3585e

Grammar, clarifications

aaf2567

numSpans->num_spans

3d2a960

liadavid reviewed Aug 24, 2020

View reviewed changes

Brandon-Kimberly and others added 4 commits August 24, 2020 15:39

Add logic to refrain from exporting un-updated metric instruments (op…

4710d4b

…en-telemetry#293)

Add zPages usage instructions, remove unneeded details (open-telemetr…

3ab51c5

…y#288)

Merge branch 'ext-folder-restructure' into tracez-benchmark

071db24

Be more descriptive, follow cpp style better

92262de

dustinpho reviewed Aug 25, 2020

View reviewed changes

Comment thread ext/zpages/test/tracez_data_aggregator_benchmark.cc Outdated

nadiaciobanu and others added 4 commits August 25, 2020 13:18

Add OTLP exporter configuration (open-telemetry#295)

ec6f70d

Add OTLP exporter example (open-telemetry#296)

eb04081

Added Http Trace Context (open-telemetry#143)

09983ab

Add Prometheus Exporter: Step 1 (open-telemetry#280)

e0a93fd

jajanet added 6 commits August 26, 2020 14:31

Use friend classes, granulate aggr fixture

6381880

Add CPU timers where necessary

a89490d

Add End only benchmark, clear processor running mem fn

44c809a

Rename i to num_spans

793c91b

Name periodic function things better

a8febb3

Reorder functions in benchmarks

a9c2619

Cleanup

de57443

liadavid reviewed Aug 26, 2020

View reviewed changes

dustinpho reviewed Aug 26, 2020

View reviewed changes

Comment thread ext/zpages/test/tracez_data_aggregator_benchmark.cc Outdated

dustinpho reviewed Aug 26, 2020

View reviewed changes

Comment thread ext/zpages/test/tracez_data_aggregator_benchmark.cc Outdated

dustinpho reviewed Aug 26, 2020

View reviewed changes

Comment thread ext/zpages/test/tracez_data_aggregator_benchmark.cc Outdated

dustinpho reviewed Aug 26, 2020

View reviewed changes

Comment thread ext/zpages/test/tracez_data_aggregator_benchmark.cc Outdated

dustinpho reviewed Aug 26, 2020

View reviewed changes

Comment thread ext/zpages/test/tracez_data_aggregator_benchmark.cc Outdated

kmanghat reviewed Aug 26, 2020

View reviewed changes

dustinpho reviewed Aug 26, 2020

View reviewed changes

Comment thread ext/zpages/test/tracez_data_aggregator_benchmark.cc

nadiaciobanu and others added 13 commits August 26, 2020 13:56

Add OTLP exporter README (open-telemetry#299)

7d4c181

Deduplicate aggregator benchmark code

5fc728f

Fix broken link (open-telemetry#303)

b23a759

Cosmetic changes from code review

4b16f82

Readabiliy, int->unsigned, makecomplete processor benchmark

b118544

const int->const unsigned, var name clarifcation

61ea8ba

Add more details, deduplicate more code

c3c7661

Add Prometheus Exporter: Step 2 - PrometheusCollector (open-telemetry…

935e7ff

…#282)

Propagate trace and parent span ids (open-telemetry#305)

c36024d

Modify KvToString() Method in Instrument.h to Allow Valid Label Names…

78012e1

… in Records For Prometheus (open-telemetry#298)

Ensure all latency bands filled, run formatter

033a980

Rebase

9772eb8

Better wording and add medium article link

2d10a23

Conversation

jajanet commented Aug 20, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

liadavid left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kmanghat commented Aug 26, 2020

Uh oh!

liadavid left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kmanghat left a comment

Choose a reason for hiding this comment

Uh oh!

kmanghat Aug 26, 2020

Choose a reason for hiding this comment

Uh oh!

jajanet Aug 27, 2020

Choose a reason for hiding this comment

Uh oh!

kmanghat Aug 27, 2020

Choose a reason for hiding this comment

Uh oh!

jajanet Aug 29, 2020

Choose a reason for hiding this comment

Uh oh!

kmanghat Sep 1, 2020

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants

jajanet commented Aug 20, 2020 •

edited

Loading