Add benchmark suite for MSQ window functions by Akshat-Jain · Pull Request #17377 · apache/druid

Akshat-Jain · 2024-10-18T11:29:38Z

Description

This PR adds a benchmark suite for MSQ window function queries.

A sample run on my local gave the following results:

Benchmark                                                         (maxNumTasks)  (rowsPerSegment)  Mode  Cnt      Score      Error  Units
MSQWindowFunctionsBenchmark.windowWithoutGroupBy                              2          20000000  avgt    5  96681.604 ± 2425.579  ms/op
MSQWindowFunctionsBenchmark.windowWithoutGroupBy                              5          20000000  avgt    5  94676.305 ± 4012.108  ms/op
MSQWindowFunctionsBenchmark.windowWithoutSorting                              2          20000000  avgt    5  17515.970 ±  498.066  ms/op
MSQWindowFunctionsBenchmark.windowWithoutSorting                              5          20000000  avgt    5  15996.262 ± 1552.218  ms/op
MSQWindowFunctionsBenchmark.multipleWindows                                   2          20000000  avgt    5   63215.499 ± 4722.604  ms/op
MSQWindowFunctionsBenchmark.multipleWindows                                   5          20000000  avgt    5   69287.847 ± 4985.326  ms/op
MSQWindowFunctionsBenchmark.windowWithHighCardinalityPartitionBy              2          20000000  avgt    5   69469.122 ± 3016.019  ms/op
MSQWindowFunctionsBenchmark.windowWithHighCardinalityPartitionBy              5          20000000  avgt    5   70951.896 ± 4343.354  ms/op
MSQWindowFunctionsBenchmark.windowWithLowCardinalityPartitionBy               2          20000000  avgt    5     507.584 ±  600.999  ms/op
MSQWindowFunctionsBenchmark.windowWithLowCardinalityPartitionBy               5          20000000  avgt    5     413.795 ±   38.195  ms/op
MSQWindowFunctionsBenchmark.windowWithSorting                                 2          20000000  avgt    5   16682.792 ±  239.561  ms/op
MSQWindowFunctionsBenchmark.windowWithSorting                                 5          20000000  avgt    5   16422.890 ±  225.643  ms/op

(Note: The above run was done with the changes of #17373, as otherwise I was running into the Channel has no capacity issue)

This PR has:

been self-reviewed.
added documentation for new or modified features or behaviors.
a release note entry in the PR description.
added Javadocs for most classes and all non-trivial methods. Linked related entities via Javadoc links.
added or updated version, license, or notice information in licenses.yaml
added comments explaining the "why" and the intent of the code wherever would not be obvious for an unfamiliar reader.
added unit tests or modified existing tests to cover new code paths, ensuring the threshold for code coverage is met.
added integration tests.
been tested in a test Druid cluster.

adarshsanjeev

Overall, this looks good to me. Few comments.

adarshsanjeev · 2024-10-29T04:27:09Z

+      runStep.run();
+    }
+
+    BaseExecuteQuery execStep = (BaseExecuteQuery) runSteps.get(runSteps.size() - 1);


I'm not very familiar with this, why is the change required to run all runsteps for this benchmark (and not others)?

queryTestBuilder.results() didn't support MSQ apparently, since it only had the execute step, and not the ExtractResultsFactory step.

So, without this change, instead of getting the query results, we were getting the query ID. The ExtractResultsFactory step (added as a custom runner when declaring QueryTestBuilder) takes the query ID, and contacts the overlord client, and fetches the actual results.

(and not others)?

In the regular MSQ tests, we do testBuilder().run() which handled both steps already.

* Add benchmark suite for MSQ window functions * Fix inspection checks * Address review comment: Rename method

Add benchmark suite for MSQ window functions

97a9b3a

github-actions Bot added Area - Batch Ingestion Area - Querying Area - Dependencies Area - MSQ For multi stage queries - https://github.com/apache/druid/issues/12262 labels Oct 18, 2024

Fix inspection checks

5019c9f

adarshsanjeev reviewed Oct 28, 2024

View reviewed changes

Address review comment: Rename method

2ec2195

Akshat-Jain requested a review from adarshsanjeev October 28, 2024 05:03

adarshsanjeev approved these changes Oct 29, 2024

View reviewed changes

cryptoe merged commit 21e7e5c into apache:master Oct 30, 2024

jtuglu1 pushed a commit to jtuglu1/druid that referenced this pull request Nov 20, 2024

Add benchmark suite for MSQ window functions (apache#17377)

c891091

* Add benchmark suite for MSQ window functions * Fix inspection checks * Address review comment: Rename method

adarshsanjeev added this to the 32.0.0 milestone Jan 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add benchmark suite for MSQ window functions#17377

Add benchmark suite for MSQ window functions#17377
cryptoe merged 3 commits intoapache:masterfrom
Akshat-Jain:msq-wf-benchmarks

Akshat-Jain commented Oct 18, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

adarshsanjeev left a comment

Uh oh!

adarshsanjeev Oct 29, 2024

Uh oh!

Akshat-Jain Oct 29, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Akshat-Jain commented Oct 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

adarshsanjeev left a comment

Choose a reason for hiding this comment

Uh oh!

adarshsanjeev Oct 29, 2024

Choose a reason for hiding this comment

Uh oh!

Akshat-Jain Oct 29, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Akshat-Jain commented Oct 18, 2024 •

edited

Loading