
Conversation

@aaltay (Member) commented Nov 26, 2019

Fixes the mistake from #10012, which doubled the measured time instead of the allowed overhead.
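
For illustration, a minimal sketch of the distinction, mirroring the overhead_us-style assertion shown in the review thread below; the measurement value and both checks are assumptions for the example, not quoted from #10012:

    # Illustration only -- the values and checks are assumptions, not the #10012 diff.
    measured_overhead_us = 8.0  # example per-element overhead, in microseconds

    # Intended change: relax the allowed overhead by doubling the bound.
    print(measured_overhead_us < 20.0)       # True  -> the relaxed check passes

    # Mistake described above: double the measured time and keep the old bound,
    # which effectively halves the allowed overhead instead of doubling it.
    print(measured_overhead_us * 2 < 10.0)   # False -> the check became stricter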


Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:

  • Choose reviewer(s) and mention them in a comment (R: @username).
  • Format the pull request title like [BEAM-XXX] Fixes bug in ApproximateQuantiles, where you replace BEAM-XXX with the appropriate JIRA issue, if applicable. This will automatically link the pull request to the issue.
  • If this contribution is large, please file an Apache Individual Contributor License Agreement.

See the Contributor Guide for more tips on how to make review process smoother.

Post-Commit Tests Status (on master branch): table of Jenkins build-status badges per SDK (Go, Java, Python, XLang) across the Apex, Dataflow, Flink, Gearpump, Samza, and Spark runners; badge images omitted.

Pre-Commit Tests Status (on master branch): table of build-status badges for the Java, Python, Go, and Website pre-commit jobs (non-portable and portable); badge images omitted.

See .test-infra/jenkins/README for the trigger phrases, statuses, and links of all Jenkins jobs.

@aaltay requested a review from tvalentyn on November 26, 2019 00:16
@aaltay (Member Author) commented Nov 26, 2019

Run Portable_Python PreCommit

  # take 0.17us when compiled in opt mode or 0.48 us when compiled in
  # debug mode).
- self.assertLess(overhead_us, 10.0)
+ self.assertLess(overhead_us, 20.0)
Contributor:

Seeing the comment on line #127 now: the upper bound is already set to 20x the expected value. If the test still fails after 3 attempts, perhaps we should investigate what triggers the failure? Otherwise I am not sure we should keep this test. What overhead is bad enough to investigate? Also, if we ever parallelize test execution, that may increase this test's flakiness. Perhaps it needs to be a [micro]benchmark.
LGTM to address the flakiness issue regardless of this discussion.
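
A minimal sketch of what such a standalone microbenchmark could look like, using only the standard library; profiled_operation() is a hypothetical placeholder for the instrumented code path, not Beam's actual benchmark tooling:

    # Sketch of a standalone microbenchmark; profiled_operation() is a placeholder.
    import timeit

    ITERATIONS = 1000000

    def profiled_operation():
        # Stand-in for the code path whose per-element overhead the test measures.
        pass

    total_s = timeit.timeit(profiled_operation, number=ITERATIONS)
    overhead_us = total_s * 1e6 / ITERATIONS
    print('per-call overhead: %.3f us over %d iterations' % (overhead_us, ITERATIONS))

Run in an isolated environment, a benchmark like this could feed a dashboard instead of gating the unit-test suite.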

Member Author:

You have a good point. Perhaps 3 retries is just enough. I do regularly see overhead on the order of 5.0 - 10.0 in slow environments; > 10.0 is rare, and 3 retries will likely address the flakiness.

Maybe I can revert this part. What do you think?
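
For reference, a minimal sketch of the retry-on-flake pattern being discussed; the retry_flaky decorator below is illustrative, not the retry helper actually used in Beam's test suite:

    # Illustrative retry decorator: re-runs a flaky test body before failing it.
    import functools
    import time

    def retry_flaky(attempts=3, delay_s=1.0):
        def decorator(fn):
            @functools.wraps(fn)
            def wrapper(*args, **kwargs):
                for attempt in range(1, attempts + 1):
                    try:
                        return fn(*args, **kwargs)
                    except AssertionError:
                        if attempt == attempts:
                            raise
                        time.sleep(delay_s)
            return wrapper
        return decorator

    # Usage (illustrative):
    #   @retry_flaky(attempts=3)
    #   def test_sampler_overhead(self):
    #       ...
    #       self.assertLess(overhead_us, 20.0)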

Contributor:

I think this test needs to be converted to a benchmark and run in an isolated environment. I suspect we are not getting much value from having it in the test suite: if the overhead increases 5x, the test will become more flaky and we will likely raise the upper bound again instead of investigating. Given the current state, if this test exercises some code path that other tests don't, it may be better to keep it in the suite as long as it does not flake, but long term we should move it out of the unit tests. That said, I am OK with keeping the value at 20, or even removing the assertion until we make it a benchmark.

Member Author:

It makes sense to convert this to a microbenchmark. I think the main value of keeping this as a test rather than a benchmark is that a unit test can potentially catch changes that introduce large overhead increases here, whereas most authors will not run benchmarks for their changes unless they know or suspect potential perf implications.

/cc @pabloem

Contributor:

I had in mind a benchmark with alerts / dashboards. I realize that we probably don't have infrastructure for this now. I am not sure how likely it is that someone will make a change that exceeds the threshold by 20x; I'd say unlikely but not impossible. A more likely scenario would be a 3x regression, which the test might not catch in time.
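
As an illustration of that kind of alerting, a minimal sketch of a baseline-relative regression check; BASELINE_US and the 3x factor are assumptions for the example, not an existing Beam check:

    # Sketch of a baseline-relative regression alert; the numbers are assumptions.
    BASELINE_US = 1.0        # historical per-element overhead, in microseconds
    REGRESSION_FACTOR = 3.0  # alert on a 3x (or worse) regression

    def check_regression(measured_us):
        ratio = measured_us / BASELINE_US
        if ratio >= REGRESSION_FACTOR:
            print('ALERT: overhead %.2f us is %.1fx the baseline' % (measured_us, ratio))
        else:
            print('ok: overhead %.2f us (%.1fx baseline)' % (measured_us, ratio))

    check_regression(3.5)  # example: would flag a regression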

Member:

I agree that this is better as a microbenchmark, but yes, we need alerting and updates. We have a good amount of infrastructure in place to set up our microbenchmarks this way in the next few months (we just need alerts).

Once we have all the infrastructure pieces, I will be happy to look into creating the microbenchmark suite with metrics exported.

@aaltay (Member Author) commented Nov 26, 2019

Run Python PreCommit

@aaltay (Member Author) commented Nov 28, 2019

Run Python PreCommit

@aaltay (Member Author) commented Dec 3, 2019

Could not find a good way to pull from master into this apache/beam branch to get the test fixes. Moved the changes to a new PR: #10264

@aaltay closed this Dec 3, 2019
@aaltay deleted the aaltay-patch-1-1 branch December 3, 2019 02:02