[bp/1.35] Flakes stack#44685
Closed
Copilot wants to merge 8 commits intorelease/v1.35from
Closed
Conversation
Co-authored-by: phlax <454682+phlax@users.noreply.github.com>
as this can still flake in msan Signed-off-by: Ryan Northey <ryan@synca.io> Co-authored-by: phlax <454682+phlax@users.noreply.github.com>
Scale RequestTimeoutMs by TIMEOUT_FACTOR to match IdleTimeoutMs. The hard-coded 200ms races test setup on loaded/sanitizer runs, causing the request timer to fire before the test body executes. fix #26979 Signed-off-by: Ryan Northey <ryan@synca.io> Co-authored-by: phlax <454682+phlax@users.noreply.github.com>
#44500) QuicSpdyClientSession::ShouldCreateOutgoingBidirectionalStream() can legitimately return false, but we return true unconditionally to avoid a nullptr deref in QuicHttpClientConnectionImpl::newStream. The existing ASSERT on the parent's return value was therefore incorrect and triggered flakes (e.g. in buffer_accounting_integration_test on MSAN). Drop the ASSERT and update the comment. Fixes #41526 --------- Signed-off-by: Ryan Northey <ryan@synca.io> Co-authored-by: phlax <454682+phlax@users.noreply.github.com>
by separating the tests Hopefully fix: #44427 Signed-off-by: Ryan Northey <ryan@synca.io> Co-authored-by: phlax <454682+phlax@users.noreply.github.com>
Signed-off-by: Adi Suissa-Peleg <adip@google.com> Co-authored-by: phlax <454682+phlax@users.noreply.github.com>
Copilot
AI
changed the title
[WIP] Backport flake-fix stack from PR #44644 to release/v1.35
[bp/1.35] Flakes stack
Apr 27, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Backport of the flake-fix stack from #44644 (
[bp/1.37] Flakes stack) torelease/v1.35. 7 of the 8 commits apply; #43711 is intentionally skipped (see below).Cherry-picked commits
test: Fix ws integration test flake— dropscpu:3/largesize tags, adds timeout-driven websocket upgrade teststest/ws: Follow up fixes from #44508test/ws: Increase sharding further— still flaking under MSANtest/idle_timeout: Fix flake— scaleRequestTimeoutMsbyTIMEOUT_FACTORShouldCreateOutgoingBidirectionalStream#44500 —quic: Remove flaky ASSERT in ShouldCreateOutgoingBidirectionalStream— ASSERT was incorrect; parent can legitimately return false (fixes MSAN flakes inbuffer_accounting_integration_test)test: Attempt to fix flake in grpc transcoder test— split combined testlrs: proper shutdown of the LRS stream— adds~LoadStatsReporterdestructor to disable timers and close gRPC stream before destruction, fixing shutdown race with ADSSkipped
#43711 (
dns test: improve tests by teardown the server and upstream) —dns_filter_integration_test.cconrelease/v1.35uses thecaresresolver, lacksOsSysCallsWithMockedDnsandsetupResponseParser-based fixture; the teardown patch does not apply cleanly and the flake context differs.Conflict resolutions
changelogs/current.yamlcontext conflict — retained only theload_reportbug_fixes entry; dropped unrelatedmain-only entries.websocket_integration_test.cc— new tests were inserted between two adjacent tests in v1.35; accepted incoming additions.Related
Warning
Firewall rules blocked me from connecting to one or more addresses (expand for details)
I tried to connect to the following addresses, but was blocked by firewall rules:
https://api.github.com/graphql/usr/bin/gh gh auth status(http block)https://api.github.com/repos/envoyproxy/envoy/pulls/44685/usr/bin/curl curl -s -X PATCH -H Authorization: token ****** -H Accept: application/vnd.github.v3+json REDACTED -d {"title": "[bp/1.35] Flakes stack", "draft": false}(http block)/usr/bin/curl curl -v -X PATCH -H Authorization: token ****** -H Accept: application/vnd.github.v3+json REDACTED -d {"title": "[bp/1.35] Flakes stack", "draft": false}(http block)/usr/bin/curl curl -s -o /tmp/pr_update_result.json -w %{http_code} -X PATCH -H Authorization: token ****** -H Accept: application/vnd.github.v3+json -H Content-Type: application/json REDACTED -d {"title": "[bp/1.35] Flakes stack"}(http block)If you need me to access, download, or install something from one of these locations, you can either:
Original prompt
Backport the flake-fix stack from PR #44644 (which targeted
release/v1.37) torelease/v1.35.Cherry-pick the following 7 upstream commits in order from
mainontorelease/v1.35(note: one of the 8 commits from the 1.37 backport is intentionally skipped for 1.35 — see below):1b7af978b8819e08d2a804556d2ca52d77a0169c— test: Fix ws integration test flake (test: Fix ws integration test flake #44508)095e13d2740727c31e781f7abc4fb67ab1dd04ac— test/ws: Follow up fixes from test: Fix ws integration test flake #44508 (test/ws: Follow up fixes from #44508 #44518)080522bf396dd344fe28dfc98c29eea4e7fab012— test/idle_timeout: Fix flake (test/idle_timeout: Fix flake #44584)ShouldCreateOutgoingBidirectionalStream#44500) — quic: Remove flaky ASSERT inShouldCreateOutgoingBidirectionalStream(quic: Remove flaky ASSERT inShouldCreateOutgoingBidirectionalStream#44500)Skip: #43711 (
dns test: improve tests by teardown the server and upstream). Thetest/extensions/filters/udp/dns_filter/dns_filter_integration_test.cconrelease/v1.35is structurally different (uses thecaresresolver instead ofgetaddrinfo, lacks theOsSysCallsWithMockedDnsmock, lackssetupResponseParser-based fixture). The teardown patch does not apply cleanly and the original flake context differs — skip rather than hand-port.You can find the exact merge SHAs for the squash commits by looking at the commits in
phlax:bp-1.37-flakes(the source branch of #44644). The list is available via/repos/envoyproxy/envoy/pulls/44644/commits.Verification done already against
release/v1.35:test/integration/idle_timeout_integration_test.ccstill hasRequestTimeoutMs = 200;— test/idle_timeout: Fix flake #44584 applies.source/common/quic/envoy_quic_client_session.hstill has the flakyASSERT(quic::QuicSpdyClientSession::ShouldCreateOutgoingBidirectionalStream())— quic: Remove flaky ASSERT inShouldCreateOutgoingBidirectionalStream#44500 applies.source/common/upstream/load_stats_reporter.hhas no~LoadStatsReporterdestructor — lrs: proper shutdown of the LRS stream #43083 applies (note: the constructor signature on 1.35 takesGrpc::RawAsyncClientPtr async_clientrather thanGrpc::RawAsyncClientSharedPtr&&— this does not affect the destructor patch but be aware while resolving any conflicts).Workflow:
bp-1.35-flakesfromrelease/v1.35.[bp/1.35] Flakes stackagainst baserelease/v1.35.[bp/1.35]PRs (check recent merged 1.35 backports for reference).Do NOT open any issues. Just the PR.
The following is the prior conversation context from the user's chat exploration (may be truncated):
User: this backported a load of flakes to 1.37 - wondering if all are correct/needed/fix somethign etc? for other supported branches - 1.35 and 1.36 (release/v{x})
Assistant: ## Summary of
[bp/1.37] Flakes stack(#44644)The PR backported 8 upstream commits to
release/v1.37. Here they are with applicability torelease/v1.36andrelease/v1.35:test: Fix ws integration test flake— fixes 200ms timeout race + dropscpu:3/large size intest/integration/BUILDwebsocket_integration_testtest/ws: Follow up fixes from #44508test/ws: Increase sharding furthertest/idle_timeout: Fix flake— scaleRequestTimeoutMsbyTIMEOUT_FACTORquic: Remove flaky ASSERT in ShouldCreateOutgoingBidirectionalStreambuffer_accounting_integration_testMSANThis pull request was created from Copilot chat.