Bug 1970315: testPodSandboxCreation: skip sandbox errors for pods which were not deleted during network update #26208
Conversation
/test e2e-aws-upgrade
Branch updated: 92d2deb to 124818d
@vrutkovs: This pull request references Bugzilla bug 1970315, which is invalid.
/lgtm
pkg/synthetictests/networking.go (Outdated), excerpt:

```go
if deletionTime == nil {
	// mark sandboxes errors as flakes if networking is being updated
	// these pods eventually get created
	operatorsProgressing := intervalcreation.IntervalsFromEvents_OperatorProgressing(events, event.From, event.To)
```
Hrm, this could be O(N^2) on a pretty big N, have you verified that IntervalsFromEvents uses binary search?
Seems IntervalsFromEvents_OperatorProgressing is O(N)
The list should be sorted, so if you know `from`/`to` you can do a binary search (O(log n)) to find the start, and then the same for the end. Or maybe just do a single pass at the beginning and calculate all the intervals in which the operator is progressing (which should be a very small set), and then just do the smaller loop here?
(see intervals.go / monitor.go for a method that already uses sort.Search() to do this)
Reworked this to use monitorapi functions:
- `CopyAndSort` to create a copy of events and sort them by type
- `IntervalsFromEvents_OperatorProgressing` to build a list of operator progressing events
- `sort.Search` to find events for network/machine-config
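For illustration, here is a minimal self-contained sketch of that pattern: compute the progressing intervals once, keep them sorted by start time, and answer each per-pod lookup with `sort.Search`. All type and helper names below are hypothetical stand-ins, not the actual monitorapi API:

```go
package main

import (
	"fmt"
	"sort"
	"time"
)

// Interval is a hypothetical stand-in for a monitor interval during which
// an operator (e.g. network or machine-config) reported Progressing=True.
type Interval struct {
	From, To time.Time
}

// duringUpdate reports whether t falls inside any interval. It assumes the
// intervals are sorted by From and non-overlapping, so To is non-decreasing
// and the sort.Search predicate is monotone: each lookup is O(log n)
// instead of a linear scan per pod event.
func duringUpdate(intervals []Interval, t time.Time) bool {
	// index of the first interval that has not ended before t
	i := sort.Search(len(intervals), func(i int) bool {
		return !intervals[i].To.Before(t)
	})
	return i < len(intervals) && !intervals[i].From.After(t)
}

func main() {
	base := time.Date(2021, 6, 1, 0, 0, 0, 0, time.UTC)
	// built once per run from the operator Progressing events
	progressing := []Interval{
		{From: base, To: base.Add(5 * time.Minute)},
		{From: base.Add(20 * time.Minute), To: base.Add(30 * time.Minute)},
	}
	fmt.Println(duringUpdate(progressing, base.Add(2*time.Minute)))  // true
	fmt.Println(duringUpdate(progressing, base.Add(10*time.Minute))) // false
}
```

Building the interval list once and binary-searching per event keeps the overall cost near O(N log n), addressing the O(N^2) concern raised above.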
/bugzilla refresh
Recalculating validity in case the underlying Bugzilla bug has changed.
@openshift-bot: This pull request references Bugzilla bug 1970315, which is invalid.
/bugzilla refresh
The main branch will open for development of next OCP version. Recalculating validity of PRs linked to this PR.
@openshift-bot: This pull request references Bugzilla bug 1970315, which is invalid.
/bugzilla refresh
Recalculating validity in case the underlying Bugzilla bug has changed.
@openshift-bot: This pull request references Bugzilla bug 1970315, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker. 3 validation(s) were run on this bug. Requesting review from QA contact.
/retest
1 similar comment
ravisantoshgudimetla left a comment:
/lgtm
Thank you for working on this @vrutkovs
/retest
[APPROVALNOTIFIER] This PR is APPROVED. Approval requirements bypassed by manually added approval. This pull-request has been approved by: petr-muller, ravisantoshgudimetla, vrutkovs
/retest
Please review the full test history for this PR and help us cut down flakes.
6 similar comments
/test e2e-aws-upgrade
/cherrypick release-4.8
@vrutkovs: once the present PR merges, I will cherry-pick it on top of release-4.8 in a new PR and assign it to you.
/skip
e2e-aws-upgrade failing due to openshift/release#19836
/retest
/test e2e-metal-ipi-ovn-ipv6
1 similar comment
/retest
Please review the full test history for this PR and help us cut down flakes.
@vrutkovs: The following tests failed.
/retest
Please review the full test history for this PR and help us cut down flakes.
1 similar comment
@vrutkovs: All pull requests linked via external trackers have merged: Bugzilla bug 1970315 has been moved to the MODIFIED state.
@vrutkovs: new pull request created: #26297
"pods should successfully create sandboxes" test should mark pod events as flakes if network is being updated.
During network update CNI binaries may be in the middle of update. This may cause sandbox errors like:
- error adding container to network "ovn-kubernetes": failed to send CNI request: Post "http://dummy/": EOF
- Multus: [openshift-dns/dns-default-nbkz2]: have you checked that your default network is ready? still waiting for readinessindicatorfile @ /var/run/multus/cni/net.d/10-ovn-kubernetes.conf. pollimmediate error: timed out waiting for the condition
- error adding container to network "openshift-sdn": failed to find plugin "openshift-sdn" in path [/opt/multus/bin /var/lib/cni/bin /usr/libexec/cni]

"never deleted" search in 4.7 -> 4.8 upgrades for last 7 days
If these events are occurring during a network/machine-config update and the sandboxes eventually get created (i.e. the pod never gets deleted), these events are marked as flakes.
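As a rough sketch of that rule (hypothetical names, not the actual helper in pkg/synthetictests/networking.go):

```go
package synthetictests

// sandboxErrorIsFlake is a hypothetical sketch of the rule described above:
// a sandbox-creation error is downgraded to a flake only when the pod was
// never deleted (its sandbox eventually came up) and the error overlapped a
// network or machine-config operator update, when CNI binaries may be
// mid-rollout. Everything else remains a hard test failure.
func sandboxErrorIsFlake(podWasDeleted, duringOperatorUpdate bool) bool {
	return !podWasDeleted && duringOperatorUpdate
}
```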
Test runs: