@dgoodwin
Contributor

In PR #27516 we suspect reporting of flakes broke due to a missed
assumption that test.flake accompanied test.success. Our new goal is to
have exactly one status set per test, so we're going to lean into the
new approach and properly break out the flake state into its own case.
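
To make the "exactly one status" idea concrete, here is a minimal Go sketch. The `testCase` struct and the `statusOf` helper are hypothetical, modeled only on the field names mentioned in this PR (`success`, `flake`, `failed`, `skipped`); the real types in origin may look different.

```go
package main

import "fmt"

// testCase is a hypothetical stand-in for the test record discussed in this PR.
// Only the status flags are modeled; exactly one is expected to be set.
type testCase struct {
	name    string
	success bool
	flake   bool
	failed  bool
	skipped bool
}

// statusOf maps a test to a single status string. Flake has its own case
// rather than being inferred from success, so a test with only flake set
// is still reported.
func statusOf(t *testCase) string {
	switch {
	case t.flake:
		return "flake"
	case t.success:
		return "success"
	case t.failed:
		return "failed"
	case t.skipped:
		return "skipped"
	default:
		// No status set: the test never completed.
		return "incomplete"
	}
}

func main() {
	tests := []*testCase{
		{name: "a", success: true},
		{name: "b", flake: true}, // a flake no longer also sets success
		{name: "c", failed: true},
		{name: "d"},
	}
	for _, t := range tests {
		fmt.Printf("%s: %s\n", t.name, statusOf(t))
	}
}
```

Any code that previously assumed a flake also had `success` set would need the same explicit flake case, otherwise the flake falls through to the incomplete branch.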

@dgoodwin dgoodwin changed the title from "Fix flake reporting for certain tests." to "OCPBUGS-3633: Fix flake reporting for certain tests." Nov 15, 2022
@openshift-ci-robot openshift-ci-robot added jira/valid-bug Indicates that a referenced Jira bug is valid for the branch this PR is targeting. bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. labels Nov 15, 2022
@openshift-ci-robot

@dgoodwin: This pull request references Jira Issue OCPBUGS-3633, which is valid. The bug has been moved to the POST state.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target version (4.13.0) matches configured target version for branch (4.13.0)
  • bug is in the state ASSIGNED, which is one of the valid states (NEW, ASSIGNED, POST)

The bug has been updated to refer to the pull request using the external bug tracker.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@dgoodwin
Contributor Author

/hold
/payload 4.13 ci blocking
/payload 4.13 nightly blocking

@openshift-ci openshift-ci bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Nov 15, 2022
@openshift-ci
Contributor

openshift-ci bot commented Nov 15, 2022

@dgoodwin: trigger 4 job(s) of type blocking for the ci release of OCP 4.13

  • periodic-ci-openshift-release-master-ci-4.13-e2e-aws-ovn-upgrade
  • periodic-ci-openshift-release-master-ci-4.13-upgrade-from-stable-4.12-e2e-aws-ovn-upgrade
  • periodic-ci-openshift-release-master-ci-4.13-upgrade-from-stable-4.12-e2e-azure-sdn-upgrade
  • periodic-ci-openshift-release-master-ci-4.13-e2e-aws-sdn-serial

See details on https://pr-payload-tests.ci.openshift.org/runs/ci/167ce980-64fa-11ed-9d34-9258f548db56-0

trigger 6 job(s) of type blocking for the nightly release of OCP 4.13

  • periodic-ci-openshift-release-master-nightly-4.13-e2e-aws-sdn-upgrade
  • periodic-ci-openshift-release-master-ci-4.13-e2e-azure-ovn-upgrade
  • periodic-ci-openshift-release-master-ci-4.13-upgrade-from-stable-4.12-e2e-gcp-ovn-upgrade
  • periodic-ci-openshift-release-master-nightly-4.13-e2e-aws-sdn-serial
  • periodic-ci-openshift-release-master-nightly-4.13-e2e-metal-ipi-ovn-ipv6
  • periodic-ci-openshift-release-master-nightly-4.13-e2e-metal-ipi-sdn-bm

See details on https://pr-payload-tests.ci.openshift.org/runs/ci/167ce980-64fa-11ed-9d34-9258f548db56-1

@openshift-ci openshift-ci bot requested review from mfojtik and spadgett November 15, 2022 15:28
@openshift-ci openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Nov 15, 2022
@xueqzhan
Contributor

/lgtm

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Nov 15, 2022
@dgoodwin
Contributor Author

/hold

Payloads look totally busted, not sure why yet.

@dgoodwin
Contributor Author

/payload 4.13 ci blocking
/payload 4.13 nightly blocking

@openshift-ci
Contributor

openshift-ci bot commented Nov 15, 2022

@dgoodwin: trigger 4 job(s) of type blocking for the ci release of OCP 4.13

  • periodic-ci-openshift-release-master-ci-4.13-e2e-aws-ovn-upgrade
  • periodic-ci-openshift-release-master-ci-4.13-upgrade-from-stable-4.12-e2e-aws-ovn-upgrade
  • periodic-ci-openshift-release-master-ci-4.13-upgrade-from-stable-4.12-e2e-azure-sdn-upgrade
  • periodic-ci-openshift-release-master-ci-4.13-e2e-aws-sdn-serial

See details on https://pr-payload-tests.ci.openshift.org/runs/ci/8bfdbc10-650f-11ed-9bdb-7ae30ba4a890-0

trigger 6 job(s) of type blocking for the nightly release of OCP 4.13

  • periodic-ci-openshift-release-master-nightly-4.13-e2e-aws-sdn-upgrade
  • periodic-ci-openshift-release-master-ci-4.13-e2e-azure-ovn-upgrade
  • periodic-ci-openshift-release-master-ci-4.13-upgrade-from-stable-4.12-e2e-gcp-ovn-upgrade
  • periodic-ci-openshift-release-master-nightly-4.13-e2e-aws-sdn-serial
  • periodic-ci-openshift-release-master-nightly-4.13-e2e-metal-ipi-ovn-ipv6
  • periodic-ci-openshift-release-master-nightly-4.13-e2e-metal-ipi-sdn-bm

See details on https://pr-payload-tests.ci.openshift.org/runs/ci/8bfdbc10-650f-11ed-9bdb-7ae30ba4a890-1

@dgoodwin
Contributor Author

I don't think this worked. The payload command is pretty busted at least for minor upgrades, but this micro ran:

https://prow.ci.openshift.org/view/gs/origin-ci-test/logs/openshift-origin-27553-ci-4.13-e2e-aws-ovn-upgrade/1592578595266498560

In here we see no hit for the "firing or pending" test. Checking the junit files, I can see it ran in the output, but it has no individual junit entry, just like before.

@DennisPeriquet
Contributor

I went to that job and saw "Alerts shouldn't report any unexpected alerts in firing or pending state" in the e2e spyglass chart, but not under Flakes, Passing, or Skipped.

@dgoodwin dgoodwin force-pushed the fix-flake-reporting branch from 47f277e to aa7fb74 Compare November 16, 2022 13:47
@openshift-ci openshift-ci bot removed the lgtm Indicates that a PR is ready to be merged. label Nov 16, 2022
@dgoodwin
Contributor Author

@xueqzhan may have found the issue, PR updated, trying again.

/payload 4.13 ci blocking

@openshift-ci
Contributor

openshift-ci bot commented Nov 16, 2022

@dgoodwin: trigger 4 job(s) of type blocking for the ci release of OCP 4.13

  • periodic-ci-openshift-release-master-ci-4.13-e2e-aws-ovn-upgrade
  • periodic-ci-openshift-release-master-ci-4.13-upgrade-from-stable-4.12-e2e-aws-ovn-upgrade
  • periodic-ci-openshift-release-master-ci-4.13-upgrade-from-stable-4.12-e2e-azure-sdn-upgrade
  • periodic-ci-openshift-release-master-ci-4.13-e2e-aws-sdn-serial

See details on https://pr-payload-tests.ci.openshift.org/runs/ci/42760470-65b5-11ed-9360-8bb1c6594d2c-0

@DennisPeriquet
Contributor

/lgtm

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Nov 16, 2022
@openshift-ci
Contributor

openshift-ci bot commented Nov 16, 2022

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: DennisPeriquet, dgoodwin, xueqzhan

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment


 // calculate the effective test set we ran, excluding any incompletes
-tests, _ = splitTests(tests, func(t *testCase) bool { return t.success || t.failed || t.skipped })
+tests, _ = splitTests(tests, func(t *testCase) bool { return t.success || t.flake || t.failed || t.skipped })
Contributor

Whatever this does, we should clean this up sometime this quarter.
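
For context on what the changed line does, here is a minimal sketch. It assumes `splitTests` partitions a slice by a predicate and returns the matching tests first; that behavior is inferred from the call site in the diff, and the `testCase` struct, `completedOld`, and `completedNew` are hypothetical names used only for illustration.

```go
package main

import "fmt"

// testCase is a hypothetical stand-in carrying only the status flags
// that appear in the diff.
type testCase struct {
	name                            string
	success, flake, failed, skipped bool
}

// splitTests is an assumed reimplementation: it partitions tests into those
// for which fn returns true and the rest, preserving order.
func splitTests(tests []*testCase, fn func(*testCase) bool) (matched, rest []*testCase) {
	for _, t := range tests {
		if fn(t) {
			matched = append(matched, t)
		} else {
			rest = append(rest, t)
		}
	}
	return matched, rest
}

// completedOld mirrors the removed predicate: a flake-only test is treated
// as incomplete and drops out of the effective test set.
func completedOld(t *testCase) bool { return t.success || t.failed || t.skipped }

// completedNew mirrors the added predicate, which also accepts flakes.
func completedNew(t *testCase) bool {
	return t.success || t.flake || t.failed || t.skipped
}

func main() {
	tests := []*testCase{
		{name: "pass", success: true},
		{name: "flaky", flake: true},
		{name: "fail", failed: true},
		{name: "skip", skipped: true},
		{name: "never-ran"},
	}
	oldSet, _ := splitTests(tests, completedOld)
	newSet, _ := splitTests(tests, completedNew)
	fmt.Println(len(oldSet), len(newSet)) // prints "3 4"
}
```

Under these assumptions, once a flake no longer sets `success`, the old predicate silently drops it from the effective set, which would match the symptom of a test that ran but showed up in none of the result buckets.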

@deads2k
Contributor

deads2k commented Nov 16, 2022

lgtm as well.

@dgoodwin
Contributor Author

The payload run itself didn't do well because the minor upgrades will keep failing until we get openshift/release#34072.

Serial didn't install. And https://prow.ci.openshift.org/view/gs/origin-ci-test/logs/aggregator-periodic-ci-openshift-release-master-ci-4.13-e2e-aws-ovn-upgrade/1592877204990070784 looks to have failed for real reasons.

Test pass/flake/skip counts look correct, I think we've got it fixed.

@dgoodwin
Contributor Author

/cherry-pick release-4.12

/retest-required

@openshift-cherrypick-robot

@dgoodwin: once the present PR merges, I will cherry-pick it on top of release-4.12 in a new PR and assign it to you.

@dgoodwin
Contributor Author

/hold cancel
/retest-required

@openshift-ci openshift-ci bot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Nov 16, 2022
@DennisPeriquet
Contributor

/test e2e-aws-ovn-fips

@openshift-ci
Contributor

openshift-ci bot commented Nov 17, 2022

@dgoodwin: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

| Test name | Commit | Details | Required | Rerun command |
| --- | --- | --- | --- | --- |
| ci/prow/e2e-aws-ovn-single-node | aa7fb74 | link | false | /test e2e-aws-ovn-single-node |
| ci/prow/e2e-aws-ovn-single-node-upgrade | aa7fb74 | link | false | /test e2e-aws-ovn-single-node-upgrade |
| ci/prow/e2e-aws-ovn-single-node-serial | aa7fb74 | link | false | /test e2e-aws-ovn-single-node-serial |
| ci/prow/e2e-aws-ovn-upgrade | aa7fb74 | link | false | /test e2e-aws-ovn-upgrade |
| ci/prow/e2e-openstack-ovn | aa7fb74 | link | false | /test e2e-openstack-ovn |

@openshift-merge-robot openshift-merge-robot merged commit 403f0a4 into openshift:master Nov 17, 2022
@openshift-ci-robot

@dgoodwin: Some pull requests linked via external trackers have merged:

The following pull requests linked via external trackers have not merged:

These pull requests must merge or be unlinked from the Jira bug in order for it to move to the next state. Once unlinked, request a bug refresh with /jira refresh.

Jira Issue OCPBUGS-3633 has not been moved to the MODIFIED state.

@openshift-cherrypick-robot

@dgoodwin: new pull request created: #27557

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. jira/valid-bug Indicates that a referenced Jira bug is valid for the branch this PR is targeting. lgtm Indicates that a PR is ready to be merged.

7 participants