test: increase e2e test run with 15 minutes#2184
test: increase e2e test run with 15 minutes#2184sinnykumari wants to merge 1 commit intoopenshift:masterfrom
Conversation
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: sinnykumari The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
By analyzing some of gcp-op test run logs, it seems system reboot time has increased by around 30 seconds.
|
|
I am not sure why the time has increased, we can investigate that later. Until then let's increase the timeout so that we unblock PRs. |
|
I think we should figure out the underlying problem and not extend the test time. Since reboot time per node is 50% more |
|
(As per slack, we're doing some investigation on this before deciding how to resolve) |
|
/hold |
|
Sadly we never summarized this bug anywhere public on the MCO repo AFAICS. openshift/cluster-network-operator#859 merged which should help - let's try to verify that. |
|
OK another status update on this; I kept being confused how it wasn't helping but it turns out CI (and nightly) payload generation has been broken until just now, so our CI runs were still using an old cluster-network-operator. Let's keep an eye out now to see if openshift/cluster-network-operator#859 actually improves things! |
fixes e2e-gcp-op test failing in ci due to timeout
743fe9b to
dcf26d4
Compare
|
nightly images are green now, openshift/cluster-network-operator#859 should be included in recent payload in ci run now. For sanity check, re triggered test here. |
|
We should probably consider this to start clearing out the PR backlog. |
e2e-aws and e2e-aws-serial are also both currently broken and being worked on. we shouldnt be overriding all of those (3+) tests just to get prs in imo. for gcp-op: openshift/cluster-dns-operator#213 (comment) (and #2229) need to be merged but can't bc of the above e2e-aws issues so it seems like first the e2e-aws tests needs to get fixed bc that's blocking the dns pr which once that merges unblocks our ci. |
|
It's all interlinked though. We now have so many PRs outstanding that we're waiting on AWS "leases" in some, e.g. For example we could combine this PR with #2229 Basically I think we should try to do something other than be blocked. |
|
Right but a bigger problem mentioned in slack is that ci and nightly payload acceptance is also broken due to this... I'm going to go and try to create more urgency on the aws bc we shouldnt be merging in these condiation but it should also be more important than it is rn. I don't think overriding required tests across the board is the right choice. we can land 2229 but we will still be blocked on other required tests. if there are maybe problems with payloads and across ocp.... aws really needs to get fixed bc green means nothing. |
|
@sinnykumari: The following tests failed, say
Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
|
As an update: this comes down to https://bugzilla.redhat.com/show_bug.cgi?id=1897604 (which is also blocking the dns fix: openshift/cluster-dns-operator#213) and blocking all merges across ocp. There's a new channel (incident-kcm..) now where people having started working on it. Instead of trying to hack around and merge this, which won't help bc other required tests are blocking on all repos incl this one, we are waiting for the above bz to be resolved. |
|
Closing this PR since actual slowness issue has been fixed with openshift/cluster-dns-operator#213 |
|
@sinnykumari: Closed this PR. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
fixes e2e-gcp-op test failing in ci due to timeout