Bug 1731263: Reenable basic pod preemption e2es#23758
Bug 1731263: Reenable basic pod preemption e2es#23758damemi wants to merge 1 commit intoopenshift:masterfrom
Conversation
|
@damemi: This pull request references Bugzilla bug 1731263, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
@deads2k you mentioned that the |
183cd6d to
abb0c6b
Compare
|
/retest |
|
/hold |
@damemi how about checking what the newer version will look like? I mean that from k8s 1.16 introduced in the rebase and look at it only after rebase lands. I'm moving the BZ to 4.3 |
|
@soltysh This test is actually ahead of 1.16 right now (origin already has the changes in kubernetes/kubernetes#82350 which hasn't merged) |
df5de77 to
683c7b5
Compare
|
/retest |
|
@damemi what's the call here? |
|
@soltysh I need to revisit this test entirely, it's not flaking at all upstream but when I submitted this PR to remove |
|
@damemi what's the current state of this? |
683c7b5 to
78cebd2
Compare
|
It seems that this test was removed from Though, I can't find the test in 4.2 testgrid or 4.3 but I do see preemption tests in, for example, 4.1 blocking: https://testgrid.k8s.io/redhat-openshift-ocp-release-4.1-blocking#release-openshift-origin-installer-e2e-aws-serial-4.1. Maybe I'm just not finding it, so I'll switch the BZ back to modified and have QA take a look as well |
|
/reopen |
|
@damemi: Reopened this PR. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
@damemi: This pull request references Bugzilla bug 1731263, which is valid. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/retest |
|
/retest |
|
So I noticed that, with our changes to this test, the pod wasn't always being scheduled onto the node we expected (for example, if there is a low priority pod, we were assuming that the preemptor pod would be placed on the same node as the low priority pod, which wasn't always the case, and so the test failed with no pods preempted and the preemptor pod running). Now I've made an update that ensures that the preemptor pod will land on the appropriate node. However, it seems that preemption just isn't running, because now the preemptor pod is attempting to schedule on the correct node, but failing due to lack of cpu: @ravisantoshgudimetla, do we need to enable podpreemption in our e2es somehow? Failing to schedule due to lack of cpu is exactly what we want to test Update: actually this is interesting, looking at the usages as the victim pods are scheduled: it appears that the low priority pod is requesting However, I am still curious about my first question regarding if we need to do anything to enable preemption here? |
|
/retest |
|
/retest |
|
Issues go stale after 90d of inactivity. Mark the issue as fresh by commenting If this issue is safe to close now please do so with /lifecycle stale |
c45b9d9 to
f2cceac
Compare
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: damemi The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
|
/remove-lifecycle stale |
f2cceac to
222af67
Compare
|
/retest |
1 similar comment
|
/retest |
|
@damemi: The following tests failed, say
Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
|
Irrelevant. |
|
@soltysh: Closed this PR. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
This test was marked as flaky but has had a number of fixes applied to it (pr #23645 and #23719) tht have adjusted the node utilization calculations and structure of the test to hopefully make it more stable. In order to address the comments in the BZ for this test's flakiness (https://bugzilla.redhat.com/show_bug.cgi?id=1731263#c9) we should remove the [flaky] tag to re-enable the tests.