Skip to content

Bug 1854249: [On-Prem] Workaround issues with keepalived v2.0.10#1909

Merged
openshift-merge-robot merged 1 commit intoopenshift:masterfrom
mandre:onprem-keepalived
Jul 8, 2020
Merged

Bug 1854249: [On-Prem] Workaround issues with keepalived v2.0.10#1909
openshift-merge-robot merged 1 commit intoopenshift:masterfrom
mandre:onprem-keepalived

Conversation

@mandre
Copy link
Copy Markdown
Member

@mandre mandre commented Jul 7, 2020

- What I did

This commit extracts the chk_ocp script into a separate file and wraps
the check scripts with the timeout command to workaround
acassen/keepalived#1364.

This PR supersedes #1908.

Co-Authored-By: Yossi Boaron yboaron@redhat.com
Co-Authored-By: Ben Nemec bnemec@redhat.com

- How to verify it

Installer can deploy OCP on on-prem platform again.

- Description for the changelog

Wrap keepalived checks with timeout to workaround known issue with keepalived v2.0.10.

@openshift-ci-robot openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jul 7, 2020
@mandre
Copy link
Copy Markdown
Member Author

mandre commented Jul 7, 2020

/test e2e-openstack

@mandre mandre marked this pull request as draft July 7, 2020 16:40
@openshift-ci-robot openshift-ci-robot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jul 7, 2020
@mandre mandre force-pushed the onprem-keepalived branch from 3f4662a to c3017c3 Compare July 7, 2020 18:12
@mandre
Copy link
Copy Markdown
Member Author

mandre commented Jul 7, 2020

/test e2e-openstack

1 similar comment
@mandre
Copy link
Copy Markdown
Member Author

mandre commented Jul 7, 2020

/test e2e-openstack

@Fedosin
Copy link
Copy Markdown
Contributor

Fedosin commented Jul 7, 2020

Am I right that the command is the same, but you just created a one-liner script instead of calling it directly?

@mandre
Copy link
Copy Markdown
Member Author

mandre commented Jul 8, 2020

/test e2e-openstack

@mandre mandre force-pushed the onprem-keepalived branch from c3017c3 to b82bf18 Compare July 8, 2020 08:24
@mandre
Copy link
Copy Markdown
Member Author

mandre commented Jul 8, 2020

/test e2e-openstack

This commit extracts the chk_ocp script into a separate file and wraps
the check scripts with the timeout command to workaround
acassen/keepalived#1364.
@mandre mandre force-pushed the onprem-keepalived branch from b82bf18 to 30d90ab Compare July 8, 2020 09:51
@openshift-ci-robot openshift-ci-robot removed the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jul 8, 2020
@mandre mandre changed the title [DNM] Test keepalived workaround for openstack Bug 1854249: [On-Prem] Workaround issues with keepalived v2.0.10 Jul 8, 2020
@openshift-ci-robot openshift-ci-robot added the bugzilla/severity-urgent Referenced Bugzilla bug's severity is urgent for the branch this PR is targeting. label Jul 8, 2020
@openshift-ci-robot
Copy link
Copy Markdown
Contributor

@mandre: This pull request references Bugzilla bug 1854249, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target release (4.6.0) matches configured target release for branch (4.6.0)
  • bug is in the state NEW, which is one of the valid states (NEW, ASSIGNED, ON_DEV, POST, POST)
Details

In response to this:

Bug 1854249: [On-Prem] Workaround issues with keepalived v2.0.10

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci-robot openshift-ci-robot added the bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. label Jul 8, 2020
@mandre mandre marked this pull request as ready for review July 8, 2020 09:52
@openshift-ci-robot openshift-ci-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jul 8, 2020
@mandre
Copy link
Copy Markdown
Member Author

mandre commented Jul 8, 2020

/test e2e-openstack
/test e2e-vsphere
/test e2e-ovirt

@openshift-ci-robot
Copy link
Copy Markdown
Contributor

@mandre: This pull request references Bugzilla bug 1854249, which is valid.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target release (4.6.0) matches configured target release for branch (4.6.0)
  • bug is in the state POST, which is one of the valid states (NEW, ASSIGNED, ON_DEV, POST, POST)
Details

In response to this:

Bug 1854249: [On-Prem] Workaround issues with keepalived v2.0.10

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

1 similar comment
@openshift-ci-robot
Copy link
Copy Markdown
Contributor

@mandre: This pull request references Bugzilla bug 1854249, which is valid.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target release (4.6.0) matches configured target release for branch (4.6.0)
  • bug is in the state POST, which is one of the valid states (NEW, ASSIGNED, ON_DEV, POST, POST)
Details

In response to this:

Bug 1854249: [On-Prem] Workaround issues with keepalived v2.0.10

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@celebdor
Copy link
Copy Markdown
Contributor

celebdor commented Jul 8, 2020

/lgtm

@openshift-ci-robot openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Jul 8, 2020
@mandre
Copy link
Copy Markdown
Member Author

mandre commented Jul 8, 2020

/retest

@sinnykumari
Copy link
Copy Markdown
Contributor

/approve

@openshift-ci-robot openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jul 8, 2020
@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

9 similar comments
@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@kikisdeliveryservice
Copy link
Copy Markdown
Contributor

Q: do we expect vsphere job to pass with this PR @mandre ?

@cybertron
Copy link
Copy Markdown
Member

/lgtm

This worked for me locally. I think the flapping of the VIP will be helped by #1893 which adds rise and fall values to the check so they won't immediately cause failover because of a single timeout. This should get us back to where we were though.

I think @patrickdillon is the vsphere contact, so maybe he can comment on the status of the vsphere job.

@openshift-ci-robot
Copy link
Copy Markdown
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: celebdor, cybertron, mandre, sinnykumari

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

1 similar comment
@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-ci-robot
Copy link
Copy Markdown
Contributor

openshift-ci-robot commented Jul 8, 2020

@mandre: The following test failed, say /retest to rerun all failed tests:

Test name Commit Details Rerun command
ci/prow/e2e-vsphere 30d90ab link /test e2e-vsphere

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@openshift-merge-robot openshift-merge-robot merged commit f30b2a3 into openshift:master Jul 8, 2020
@openshift-ci-robot
Copy link
Copy Markdown
Contributor

@mandre: All pull requests linked via external trackers have merged: openshift/machine-config-operator#1909. Bugzilla bug 1854249 has been moved to the MODIFIED state.

Details

In response to this:

Bug 1854249: [On-Prem] Workaround issues with keepalived v2.0.10

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. bugzilla/severity-urgent Referenced Bugzilla bug's severity is urgent for the branch this PR is targeting. bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. lgtm Indicates that a PR is ready to be merged.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

9 participants