Skip to content

Bug 1875181: [4.4] crio: bump to v1.17.5#2056

Closed
haircommander wants to merge 2 commits intoopenshift:release-4.4from
haircommander:bump-crio-version
Closed

Bug 1875181: [4.4] crio: bump to v1.17.5#2056
haircommander wants to merge 2 commits intoopenshift:release-4.4from
haircommander:bump-crio-version

Conversation

@haircommander
Copy link
Copy Markdown
Member

- What I did
when the crio version falls out of sync with the verison in openshift, we end up rendering the incorect ignition config

This fixes a problem where, upon creating a ctrcfg, we lose our default_env value

Signed-off-by: Peter Hunt pehunt@redhat.com

- How to verify it
run crio config on a node
apply any ctrcfg
run crio config again
check the diff, noting the changes didn't strictly include that which came in the ctrcfg

note: this would be a good e2e test to add, however, running crio config on a node has proven to be challenging. One has to chroot, or else there will be ldd issues. However, a combination of the way the go os/exec package works, along with the need to run chroot/ host /bin/bash -c 'crio config has caused it to just fail for me. not sure why yet.
- Description for the changelog

@openshift-ci-robot
Copy link
Copy Markdown
Contributor

@haircommander: No Bugzilla bug is referenced in the title of this pull request.
To reference a bug, add 'Bug XXX:' to the title of this pull request and request another bug refresh with /bugzilla refresh.

Details

In response to this:

[4.4] crio: bump to v1.17.5

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@haircommander haircommander changed the title [4.4] crio: bump to v1.17.5 Bug 1875181: [4.4] crio: bump to v1.17.5 Sep 2, 2020
@openshift-ci-robot
Copy link
Copy Markdown
Contributor

@haircommander: This pull request references Bugzilla bug 1875181, which is invalid:

  • expected the bug to target the "4.4.z" release, but it targets "4.5.0" instead
  • expected dependent Bugzilla bug 1875179 to be in one of the following states: VERIFIED, RELEASE_PENDING, CLOSED (ERRATA), but it is MODIFIED instead
  • expected dependent Bugzilla bug 1875180 to be in one of the following states: VERIFIED, RELEASE_PENDING, CLOSED (ERRATA), but it is MODIFIED instead
  • expected dependent Bugzilla bug 1875179 to target a release in 4.5.0, 4.5.z, but it targets "---" instead

Comment /bugzilla refresh to re-evaluate validity if changes to the Bugzilla bug are made, or edit the title of this pull request to link to a different bug.

Details

In response to this:

Bug 1875181: [4.4] crio: bump to v1.17.5

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci-robot openshift-ci-robot added bugzilla/severity-unspecified Referenced Bugzilla bug's severity is unspecified for the PR. bugzilla/invalid-bug Indicates that a referenced Bugzilla bug is invalid for the branch this PR is targeting. labels Sep 2, 2020
@haircommander
Copy link
Copy Markdown
Member Author

@runcom @kikisdeliveryservice @yuqi-zhang I believe I've done the correct vendoring. I ran make go-deps in a go 1.12 container, but it took along more changes than I expected. please advise if I should do something differently

@cgwalters
Copy link
Copy Markdown
Member

Hmm this is a nontrivial PR targeted for 4.4. First: is this already fixed in 4.5? Does this really have the severity necessary to ship to 4.4?

Are there any easy workarounds a 4.4 user could do instead?

Copy link
Copy Markdown
Member

@cgwalters cgwalters left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also long term I think we need to better solve the fundamental structural problem that we have crio config in two places:

  • Embedded in RHCOS as defaults
  • Vendored in the MCO

And what would be much better IMO is to teach the MCO's rendering to use the drop-ins to only write values that it changed.

@haircommander
Copy link
Copy Markdown
Member Author

Hmm this is a nontrivial PR targeted for 4.4. First: is this already fixed in 4.5? Does this really have the severity necessary to ship to 4.4?

Yes, it's not an issue in 4.5 afaict, as 4.5 has a version of cri-o vendored that has this field

Are there any easy workarounds a 4.4 user could do instead?
either don't use a ctrcfg, or allow NSS_SDB_USE_CACHE=yes (the default) which caused issues in the past (I cannot remember what, though, cc @mrunalp )

@runcom
Copy link
Copy Markdown
Member

runcom commented Sep 3, 2020

And what would be much better IMO is to teach the MCO's rendering to use the drop-ins to only write values that it changed.

we recently started doing that if I'm not entirely mistaken, cc @umohnani8

@umohnani8
Copy link
Copy Markdown
Contributor

Yup, in 4.5 we use drop-in files at crio.conf.d that only write the values have been changed. The drop-in files definitely avoid losing any information as everything else falls back to the cri-o defaults.

@haircommander
Copy link
Copy Markdown
Member Author

/retest

1 similar comment
@haircommander
Copy link
Copy Markdown
Member Author

/retest

@haircommander
Copy link
Copy Markdown
Member Author

/bugzilla refresh

@openshift-ci-robot
Copy link
Copy Markdown
Contributor

@haircommander: This pull request references Bugzilla bug 1875181, which is invalid:

  • expected dependent Bugzilla bug 1875179 to target a release in 4.5.0, 4.5.z, but it targets "4.6.0" instead

Comment /bugzilla refresh to re-evaluate validity if changes to the Bugzilla bug are made, or edit the title of this pull request to link to a different bug.

Details

In response to this:

/bugzilla refresh

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci-robot openshift-ci-robot added bugzilla/severity-low Referenced Bugzilla bug's severity is low for the branch this PR is targeting. and removed bugzilla/severity-unspecified Referenced Bugzilla bug's severity is unspecified for the PR. labels Sep 11, 2020
@haircommander
Copy link
Copy Markdown
Member Author

/bugzilla refresh

@openshift-ci-robot openshift-ci-robot added the bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. label Sep 11, 2020
@openshift-ci-robot
Copy link
Copy Markdown
Contributor

@haircommander: This pull request references Bugzilla bug 1875181, which is valid.

6 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target release (4.4.z) matches configured target release for branch (4.4.z)
  • bug is in the state POST, which is one of the valid states (NEW, ASSIGNED, ON_DEV, POST, POST)
  • dependent bug Bugzilla bug 1875180 is in the state VERIFIED, which is one of the valid states (VERIFIED, RELEASE_PENDING, CLOSED (ERRATA))
  • dependent Bugzilla bug 1875180 targets the "4.5.z" release, which is one of the valid target releases: 4.5.0, 4.5.z
  • bug has dependents
Details

In response to this:

/bugzilla refresh

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci-robot openshift-ci-robot removed the bugzilla/invalid-bug Indicates that a referenced Bugzilla bug is invalid for the branch this PR is targeting. label Sep 11, 2020
@haircommander
Copy link
Copy Markdown
Member Author

@cgwalters do you still want changes here?

@runcom PTAL

@haircommander
Copy link
Copy Markdown
Member Author

@umohnani8
Copy link
Copy Markdown
Contributor

LGTM

@kikisdeliveryservice
Copy link
Copy Markdown
Contributor

LGTM but deferring to @cgwalters

@kikisdeliveryservice
Copy link
Copy Markdown
Contributor

Ping @runcom @cgwalters

@cgwalters
Copy link
Copy Markdown
Member

Tentative
/approve
but I'd hope most users have upgraded instead to 4.5 at least.

I think someone else on the node team can add the /lgtm right?

/retest
Since we're now on agnostic.

OK and it does look like there's at least an e2e that covers the container runtime configs in 4.4: https://github.com/openshift/machine-config-operator/blob/release-4.4/test/e2e/ctrcfg_test.go
so that's good.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

2 similar comments
@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@sdodson
Copy link
Copy Markdown
Member

sdodson commented Nov 8, 2020

/hold
@haircommander ptal at these failing tests

@openshift-ci-robot openshift-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Nov 8, 2020
@sdodson sdodson removed the cherry-pick-approved Indicates a cherry-pick PR into a release branch has been approved by the release branch manager. label Nov 9, 2020
@sdodson
Copy link
Copy Markdown
Member

sdodson commented Nov 9, 2020

Clearing stale cherry-pick-approved. I wish there were a way to stop retests without leaving the hold on here but I don't know of such a mechanism. Next Patch Manager please consult with the author to see what can be done to get this to merge successfully next week.

@haircommander
Copy link
Copy Markdown
Member Author

/retest

when the crio version falls out of sync with the verison in openshift, we end up rendering the incorect ignition config

This fixes a problem where, upon creating a ctrcfg, we lose our default_env value

Signed-off-by: Peter Hunt <pehunt@redhat.com>
@openshift-ci-robot openshift-ci-robot removed the lgtm Indicates that a PR is ready to be merged. label Nov 11, 2020
@openshift-ci-robot
Copy link
Copy Markdown
Contributor

New changes are detected. LGTM label has been removed.

@haircommander
Copy link
Copy Markdown
Member Author

let's see if a rebase fixes it!

@nee1esh
Copy link
Copy Markdown

nee1esh commented Nov 12, 2020

/retest

Waiting for the previous MC isn't right here; we're deleting
a config and so what we want to wait for is the previous pool
target.

Drop the racy `sleep()`.
@haircommander
Copy link
Copy Markdown
Member Author

also carrying #2229

@sdodson
Copy link
Copy Markdown
Member

sdodson commented Nov 20, 2020

/retest

3 similar comments
@haircommander
Copy link
Copy Markdown
Member Author

/retest

@haircommander
Copy link
Copy Markdown
Member Author

/retest

@haircommander
Copy link
Copy Markdown
Member Author

/retest

@openshift-merge-robot
Copy link
Copy Markdown
Contributor

@haircommander: The following test failed, say /retest to rerun all failed tests:

Test name Commit Details Rerun command
ci/prow/e2e-gcp-op 3c8e5aa link /test e2e-gcp-op

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@lyman9966
Copy link
Copy Markdown

/retest

@kikisdeliveryservice
Copy link
Copy Markdown
Contributor

Are we still trying to get this into 4.4?
 /test e2e-gcp-op

@haircommander
Copy link
Copy Markdown
Member Author

yes but I have no idea why the test is failing and haven't had the capacity to check
/retest

@openshift-ci
Copy link
Copy Markdown
Contributor

openshift-ci Bot commented Feb 4, 2021

@haircommander: The following tests failed, say /retest to rerun all failed tests:

Test name Commit Details Rerun command
ci/prow/e2e-gcp-op 3c8e5aa link /test e2e-gcp-op
ci/prow/e2e-aws-workers-rhel7 3c8e5aa link /test e2e-aws-workers-rhel7

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@openshift-bot
Copy link
Copy Markdown
Contributor

Issues go stale after 90d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle stale

@openshift-ci-robot openshift-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 5, 2021
@haircommander
Copy link
Copy Markdown
Member Author

4.4 is EOL, good bye PR

@openshift-ci-robot
Copy link
Copy Markdown
Contributor

@haircommander: This pull request references Bugzilla bug 1875181. The bug has been updated to no longer refer to the pull request using the external bug tracker.

Details

In response to this:

Bug 1875181: [4.4] crio: bump to v1.17.5

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. bugzilla/severity-medium Referenced Bugzilla bug's severity is medium for the branch this PR is targeting. bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale.

Projects

None yet

Development

Successfully merging this pull request may close these issues.