Skip to content

enable kubelet server cert bootstrapping#187

Merged
openshift-merge-robot merged 2 commits intoopenshift:masterfrom
sjenning:kubelet-server-cert-rotation
Dec 3, 2018
Merged

enable kubelet server cert bootstrapping#187
openshift-merge-robot merged 2 commits intoopenshift:masterfrom
sjenning:kubelet-server-cert-rotation

Conversation

@sjenning
Copy link
Copy Markdown
Contributor

@smarterclayton @ericavonb @mrogers950 @rphillips

wait for openshift/cluster-machine-approver#3 to merge and ensure that machine-approver is running

@openshift-ci-robot openshift-ci-robot added the size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. label Nov 20, 2018
@openshift-ci-robot openshift-ci-robot added size/S Denotes a PR that changes 10-29 lines, ignoring generated files. and removed size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Nov 20, 2018
@sjenning
Copy link
Copy Markdown
Contributor Author

/hold

@openshift-ci-robot openshift-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Nov 20, 2018
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also add RotateKubeletClientCertificate, and let's make sure rotate certificates is set here too.

3.10 on always rotated both, but at a slower interval. We want 4.0 to rotate both, more quickly (days).

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, this is set for workers, but not masters. I didn't want to make that change as part of this until I understood why this was done.

@sjenning
Copy link
Copy Markdown
Contributor Author

Test cluster is working with the approver starting and approving new machines as expected, but I think I need to move the approver to runlevel 0. At runlevel 1 is starts late and kubelets will not accept incoming connections until the server cert is installed. openshift/cluster-machine-approver#4

@sjenning
Copy link
Copy Markdown
Contributor Author

/retest

@smarterclayton
Copy link
Copy Markdown
Contributor

smarterclayton commented Nov 21, 2018 via email

@sjenning
Copy link
Copy Markdown
Contributor Author

i think openshift/cluster-machine-approver#4 may have not gone in before the previous retest
/retest

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

EOL got stripped on this file

@sjenning sjenning force-pushed the kubelet-server-cert-rotation branch 2 times, most recently from da1c326 to f9fe715 Compare November 26, 2018 17:36
@sjenning
Copy link
Copy Markdown
Contributor Author

# crictl logs f2fd6df8c27e0
2018/11/26 18:16:29 socat[4847] E connect(5, AF=1 "/var/run/openshift-sdn/cni-server.sock", 40): No such file or directory
User "sa" set.
Context "kubelet" modified.
I1126 18:16:29.545148    4826 start_network.go:200] Reading node configuration from /tmp/sdn-config.yaml
Invalid NodeConfig /tmp/sdn-config.yaml
  servingInfo.certFile: Invalid value: "/etc/kubernetes/pki/kubelet.crt": could not read file: stat /etc/kubernetes/pki/kubelet.crt: no such file or directory
  servingInfo.keyFile: Invalid value: "/etc/kubernetes/pki/kubelet.key": could not read file: stat /etc/kubernetes/pki/kubelet.key: no such file or directory

# crictl inspect f2fd6df8c27e0
...
      {
        "containerPath": "/etc/kubernetes/pki",
        "hostPath": "/var/lib/kubelet/pki",
        "propagation": "PROPAGATION_PRIVATE",
        "readonly": true,
        "selinuxRelabel": false
      },
...

The sdn pod is not starting and is blocking the cluster control plane from starting. It is not clear why the sdn pod needs the kubelet server key/cert. Seem to be a remnant of using the NodeConfig code in the sdn process. Discussing with @squeed and @dcbw.

@sjenning
Copy link
Copy Markdown
Contributor Author

needs openshift/origin#21551 plus a PR to the cluster-network-operator to not set the kubelet serving info in the generated NodeConfig.

@sjenning
Copy link
Copy Markdown
Contributor Author

next in line openshift/cluster-network-operator#43

@sjenning
Copy link
Copy Markdown
Contributor Author

/retest

2 similar comments
@sjenning
Copy link
Copy Markdown
Contributor Author

/retest

@sjenning
Copy link
Copy Markdown
Contributor Author

/retest

@smarterclayton
Copy link
Copy Markdown
Contributor

Please rebase - this is critical for getting metrics from nodes.

@openshift-bot
Copy link
Copy Markdown
Contributor

@sjenning: PR needs rebase.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-bot openshift-bot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Dec 3, 2018
@sjenning
Copy link
Copy Markdown
Contributor Author

sjenning commented Dec 3, 2018

/hold cancel

@openshift-ci-robot openshift-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Dec 3, 2018
@sjenning sjenning force-pushed the kubelet-server-cert-rotation branch from f9fe715 to 6b49762 Compare December 3, 2018 16:23
@openshift-ci-robot openshift-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Dec 3, 2018
@sjenning
Copy link
Copy Markdown
Contributor Author

sjenning commented Dec 3, 2018

/retest

1 similar comment
@sjenning
Copy link
Copy Markdown
Contributor Author

sjenning commented Dec 3, 2018

/retest

@smarterclayton
Copy link
Copy Markdown
Contributor

/lgtm

@openshift-ci-robot openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Dec 3, 2018
@openshift-ci-robot
Copy link
Copy Markdown
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: sjenning, smarterclayton

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci-robot openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Dec 3, 2018
@openshift-merge-robot openshift-merge-robot merged commit 84bf7d6 into openshift:master Dec 3, 2018
osherdp pushed a commit to osherdp/machine-config-operator that referenced this pull request Apr 13, 2021
DEVEXP-418: fetch arch from GOARCH, allow for z/ppc
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. lgtm Indicates that a PR is ready to be merged. size/S Denotes a PR that changes 10-29 lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants