Bug 1847185: fix: GetLabelsForVolume panic issue for azure disk PV#25121
Bug 1847185: fix: GetLabelsForVolume panic issue for azure disk PV#25121mfojtik merged 1 commit intoopenshift:masterfrom
Conversation
|
@enxebre: This pull request references Bugzilla bug 1847185, which is invalid:
Comment DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/hold |
|
/bugzilla refresh |
|
@enxebre: This pull request references Bugzilla bug 1847185, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker. 3 validation(s) were run on this bug
DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
@enxebre: This pull request references Bugzilla bug 1847185, which is valid. 3 validation(s) were run on this bug
DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/retest |
|
/cherry-pick release-4.5 |
|
@enxebre: once the present PR merges, I will cherry-pick it on top of release-4.5 in a new PR and assign it to you. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/retest |
|
/hold cancel |
|
cc @sttts @mfojtik @jim-minter PTAL |
|
@enxebre I'm encouraging upstream to fix this more comprehensively if it's possible. Might be worth waiting a little to see what happens? |
|
/hold |
|
/hold cancel Then if ARO still needs a way to instantiate the azure client for the api server can be discussed separately. I think getting back to https://github.com/Azure/ARO-RP/pull/487/files#diff-f23154e33f71b30de1fae50fbf2b1dadL56-L60 might be an option. FWIW cloud providers are planned to go out of tree in kubernetes 1.21 / OCP 4.8. |
|
/retest |
|
/retest |
@enxebre I agree. We'd like to get this into 4.3 urgently. |
|
@jim-minter as per @mjudeikis this has been validated to prevent ARO from panicking. |
|
/lgtm |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: enxebre, mfojtik The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
/retest Please review the full test history for this PR and help us cut down flakes. |
2 similar comments
|
/retest Please review the full test history for this PR and help us cut down flakes. |
|
/retest Please review the full test history for this PR and help us cut down flakes. |
|
/retest |
|
merging with button, the queue is hosed and this is critical fix needed in ARO 4.5. |
|
@enxebre: All pull requests linked via external trackers have merged: openshift/origin#25121. Bugzilla bug 1847185 has been moved to the MODIFIED state. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
@enxebre: new pull request created: #25168 DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
@eparis @jwforres @stevekuznetsov 3 days of retesting this simple fix that is actually causing 40% of our ARO cluster to be broken... i would like to know how we can mitigate this in future. |
|
@enxebre: The following tests failed, say
Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
This prevents
GetAzureDiskLabelsfrom panicking whenc.DisksClientis nil.This panicking makes the API server to crash loop for some ARO clusters as elaborated in https://bugzilla.redhat.com/show_bug.cgi?id=1847185
kubernetes/kubernetes#92166
kubernetes/kubernetes#92167