Skip to content

Conversation

@kimorris27
Copy link
Contributor

Which issue this PR addresses:

https://issues.redhat.com/browse/ARO-23318

What this PR does / why we need it:

There have been more and more of these Hive cluster install failures in the queue recently for some reason. The issue is always customer-inflicted by an Azure Policy that modifies a CSP cluster's storage account to disable key based authentication, so we should be returning the failure cause directly to the customer rather than responding to alerts and sending AzComms.

To get this going in prod, there are two more things I'll need to follow up with once this PR merges:

  • Update the Hive release pipeline with the new install log regex; it will get released during the next Hive release
  • Update our IcM automation's "non_alerting_reasons" to include this new reason

Test plan for issue:

Pushed additional install log regexes to eastus shared dev Hive and tested with a local dev cluster where I set up a policy just like a customer would; my local dev RP returned the KeyBasedAuthenticationNotPermitted issue back to me.

Is there any documentation that needs to be updated for this PR?

No

How do you know this will function as expected in production?

Tested thoroughly in local dev, which is close enough to prod in this case

@kimorris27 kimorris27 merged commit d5000d7 into master Jan 7, 2026
22 checks passed
@kimorris27 kimorris27 deleted the kimorris27/ARO-23318-propagate-hive-error-to-cx branch January 7, 2026 14:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

chainsaw Pull requests or issues owned by Team Chainsaw

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants