OCPBUGS-65938: increase IAM waiter timeout and remove custom delay options#10124
Conversation
…tions The AWS IAM role and instance profile waiters had a 2 minute timeout with custom delay options (1-5 seconds). This timeout was insufficient in CI environment where IAM calls can be throttled. Increased the timeout to 15 minutes and removed the custom delay options to use the AWS SDK defaults (min 1s and max 120s).
|
@tthvo: This pull request references Jira Issue OCPBUGS-65938, which is invalid:
Comment The bug has been updated to refer to the pull request using the external bug tracker. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
/cc @barbacbd |
|
/jira refresh |
|
@tthvo: This pull request references Jira Issue OCPBUGS-65938, which is valid. The bug has been moved to the POST state. 3 validation(s) were run on this bug
DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
/test e2e-aws-ovn-public-subnets e2e-aws-ovn-public-ipv4-pool e2e-aws-ovn-custom-iam-profile e2e-aws-overlay-mtu-ovn-1200 |
|
We can see the same issue would hit instance profile provisioning in openshift/machine-api-provider-aws#155: ci/prow/e2e-aws-serial-1of2. Thus, this fix extends to both IAM role and instanceProfile handler. |
|
/approve |
|
@patrickdillon: This PR has been marked as verified by DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: patrickdillon The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
54f4b77
into
openshift:main
|
@tthvo: Jira Issue Verification Checks: Jira Issue OCPBUGS-65938 Jira Issue OCPBUGS-65938 has been moved to the MODIFIED state and will move to the VERIFIED state when the change is available in an accepted nightly payload. 🕓 DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
@tthvo: The following test failed, say
Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
The AWS IAM role and instance profile waiters had a 2 minute timeout with custom delay options (1-5 seconds). This timeout was insufficient in CI environment where IAM calls can be throttled.
Increased the timeout to 15 minutes and removed the custom delay options to use the AWS SDK defaults (min 1s and max 120s). This allows the installer to "rest" and avoid aggressive retry.
See a sample failed run: https://prow.ci.openshift.org/view/gs/test-platform-results/logs/periodic-ci-openshift-[…]perator-release-4.21-periodics-e2e-aws/1992830755360739328