Correct suite retry units, delete pxe artifacts by hickeng · Pull Request #8074 · vmware/vic

hickeng · 2018-06-19T16:52:37Z

Retry attempts for suite provisioning were incorrectly being interpreted as
seconds instead of attempts.

The new vSAN testbed spec was using a different pxe folder name than expected
in the folder cleanup logic (pxeinstall instead of pxe) leading to disk quota issues.

Add Host To Distributed Switch should retry within the keyword or not at all
for consistent error handling and behaviour.

Adds parameterization for number of parallel jobs and use of ops-user.

Towards #8067

lmalvins · 2018-06-19T19:09:57Z

tests/resources/Nimbus-Util.robot

    Open Connection  %{NIMBUS_GW}
    Wait Until Keyword Succeeds  2 min  30 sec  Login  %{NIMBUS_USER}  %{NIMBUS_PASSWORD}
-    Run Keyword If  ${deletePXE}  Execute Command  ${NIMBUS_LOCATION} rm -rf public_html/pxe/*
+    Run Keyword If  ${deletePXE}  Execute Command  ${NIMBUS_LOCATION} rm -rf public_html/pxe/* public_html/pxeinstall/*


LGTM, we do the same with the vic-ui e2e tests and pxe folder

lmalvins

LGTM

Checks an error return from a nimbus provisioning call instead of trying to infer status solely from console output. Reduce sleep in inner retry loops during nimbus provisioning to 1m instead of 5m. This should really be parameterized or removed completely in favour of the outer retry logic unless there's significant improvement in success probability from retrying at that level.

Previously there were retries of different durations wrapped around the outside of the Add Host To Distributed Switch keyword. If it may need retrying then the operation should do so to avoid this cluster, and those cases where a retry was not present.

This commit attempts to improve predictability by normalizing error checking, retry approaches, pxe boot directory, and parameterizing previously hard-coded build config. The latter is included because it allows the caller to make reasonable estimates about how the build will run rather than having to dig through test code. Previously there were retries of different durations wrapped around the outside of the Add Host To Distributed Switch keyword. If it may need retrying then the operation should do so to avoid this cluster, and those cases where a retry was not present. Reduce sleep in inner retry loops to fail faster and adds a check for exit code in addition to output parsing for faster fail. The inner loops should be parameterized or removed completely in favour of the outer retry logic unless there's significant improvement in success probability from retrying at that level however this commit does not embark on that work. (cherry picked from commit 1387ab1)

…) (vmware#8151) This commit attempts to improve predictability by normalizing error checking, retry approaches, pxe boot directory, and parameterizing previously hard-coded build config. The latter is included because it allows the caller to make reasonable estimates about how the build will run rather than having to dig through test code. Previously there were retries of different durations wrapped around the outside of the Add Host To Distributed Switch keyword. If it may need retrying then the operation should do so to avoid this cluster, and those cases where a retry was not present. Reduce sleep in inner retry loops to fail faster and adds a check for exit code in addition to output parsing for faster fail. The inner loops should be parameterized or removed completely in favour of the outer retry logic unless there's significant improvement in success probability from retrying at that level however this commit does not embark on that work. (cherry picked from commit 1387ab1)

…8151) This commit attempts to improve predictability by normalizing error checking, retry approaches, pxe boot directory, and parameterizing previously hard-coded build config. The latter is included because it allows the caller to make reasonable estimates about how the build will run rather than having to dig through test code. Previously there were retries of different durations wrapped around the outside of the Add Host To Distributed Switch keyword. If it may need retrying then the operation should do so to avoid this cluster, and those cases where a retry was not present. Reduce sleep in inner retry loops to fail faster and adds a check for exit code in addition to output parsing for faster fail. The inner loops should be parameterized or removed completely in favour of the outer retry logic unless there's significant improvement in success probability from retrying at that level however this commit does not embark on that work. (cherry picked from commit 1387ab1)

hickeng requested a review from a team as a code owner June 19, 2018 16:52

vmwclabot added the cla-not-required label Jun 19, 2018

lmalvins reviewed Jun 19, 2018

View reviewed changes

lmalvins approved these changes Jun 19, 2018

View reviewed changes

hickeng added 2 commits June 19, 2018 14:09

Correct suite retry units, delete pxe artifacts

f3bfd0b

hickeng force-pushed the 8067b branch from b76e68f to ccf53dc Compare June 19, 2018 21:10

Fix compile errors in concurrent test tool

961e67c

hickeng force-pushed the 8067b branch from ccf53dc to 1a341ec Compare June 20, 2018 00:19

Correct injecton of retry attempts/delay

9bc6546

hickeng force-pushed the 8067b branch 4 times, most recently from 81bf9cb to db2cbe3 Compare June 20, 2018 16:53

hickeng force-pushed the 8067b branch from db2cbe3 to f461a4e Compare June 20, 2018 20:11

zjs approved these changes Jun 21, 2018

View reviewed changes

hickeng merged commit 1387ab1 into vmware:master Jun 21, 2018

zjs mentioned this pull request Jul 19, 2018

Cherry-pick: Changes to allow us to run nightly tests for 1.4.3 #8151

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Correct suite retry units, delete pxe artifacts#8074

Correct suite retry units, delete pxe artifacts#8074
hickeng merged 5 commits intovmware:masterfrom
hickeng:8067b

hickeng commented Jun 19, 2018 •

edited

Loading

Uh oh!

lmalvins Jun 19, 2018 •

edited

Loading

Uh oh!

lmalvins left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

hickeng commented Jun 19, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lmalvins Jun 19, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lmalvins left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

hickeng commented Jun 19, 2018 •

edited

Loading

lmalvins Jun 19, 2018 •

edited

Loading