Correct suite retry units, delete pxe artifacts #8074

Merged
hickeng merged 5 commits into vmware:master from hickeng:8067b
Jun 21, 2018
Conversation

@hickeng hickeng commented Jun 19, 2018

Retry attempts for suite provisioning were incorrectly being interpreted as
seconds instead of attempts.
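
A minimal sketch of how a count and a duration can be confused, offered as an illustration rather than as the code changed in this PR. Robot Framework's `Wait Until Keyword Succeeds` accepts its retry argument either as a timeout or as a count with a `times` or `x` suffix, so a bare number reads as seconds. Whether that is the exact mechanism behind this bug is an assumption; `RetryBudget` and `parse_retry_budget` are hypothetical names, and the sketch handles only bare seconds on the timeout side.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class RetryBudget:
    """Either a number of attempts or a time limit, never both."""
    attempts: Optional[int] = None
    timeout_seconds: Optional[float] = None


# A count must carry an explicit suffix, e.g. "5x" or "5 times".
_COUNT_PATTERN = re.compile(r"^\s*(\d+)\s*(?:x|times)\s*$", re.IGNORECASE)


def parse_retry_budget(spec: str) -> RetryBudget:
    """Interpret a retry spec; anything without a count suffix is seconds."""
    match = _COUNT_PATTERN.match(spec)
    if match:
        return RetryBudget(attempts=int(match.group(1)))
    # Simplification: only bare numbers are accepted as a timeout here.
    return RetryBudget(timeout_seconds=float(spec))
```

With this convention, `parse_retry_budget("5")` yields a five-second timeout while `parse_retry_budget("5x")` yields five attempts, which is the kind of unit mix-up described above.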

The new vSAN testbed spec was using a different pxe folder name than expected
in the folder cleanup logic (pxeinstall instead of pxe) leading to disk quota issues.

Add Host To Distributed Switch should retry within the keyword or not at all
for consistent error handling and behaviour.
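
The idea of retrying inside the operation rather than around each call site can be sketched as follows. This is a hedged illustration in Python, not the Robot Framework keyword from the PR: `call_with_retry`, the `attach` callable, and the default attempt count and interval are all assumptions made for the example.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def call_with_retry(operation: Callable[[], T], attempts: int,
                    interval_seconds: float,
                    sleep: Callable[[float], None] = time.sleep) -> T:
    """Run operation up to `attempts` times, sleeping between failures."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        try:
            return operation()
        except Exception:
            # Re-raise the final failure so callers see the real error.
            if attempt == attempts:
                raise
            sleep(interval_seconds)
    raise AssertionError("unreachable")


def add_host_to_distributed_switch(host: str, attach: Callable[[str], T],
                                   attempts: int = 3,
                                   interval_seconds: float = 30.0,
                                   sleep: Callable[[float], None] = time.sleep) -> T:
    """Hypothetical operation that owns its retry policy.

    Callers invoke this once; no call site wraps it in its own retry,
    so every caller gets the same attempts, interval, and error.
    """
    return call_with_retry(lambda: attach(host), attempts, interval_seconds, sleep)
```

Keeping the policy in one place means a change to the attempt count or interval applies to every caller at once.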

Adds parameterization for number of parallel jobs and use of ops-user.

Towards #8067

@hickeng hickeng requested a review from a team as a code owner June 19, 2018 16:52
  Open Connection %{NIMBUS_GW}
  Wait Until Keyword Succeeds 2 min 30 sec Login %{NIMBUS_USER} %{NIMBUS_PASSWORD}
- Run Keyword If ${deletePXE} Execute Command ${NIMBUS_LOCATION} rm -rf public_html/pxe/*
+ Run Keyword If ${deletePXE} Execute Command ${NIMBUS_LOCATION} rm -rf public_html/pxe/* public_html/pxeinstall/*
@lmalvins lmalvins Jun 19, 2018

LGTM, we do the same with the vic-ui e2e tests and pxe folder

@lmalvins lmalvins left a comment

LGTM

hickeng added 2 commits June 19, 2018 14:09
Checks an error return from a nimbus provisioning call instead of
trying to infer status solely from console output.
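
Checking the error return before parsing console output might look like the sketch below. It assumes the provisioning step is an external command; `run_provisioning`, `ProvisioningError`, and the success marker are hypothetical and do not reflect the actual nimbus tooling.

```python
import subprocess
import sys
from typing import Sequence


class ProvisioningError(RuntimeError):
    """Raised when a provisioning command fails or gives unexpected output."""


def run_provisioning(command: Sequence[str], success_marker: str) -> str:
    """Run a command, trusting its exit code before its console output."""
    result = subprocess.run(command, capture_output=True, text=True)
    # A non-zero exit code is a failure even if the output looks healthy.
    if result.returncode != 0:
        raise ProvisioningError(
            f"command exited with {result.returncode}: {result.stderr.strip()}"
        )
    # Output parsing remains as a second check, not the only one.
    if success_marker not in result.stdout:
        raise ProvisioningError(f"marker {success_marker!r} not found in output")
    return result.stdout


if __name__ == "__main__":
    # Stand-in command: a Python one-liner instead of a real deploy call.
    print(run_provisioning([sys.executable, "-c", "print('deployed ok')"], "deployed"))
```

A command that prints the marker but exits non-zero is reported as a failure immediately, instead of being misread as a success from its output.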

Reduce sleep in inner retry loops during nimbus provisioning to 1m instead
of 5m. This should really be parameterized or removed completely in favour
of the outer retry logic unless there's significant improvement in success
probability from retrying at that level.
@hickeng hickeng force-pushed the 8067b branch 4 times, most recently from 81bf9cb to db2cbe3 Compare June 20, 2018 16:53
Previously there were retries of different durations wrapped around the
outside of the Add Host To Distributed Switch keyword. If the operation may
need retrying, the keyword itself should retry; that avoids this cluster of
inconsistent wrappers and covers the call sites where no retry was present.
@hickeng hickeng merged commit 1387ab1 into vmware:master Jun 21, 2018
zjs pushed a commit to zjs/vic that referenced this pull request Jul 19, 2018
This commit attempts to improve predictability by normalizing error
checking, retry approaches, pxe boot directory, and parameterizing
previously hard-coded build config. The latter is included because it
allows the caller to make reasonable estimates about how the build will
run rather than having to dig through test code.

Previously there were retries of different durations wrapped around the
outside of the Add Host To Distributed Switch keyword. If the operation may
need retrying, the keyword itself should retry; that avoids this cluster of
inconsistent wrappers and covers the call sites where no retry was present.

Reduce the sleep in inner retry loops and add a check for the exit code in
addition to output parsing, both so that failures surface faster. The inner
loops should be parameterized or removed completely in favour of the outer
retry logic unless retrying at that level significantly improves the
probability of success; however, this commit does not embark on that work.

(cherry picked from commit 1387ab1)
zjs pushed a commit to zjs/vic that referenced this pull request Jul 27, 2018
…) (vmware#8151)

zjs pushed a commit that referenced this pull request Jul 27, 2018
…8151)
