This test (from #8851) seems flaky by design, does it really matter which task finishes first in absolute time? example failure on main: https://ci.tlcpack.ai/blue/organizations/jenkins/tvm/detail/main/2391/pipeline older PR failures: https://ci.tlcpack.ai/blue/organizations/jenkins/tvm/detail/PR-9802/11/ https://ci.tlcpack.ai/blue/organizations/jenkins/tvm/detail/PR-9909/5/ cc @tqchen @vinx13 @shingjan