Skip to content

retrigger failed test tasks for Linux cloud pools 5 times and identify permanent failures #1191

@Archaeopteryx

Description

@Archaeopteryx

Subtask of #1189

If a task failed and has the jobKind of test:

  • Check if the task has been classified. If yes, do nothing and quit here.
  • Check if the task is tier 1 or 2. If not, do nothing and quit here.
  • Identify if the failure are test failures or other issues, e.g. infrastructure
    • If there is no task artifact which contains errorsummary in its name or is incomplete, then the task either hit an infrastructure issue or the test suite does not support failure summaries by errorsummary.
  • If test failure
    • Check if the task got executed on Linux cloud worker pool. The Task Queue ID listed for it (example should start with gecko-t/t-linux-.
    • Schedule up to 5 task retriggers to have at least 6 tasks (original one + 5 retriggers): check if there are already retriggers scheduled for the push
  • retriggers complete: Decide if the failure rate is a regression
    • retriggers: At least 2 retriggers observed the same test failures as the initial task: regression

Classification would be automatically by LPID.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions