Test: Intentional build and test failures for LLM plugin by mokagio · Pull Request #22779 · wordpress-mobile/WordPress-Android

mokagio · 2026-04-09T00:29:12Z

Summary

Introduces an intentional compilation error and a test assertion failure to verify that the Claude build analysis CI plugin from "Fix Claude build analysis gate: correctly skip only when all failures are non-essential" (#22760) correctly detects and comments on failures.
Do not merge — this PR exists solely to test the LLM plugin.

Test plan

CI runs and fails as expected (build error + test failure)
The LLM plugin posts a comment explaining the failures

Posted by Claude Code (Opus 4.6) on behalf of @mokagio with approval.

🤖 Generated with Claude Code

350e6db forces a build failure, which resulted in the expected build failure annotation

b88be39 reverts the above, leaving only Danger as a failure. Unfortunately, the script did not behave as expected and still ran the build failure analysis

78f16aa fixes it.

885f030 added another non-essential step to verify the array checks

91b15eb brought the build to green (I added the label and milestone to satisfy Danger) and revealed a bug: the gating step logged "Real failures detected, running Claude analysis" despite zero failures. The logic assumes that non_essential_failures == 0 means essential steps failed, but the count is also 0 when nothing failed.

Finally, 85e1e9c fixed it (TBD)

Improve the custom prompt to ignore non-real failures (the intentional exit 1 trigger, Danger PR Check, and broken/skipped jobs) and respond briefly when no actual failures exist. Migrate model from Sonnet 4.5 to Sonnet 4.6. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Use iangmaia/claude-summarize fork which adds build_log_mode, max_log_lines, and on-failure trigger. This feeds only failed job logs to Claude (capped at 1500 lines) for more focused analysis with less noise. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
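The log cap mentioned in this commit can be pictured with a small sketch. This is only an illustration under stated assumptions: the `cap_log_lines` helper is hypothetical, and the commit message does not say whether the fork keeps the start or the end of a long log. The sketch keeps the tail, on the assumption that failures tend to surface near the end of a build log. Only the 1500-line figure and the `max_log_lines` name come from the commit message.

```python
def cap_log_lines(log_text: str, max_log_lines: int = 1500) -> str:
    """Keep at most the last `max_log_lines` lines of a log (assumed behaviour)."""
    if max_log_lines <= 0:
        return ""
    lines = log_text.splitlines()
    if len(lines) <= max_log_lines:
        # Short logs are passed through unchanged.
        return log_text
    # Assumption: the tail is the most useful part of a failed job's log.
    return "\n".join(lines[-max_log_lines:])


sample_log = "\n".join(f"line {i}" for i in range(1, 2001))
capped = cap_log_lines(sample_log)
print(len(capped.splitlines()))
```

A cap like this bounds how much text is sent for analysis regardless of how long the failed job ran.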

Add upload-claude-analysis.sh that checks non-essential step outcomes before uploading the Claude analysis pipeline. When only Danger or other non-critical jobs failed, the pipeline is not uploaded — no analysis, no annotation, no PR comment. Also simplify the custom prompt now that Danger is filtered upstream. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

When real failures co-occur with a Danger failure, Claude should still ignore Danger rather than wasting analysis space on it. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…L scalar Agent-Logs-Url: https://github.com/wordpress-mobile/WordPress-Android/sessions/f2282ed4-20ab-42a2-b37b-6fac3bfa3514 Co-authored-by: mokagio <1218433+mokagio@users.noreply.github.com>

…tial co-failures Agent-Logs-Url: https://github.com/wordpress-mobile/WordPress-Android/sessions/1f6e9e79-7580-4d15-97d2-c85bb5452559 Co-authored-by: mokagio <1218433+mokagio@users.noreply.github.com>

The jq filter only matched state == "failed", missing jobs with state == "timed_out". This could cause Claude to be skipped when a real job times out alongside a non-essential failure like Danger. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
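The state check described in this commit can be sketched as follows. This is a minimal Python illustration, not the script's actual jq expression: the job dictionaries, the `name` field, and the `count_failed_jobs` helper are hypothetical. Only the `failed` and `timed_out` state strings come from the commit message.

```python
# States that should count as a failure; matching only "failed" missed timeouts.
FAILURE_STATES = {"failed", "timed_out"}


def count_failed_jobs(jobs):
    """Return how many jobs ended in one of the failure states."""
    return sum(1 for job in jobs if job.get("state") in FAILURE_STATES)


# Hypothetical job list: a non-essential failure next to a real timeout.
jobs = [
    {"name": "Danger", "state": "failed"},
    {"name": "Unit tests", "state": "timed_out"},
    {"name": "Lint", "state": "passed"},
]
print(count_failed_jobs(jobs))
```

With only `failed` in the set, the timed-out job would go uncounted and the total would equal the number of non-essential failures, which is the situation the commit message describes.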

Test the LLM CI plugin by introducing:
- A compilation error (reference to non-existent type)
- A test assertion failure (swapped expected values)

---
Generated with the help of Claude Code, https://code.claude.com

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Verify that Claude analysis is skipped when only Danger (non-essential) fails. --- Generated with the help of Claude Code, https://claude.ai/code Co-Authored-By: Claude Code Opus 4.6 <noreply@anthropic.com>

wpmobilebot · 2026-04-09T01:21:12Z

📲 You can test the changes from this Pull Request in WordPress Android by scanning the QR code below to install the corresponding build.

	App Name	WordPress Android
	Build Type	Debug
	Version	pr22779-85e1e9c
	Build Number	`1488`
	Application ID	`org.wordpress.android.prealpha`
	Commit	`85e1e9c`
	Installation URL	0eim8c093oo1o

Automatticians: You can use our internal self-serve MC tool to give yourself access to those builds if needed.

wpmobilebot · 2026-04-09T01:24:05Z

📲 You can test the changes from this Pull Request in Jetpack Android by scanning the QR code below to install the corresponding build.

	App Name	Jetpack Android
	Build Type	Debug
	Version	pr22779-85e1e9c
	Build Number	`1488`
	Application ID	`com.jetpack.android.prealpha`
	Commit	`85e1e9c`
	Installation URL	12g6qctnku9vo

Automatticians: You can use our internal self-serve MC tool to give yourself access to those builds if needed.

`buildkite-agent step get outcome` returns `hard_failed`, not `failed`. The wrong string meant non-essential failures were never detected. --- Generated with the help of Claude Code, https://claude.ai/code Co-Authored-By: Claude Code Opus 4.6 <noreply@anthropic.com>
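To make the effect of the wrong string concrete, here is a small sketch of the comparison. It does not call `buildkite-agent`; the `outcomes` mapping, the step keys, and the `count_non_essential_failures` helper are hypothetical stand-ins for whatever the script collects. Only the `hard_failed` and `failed` strings come from the commit message.

```python
# Per the commit message, a failed step reports "hard_failed", not "failed".
FAILED_OUTCOME = "hard_failed"


def count_non_essential_failures(outcomes, non_essential_keys, failed_outcome=FAILED_OUTCOME):
    """Count non-essential steps whose reported outcome equals `failed_outcome`."""
    return sum(
        1
        for key in non_essential_keys
        # strip() guards against a trailing newline from command output.
        if outcomes.get(key, "").strip() == failed_outcome
    )


# Hypothetical outcomes keyed by step key.
outcomes = {"danger": "hard_failed\n", "lint": "passed"}
non_essential = ["danger"]

print(count_non_essential_failures(outcomes, non_essential))            # correct string
print(count_non_essential_failures(outcomes, non_essential, "failed"))  # old, wrong string
```

Comparing against `failed` never matches, so the non-essential count stays at zero even when Danger is the only step that failed.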

Adds a step that always fails and registers it in the non-essential array to verify multi-entry gating. --- Generated with the help of Claude Code, https://claude.ai/code Co-Authored-By: Claude Code Opus 4.6 <noreply@anthropic.com>

This reverts commit 885f030.

Query total failures first via the API. When zero, exit early instead of falling through to the non-essential check which assumed failures. --- Generated with the help of Claude Code, https://claude.ai/code Co-Authored-By: Claude Code Opus 4.6 <noreply@anthropic.com>
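The ordering this commit describes can be sketched as a small decision function. This is an illustration of the idea rather than the contents of upload-claude-analysis.sh: the `should_run_analysis` name and the final comparison are assumptions. The early exit on zero total failures is what the commit message states.

```python
def should_run_analysis(total_failures: int, non_essential_failures: int) -> bool:
    """Decide whether the Claude analysis pipeline should be uploaded."""
    if total_failures == 0:
        # Nothing failed at all: exit early instead of inspecting non-essential steps.
        return False
    # Assumption: run the analysis only if some failure is not explained by
    # a non-essential step such as Danger.
    return total_failures > non_essential_failures


print(should_run_analysis(0, 0))  # green build
print(should_run_analysis(1, 1))  # only a non-essential step failed
print(should_run_analysis(2, 1))  # a real failure alongside a non-essential one
```

Checking the total first separates the two cases that both produce a non-essential count of zero: a green build and a build where only essential steps failed.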

iangmaia and others added 8 commits April 8, 2026 17:41

Add Danger back to the prompt exclusion list

6238001

Address review feedback: fix shebang, use Bash array, use literal YAM…

9b69bc1

…L scalar Agent-Logs-Url: https://github.com/wordpress-mobile/WordPress-Android/sessions/f2282ed4-20ab-42a2-b37b-6fac3bfa3514 Co-authored-by: mokagio <1218433+mokagio@users.noreply.github.com>

Fix upload-claude-analysis.sh to correctly handle essential+non-essen…

06a203a

…tial co-failures Agent-Logs-Url: https://github.com/wordpress-mobile/WordPress-Android/sessions/1f6e9e79-7580-4d15-97d2-c85bb5452559 Co-authored-by: mokagio <1218433+mokagio@users.noreply.github.com>

mokagio self-assigned this Apr 9, 2026

Revert intentional build/test failures

b88be39

mokagio mentioned this pull request Apr 9, 2026

Fix Claude build analysis gate: correctly skip only when all failures are non-essential #22760

Merged

1 task

mokagio and others added 3 commits April 9, 2026 14:25

Add second non-essential step to test array

885f030

Revert "Add second non-essential step to test array"

91b15eb

mokagio added the [Type] Tooling label Apr 9, 2026

mokagio added this to the 26.8 milestone Apr 9, 2026

mokagio closed this Apr 9, 2026

mokagio wants to merge 13 commits into trunk from iangmaia/test-llm-plugin-failures

4 participants
