Optimize CI integration test matrix by splitting bottleneck groups#7603
Merged
Optimize CI integration test matrix by splitting bottleneck groups#7603
Conversation
Split three large test groups into smaller, focused groups: 1. Split "Workflow Cache & Actions" (247.4s) into 3 groups: - Workflow Cache - Workflow Actions Pin Validation (isolates slow SHA tests) - Workflow Actions & Containers 2. Split "CLI Completion & Other" (155.4s) into 5 groups: - CLI Add & List Commands - CLI Update Command - CLI Audit & Inspect - CLI Completion & Other (reduced catch-all) 3. Split "Workflow Misc Part 2" (150.8s) into 3 groups: - Workflow String & Sanitization - Workflow Runtime & Setup - Workflow Misc Part 2 (reduced catch-all) Total groups increased from 26 to 33 for better parallelization. Expected impact: ~60% reduction in longest-running groups. Co-authored-by: mnkiefer <8320933+mnkiefer@users.noreply.github.com>
Copilot
AI
changed the title
[WIP] Optimize CI integration test matrix to reduce bottlenecks
Optimize CI integration test matrix by splitting bottleneck groups
Dec 25, 2025
pelikhan
approved these changes
Dec 25, 2025
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Three test groups were dominating CI runtime: "Workflow Cache & Actions" (247s), "CLI Completion & Other" (155s), and "Workflow Misc Part 2" (151s). These are split into focused, parallelizable groups.
Changes
Workflow Cache & Actions → 3 groups:
Workflow Cache- Cache validation testsWorkflow Actions Pin Validation- SHA verification tests (16s+ each, network-bound)Workflow Actions & Containers- Remaining action/container testsCLI Completion & Other → 5 groups:
CLI Add & List CommandsCLI Update CommandCLI Audit & InspectCLI Completion & Other- Reduced catch-all with updated skip_patternWorkflow Misc Part 2 → 3 groups:
Workflow String & SanitizationWorkflow Runtime & SetupWorkflow Misc Part 2- Reduced catch-all with updated skip_patternImpact
Original prompt
This section details on the original issue you should resolve
<issue_title>[ci-coach] Optimize CI integration test matrix to reduce bottlenecks</issue_title>
<issue_description>## CI Optimization Proposal
Summary
This PR addresses three severe bottlenecks in the CI integration test matrix identified through analysis of recent test runs. The changes rebalance the test matrix by splitting large, slow test groups into smaller, focused groups that can complete faster.
Expected Impact: ~60% reduction in the longest-running integration test groups, reducing overall CI time.
Analysis Results
Analyzed test timing data from the most recent successful CI run:
TestActionPinSHAsMatchVersionTagstests (16s+ each)Optimizations
1. Workflow Cache & Actions → Split into 3 Groups
Type: Matrix Rebalancing
Impact: ~165s per run (67% reduction in bottleneck)
Risk: Low
Current State:
TestActionPinSHAsMatchVersionTagstests that validate action SHAs (16s+ per action)Proposed Structure:
Rationale:
TestActionPinSHAsMatchVersionTagstests are network-bound (verify GitHub tags) and run 16s+ per action. Isolating them allows other tests to complete faster in parallel.2. CLI Completion & Other → Extract Specific Command Groups
Type: Matrix Rebalancing
Impact: ~100s per run (65% reduction)
Risk: Low
Current State:
Proposed Structure:
Rationale: The original catch-all contained 1,023 tests including tests for
add,list,update,audit, andinspectcommands. Extracting these specific command groups reduces the catch-all size and enables better parallelization.3. Workflow Misc Part 2 → Extract String & Runtime Groups
Type: Matrix Rebalancing
Impact: ~100s per run (66% reduction)
Risk: Low
Current State:
Proposed Structure:
Rationale: The catch-all contained 3,997 tests, making it the largest test group by far. Extracting string manipulation and runtime setup tests into dedicated groups reduces the catch-all size significantly.
Expected Impact
Estimated Total Savings: ~165s in critical path per CI run
Validation Status
Manual Validation Performed:
Required Post-Merge Validation:
Testing Plan
💬 We'd love your input! Share your thoughts on Copilot coding agent in our 2 minute survey.