test: add integration tests for watch capabilities (#709)#868
Conversation
🟢 Impact Analysis — PR #868Risk tier: 🟢 LOW 📊 Summary
🎯 Risk Factors
📦 Modules Affectedtests (1 file)
This report is generated automatically for every PR. See #733 for details. |
🛫 PR Readiness Check
PR Scope: 🔧 Infrastructure
|
| Status | Check | Details |
|---|---|---|
| ❌ | Single commit | 2 commits — consider squashing before review |
| ✅ | Not in draft | Ready for review |
| ❌ | Branch up to date | dev is 3 commit(s) ahead — rebase recommended |
| ❌ | Copilot review | No Copilot review yet — it may still be processing |
| ✅ | Changeset present | No source files changed — changeset not required |
| ✅ | Scope clean | No .squad/ or docs/proposals/ files |
| ✅ | No merge conflicts | No merge conflicts |
| ✅ | Copilot threads resolved | 0 active Copilot thread(s) resolved (1 outdated skipped) |
| ❌ | CI passing | 16 check(s) still running |
Files Changed (1 file, +713 −0)
| File | +/− |
|---|---|
test/cli/watch-capabilities.test.ts |
+713 −0 |
Total: +713 −0
This check runs automatically on every push. Fix any ❌ items and push again.
See CONTRIBUTING.md and PR Requirements for details.
There was a problem hiding this comment.
Pull request overview
Adds a new vitest suite to cover the highest-risk squad watch capability plugins introduced in/around #709, improving confidence in preflight gating, filtering logic, and subprocess-driven flows without relying on real gh, git, or filesystem state.
Changes:
- Adds a comprehensive
watch-capabilitiestest suite covering Execute, Cleanup, Decision Hygiene, Self Pull, and Board capability behaviors. - Introduces shared mocks for SDK storage,
child_process, andfsto keep tests deterministic.
9e3185a to
9069171
Compare
Add 51 tests covering 5 watch capabilities (execute, cleanup, decision-hygiene, self-pull, board) that shipped with zero test coverage in PR #709. Test coverage: - ExecuteCapability: buildAgentPrompt, findExecutableIssues edge cases, preflight, execute flow (adapter mock, agent dispatch, errors, timeouts) - CleanupCapability: preflight, round-skipping, file pruning by date, stale inbox warnings, config validation - DecisionHygieneCapability: preflight, threshold logic, merge trigger, timeout handling - SelfPullCapability: preflight, git stash/fetch/pull flow, stash pop failure, source change detection - BoardCapability: preflight checks Closes #709 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
9069171 to
4f7a946
Compare
tamirdresher
left a comment
There was a problem hiding this comment.
Approved by @tamirdresher — reviewed diff, CI green, changes are clean and well-tested. cc @diberry
Add 51 tests covering 5 watch capabilities (execute, cleanup, decision-hygiene, self-pull, board) that shipped with zero test coverage in PR #709. Test coverage: - ExecuteCapability: buildAgentPrompt, findExecutableIssues edge cases, preflight, execute flow (adapter mock, agent dispatch, errors, timeouts) - CleanupCapability: preflight, round-skipping, file pruning by date, stale inbox warnings, config validation - DecisionHygieneCapability: preflight, threshold logic, merge trigger, timeout handling - SelfPullCapability: preflight, git stash/fetch/pull flow, stash pop failure, source change detection - BoardCapability: preflight checks Closes #709 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
|
Closed by #theSquadsquad triage — stale/answered, no further action needed. |
Summary
Adds 51 tests covering the 5 highest-risk watch capabilities that shipped in PR #709 with zero test coverage. This was flagged as 🟡 High priority in the architecture review.
Capabilities tested
Closes #709