auto-fix batch claude/friendly-maxwell-UlJEd 2026-05-03 by intendednull · Pull Request #560 · intendednull/willow

intendednull · 2026-05-03T01:02:05Z

Scheduled /resolving-issues sweep. Nine small-scope fixes from the 2026-05-02 general-audit (master ticket #513) landed sequentially. One audit child (#535) was coordinator-skipped as ambiguous-fix-path (design call needed); a skill edit captures the new pattern.

Fixes

Fixes audit F25 [supply]: setup-e2e.sh uses npm install — should use npm ci #530 — fix(scripts): use npm ci in setup-e2e (8089a62). One-line lockfile-respect swap; package-lock.json precondition verified.
Fixes audit F23 [supply]: cargo install trunk/just in setup-e2e.sh omits --locked #529 — build(setup-e2e): pin trunk + just installs with --locked (a5e2ad0). trunk@0.21.14 mirrors deploy.yml's pin; just@1.50.0 is latest stable.
Fixes audit F1 [robustness]: replay/role.rs missing storage's heads-cap rejection #514 — fix(replay): cap peer HeadsSummary in sync (c65ffab). Sibling-of-closed (auto-fix batch claude/friendly-maxwell-BjjKA (2026-05-02) #507 b075140 storage cap). Approach A: centralized MAX_AUTHORS_PER_SYNC in willow-common alongside SYNC_BATCH_LIMIT so storage + replay can't drift; replay returns WorkerResponse::Denied mirroring storage's bail! text. Tests sync_request_rejects_oversize_heads + sync_request_accepts_exact_cap_heads. (5 files, +111/-17.)
Fixes audit F3 [robustness]: is_zero_duration misses 0.01ms!important reduced-motion override #515 — fix(web): treat sub-1ms transition as zero-duration (d23f3f8). Sibling-of-closed (feat(web): event-based waits PR-3 — data-state lifecycle + page.clock helper #496 8d89f18). Added parse_duration_seconds parser + is_zero_duration_str predicate w/ 1ms epsilon — handles the 0.01ms !important reduced-motion override + future near-zero serializations. 9 unit tests (testable string predicate, no DOM). Browser run deferred to CI (no wasm-pack/Firefox in sandbox; cargo check --target wasm32 fallback gate green).
Fixes audit F47 [quality]: worker_cache test uses std::thread::sleep — flake risk #547 — test(client): inject now into worker_cache eviction (b20b8ec). Approach A: added pub(crate) fn evict_stale_at(&mut self, now: Instant); existing evict_stale() delegates. Test rewritten to use synthetic now = Instant::now() + 60s — no more std::thread::sleep. (audit F45 [robustness]: WorkerCache uses Instant::now() (native-only) in willow-client lib crate #545 Instant-on-wasm gating left to its own ticket.)
Fixes audit F43 [robustness]: web::app unwraps web_sys::window() on user-gesture path #543 — fix(web): handle missing window in getUserMedia path (4a19bc6). let Some(window) = web_sys::window() else { tracing::error!(...); return; } matches surrounding let Ok(...) else style. No other unwraps in the gesture block.
Fixes audit F37 [arch]: willow-identity carries tokio dev-dep despite being lib crate #537 — chore(identity): drop unused tokio dev-dep (2363f99). grep confirmed only a doc-comment hit, no #[tokio::test] use. tempfile dev-dep stays (used by tempdir tests).
Fixes audit F26 [robustness]: deploy verify uses sleep 3 instead of polling #531 — ci(deploy): poll verify endpoint up to ~30s (baa4484). Replaced blind sleep 3 w/ 15-attempt × 2s curl poll loop, exit 1 on final failure with stderr diagnostic. systemctl is-active check unchanged.
Fixes audit F13 [docs]: KickMembers permission listed in CLAUDE.md but not a Permission variant #521 — docs: drop fictional KickMembers/Administrator perms (64ca482). Dropped both from CLAUDE.md's permission list + the agentic-peer-api spec's "Valid permission values" line. Reworded trust_peer/untrust_peer table entries to "Grant/Revoke admin status (via vote)". Aligns docs w/ canonical Permission enum at crates/state/src/event.rs:46-69 and the design rationale at docs/specs/2026-04-01-per-author-merkle-dag-state-design.md:295,623,634,1565. Did NOT add the variant — would require state-machine + spec change, out of scope.

Already-Fixed

None this run. The general-audit at #513 was filed earlier the same day against the same main @ b901575 head this run started from, so by definition no fixes had landed between audit and dispatch (skill now notes this is the expected zero-yield case for same-day audit-to-fix gaps).

Parked

None. One audit child (#535 — Arc<Mutex> in willow-actor fsm.rs:415) was coordinator-skipped during step 6 picks, NOT parked: the test code at the cited line is already inside #[cfg(test)] mod tests (lines 175-460), so the audit's literal "move into mod tests" prescription is satisfied at HEAD. The audit's underlying concern is real (the !**/tests* exclusion glob doesn't catch in-file test modules) but the fix is a design call — either move the test to an external crates/actor/tests/fsm_tests.rs so the path-based glob catches it, or update the audit glob. Skill edit 5320c8e documents this new "ambiguous-fix-path" coordinator-skip pattern; left open for the next run with fresh eyes / accumulated context.

Skill Evolution

5320c8e docs(skill): coordinator-skip for ambiguous-fix-path issues — adds two notes to step 6 of the Core Loop:
- Same-day audit-to-fix expects zero already-fixed hits. When the audit was filed against the same HEAD the run starts from, no fixes can have landed in the gap by definition. Don't over-invest in the sweep in this case; one quick git log <audit-ref>..origin/main per pick is enough.
- Ambiguous-fix-path coordinator-skip pattern. When an audit's prescribed fix is ambiguous-by-design (≥ 2 valid approaches needing a design call) AND the literal premise is already satisfied at HEAD, skip from the dispatch queue without closing. Surface in run-end Lessons Learned. Don't comment on the issue (audit body already captures the concern). audit F35 [debt]: Arc<Mutex> in willow-actor fsm.rs (move into mod tests if test-only) #535 was the canonical example surfaced this run.

Lessons Learned

8/8 implementer dispatches landed cleanly, no blockers. No mid-flight aborts, no finalize-implementer rescues, no scope-creep guards tripped. Sequential pattern + Monitor SHA-watch pattern both worked smoothly. Two-signal convergence (SHA-advance via Monitor + agent-completion notification) behaved per the existing skill guidance — both arrived per dispatch in mostly the same order, git status after each signal disambiguated cleanly.
Coordinator-side pre-flight grep/sed verification at HEAD was load-bearing. Step 6 spot-checks of every cited file/line confirmed all 9 picks were real before dispatch — caught one skip-worthy pick (audit F35 [debt]: Arc<Mutex> in willow-actor fsm.rs (move into mod tests if test-only) #535) and saved its dispatch slot. No false-premise audit findings reached an implementer this run. Worth keeping the discipline; cheap up front, expensive when wasted.
Brainstorm gate threshold worked correctly across the picks. Six issues skipped brainstorm (one-liner shell / config / docs / single-let-else / dev-dep removal / yaml shell loop). Three triggered automated brainstorm (#514 cross-crate const placement → A vs B; #515 parser approach → A vs B vs C; #547 clock injection → A vs B vs C). Implementer briefs included pre-decided narrowing context where the coordinator could (e.g. for #514, named the WorkerResponse::Denied pattern to mirror storage's bail!); each implementer's brainstorm landed within the cap. None expanded to plan-file territory.
Mechanical scope expansion ("call-site migration is part of the fix") triggered three times, all cleanly justified.
- audit F1 [robustness]: replay/role.rs missing storage's heads-cap rejection #514 added willow-common dep edge to willow-replay to centralize the cap. Caught two prod sites (storage + replay) that share the invariant.
- audit F3 [robustness]: is_zero_duration misses 0.01ms!important reduced-motion override #515 swapped a 9-arm matches! predicate for a parser + epsilon predicate. The string-predicate refactor was the load-bearing site, the parser is its testable foundation.
- audit F13 [docs]: KickMembers permission listed in CLAUDE.md but not a Permission variant #521 fixed the same KickMembers/Administrator drift in two doc files (CLAUDE.md + the agentic-peer-api spec) — single coherent fix, not creep.
All three flagged the deviation in implementer reports + commit bodies. Skill's existing wording covered each; no edit needed.
#506 territory (pre-existing wasm-clippy --all-targets failures in willow-client tests) surfaced again from the audit F47 [quality]: worker_cache test uses std::thread::sleep — flake risk #547 implementer. PR auto-fix batch claude/friendly-maxwell-f34GI 2026-05-02 #511 fixed this on its branch but is unmerged, so each implementer touching willow-client still hits it on clippy --target wasm32 -p willow-client --all-targets. Mitigation already known — implementer scoped to --lib and noted the gap. Once auto-fix batch claude/friendly-maxwell-f34GI 2026-05-02 #511 merges, the friction disappears.
No false-alarm finalize-implementer dispatches this run. Full-SHA capture (skill d5d6457) + Monitor wait pattern held; no preemptive crash conclusions on lagging completion notifications.

Test plan

Master-PR CI is the load-bearing gate. Locally, each implementer ran the scoped subset (fmt, native + wasm clippy on touched crates, native test, wasm32 check). No just check workspace-wide run since just is unavailable in some sandboxes; raw cargo equivalents per the skill's fallback.

CI gates to verify on this PR:

cargo fmt
cargo clippy workspace (native + wasm32)
cargo test workspace (state + client + identity + replay + storage + common + web + actor + worker)
wasm-pack browser tests (Firefox + geckodriver — only observable on CI; new is_zero_duration parser tests run as native unit tests)
cargo audit (no advisory changes this run)
Playwright e2e (no behaviour changes — sanity only)

Generated by Claude Code

`npm install` can mutate package-lock.json; `npm ci` installs exactly the lockfile + refuses to mutate it. Deterministic E2E setup demands the strict variant. No Rust changes — only fmt run as smoke check (clippy/test skipped). Refs #530

Bare `cargo install` ignores each tool's Cargo.lock, so any compromised transitive dep on crates.io silently lands in the E2E env. `--locked --version X.Y.Z` makes the install deterministic.
- trunk 0.21.14 matches deploy.yml workflow pin
- just 1.50.0 latest stable (no existing pin in workflows)
Refs #529

Mirror the storage cap added by PR #507 / b075140 (MAX_AUTHORS_PER_SYNC = 256) on the replay path. Without the guard, ReplayRole::handle_request(WorkerRequest::Sync) iterates a peer-supplied HeadsSummary into a BTreeMap and walks the in-memory DAG once per author: the same DoS shape as storage's sync_since/history before #507.

Approach A (centralize the const in willow-common alongside SYNC_BATCH_LIMIT) chosen over B (define a local const in replay): the cap is a wire-protocol invariant that BOTH workers must agree on. A single source of truth in willow-common, already a dep of both crates, guarantees they cannot drift. Cost is one extra crate dep edge for willow-replay (already had willow-state, willow-identity, willow-worker, willow-network).

Storage's local pub const is removed; it now imports from willow-common. No behavioural change to storage: the value and the bail! sites are byte-identical. Replay uses WorkerResponse::Denied { reason } (sync handler, not anyhow::Result), mirroring the existing "unknown server" branch and the storage error message text.

Tests:
- sync_request_rejects_oversize_heads (MAX+1 → Denied)
- sync_request_accepts_exact_cap_heads (MAX → not Denied)

Refs #514 https://claude.ai/code/session_019HhgeDZ5HCbEUygRRLCjde
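A minimal sketch of the guard described above, assuming a trimmed-down `WorkerResponse` and a `HeadsSummary` modelled as a plain `BTreeMap`. Both are hypothetical stand-ins for the real willow types, and `handle_sync` is an illustrative name rather than the actual `ReplayRole::handle_request`. Only the cap value (256) and the `Denied { reason }` shape come from the commit message.

```rust
use std::collections::BTreeMap;

/// Cap value as stated in the commit message (MAX_AUTHORS_PER_SYNC = 256).
pub const MAX_AUTHORS_PER_SYNC: usize = 256;

/// Hypothetical stand-in for the peer-supplied summary: author id -> head marker.
pub type HeadsSummary = BTreeMap<String, u64>;

/// Hypothetical, trimmed-down response type for illustration only.
#[derive(Debug, PartialEq)]
pub enum WorkerResponse {
    Denied { reason: String },
    SyncBatch { authors: usize },
}

/// Rejects oversize summaries before any per-author work is done.
pub fn handle_sync(heads: &HeadsSummary) -> WorkerResponse {
    if heads.len() > MAX_AUTHORS_PER_SYNC {
        return WorkerResponse::Denied {
            reason: format!(
                "heads summary has {} authors, max is {}",
                heads.len(),
                MAX_AUTHORS_PER_SYNC
            ),
        };
    }
    // The real handler would walk the DAG here; the sketch only echoes the count.
    WorkerResponse::SyncBatch { authors: heads.len() }
}
```

Keeping the constant in one shared place is what lets two callers (storage and replay in the commit) apply the same bound; the sketch shows only one caller.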

is_zero_duration only matched "", "0s", "0ms", but the global prefers-reduced-motion rule in style.css forces transition-duration to 0.01ms !important on every element, which engines serialise as either "0.01ms" or "0.00001s". Both slipped past the strict matcher, so mobile_shell, confirm_dialog, bottom_sheet, grove_drawer, and message reactions all sat waiting for a transitionend that never fires under reduced-motion, hanging the UI.

Replace the string-equality check with parse_duration_seconds (parses both s and ms suffixes) and accept anything ≤ 1ms (epsilon) as zero. Unparseable input stays conservative (not zero). Sibling-of-closed audit follow-up to #496 (8d89f18).

Approach A (parse-and-compare) chosen over B (hardcode the two known strings) because reduced-motion is the authoritative contract: any sub-millisecond duration is indistinguishable from "no transition" for transitionend purposes, so a numeric threshold is the durable fix.

Tests added (native, no DOM):
- parse_duration_seconds_handles_units
- parse_duration_seconds_rejects_malformed
- is_zero_duration_str_recognises_explicit_zero
- is_zero_duration_str_recognises_reduced_motion_override
- is_zero_duration_str_treats_sub_millisecond_as_zero
- is_zero_duration_str_rejects_real_durations
- is_zero_duration_str_multi_value_all_zero
- is_zero_duration_str_multi_value_mixed_is_not_zero
- is_zero_duration_str_unparseable_is_not_zero

Gates: fmt clean, clippy native + wasm32 clean (-D warnings), 86 willow-web lib tests pass, wasm32 --tests check clean (wasm-pack / geckodriver not available in env, so cargo check --target wasm32-unknown-unknown --tests served as the fallback gate per CLAUDE.md).

Refs #515 https://claude.ai/code/session_019HhgeDZ5HCbEUygRRLCjde
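The parse-and-compare approach can be sketched roughly as follows. The function names match the commit message, but the bodies are an assumption about how such a parser might look, not the code that landed; the constant name `ZERO_EPSILON_SECONDS` and the handling of comma-separated lists are illustrative.

```rust
/// Durations at or below this many seconds (1ms) count as "no transition".
const ZERO_EPSILON_SECONDS: f64 = 0.001;

/// Parses one CSS time value such as "0.01ms" or "0.2s" into seconds.
/// Returns `None` for anything that is not a finite, non-negative time.
pub fn parse_duration_seconds(value: &str) -> Option<f64> {
    let v = value.trim();
    // Check "ms" first: every "ms" value also ends in "s".
    let (number, scale) = if let Some(n) = v.strip_suffix("ms") {
        (n, 0.001)
    } else if let Some(n) = v.strip_suffix('s') {
        (n, 1.0)
    } else {
        return None;
    };
    let parsed: f64 = number.trim().parse().ok()?;
    if parsed.is_finite() && parsed >= 0.0 {
        Some(parsed * scale)
    } else {
        None
    }
}

/// True when every comma-separated duration is at most the epsilon.
/// An empty string is treated as zero; unparseable input is not zero.
pub fn is_zero_duration_str(value: &str) -> bool {
    let v = value.trim();
    if v.is_empty() {
        return true;
    }
    v.split(',')
        .all(|part| matches!(parse_duration_seconds(part), Some(s) if s <= ZERO_EPSILON_SECONDS))
}
```

Because the predicate works on strings, it can be exercised in native unit tests without a DOM, which is the property the commit relies on.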

Audit F47 (#547): `evict_stale_removes_expired` slept 10ms after a 1ms TTL — flake risk on slow CI per `condition-based-waiting`. Fix per audit suggestion: inject a clock. Added `pub(crate) evict_stale_at(&mut self, now: Instant)`; production `evict_stale()` delegates with `Instant::now()`. Test now feeds a synthetic future `now`, removing the timing dependency entirely. Picked clock-injection (A) over a clock trait (B, too heavy for one test) and over cutoff-injection (C, equivalent but maps less directly to "inject a clock" in the audit). #545 (Instant::now wasm linkage) is a separate ticket — not addressed here. Refs #547
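A minimal sketch of the clock-injection shape, assuming a cache that stores one insertion `Instant` per key. The `WorkerCache` fields, `new`, `insert_at`, and `len` are hypothetical; only `evict_stale` and `evict_stale_at(&mut self, now: Instant)` are named in the commit.

```rust
use std::collections::HashMap;
use std::time::{Duration, Instant};

/// Hypothetical cache layout for illustration: key -> insertion time.
pub struct WorkerCache {
    ttl: Duration,
    entries: HashMap<String, Instant>,
}

impl WorkerCache {
    pub fn new(ttl: Duration) -> Self {
        Self { ttl, entries: HashMap::new() }
    }

    /// Hypothetical helper so the sketch can control insertion times.
    pub fn insert_at(&mut self, key: &str, at: Instant) {
        self.entries.insert(key.to_string(), at);
    }

    pub fn len(&self) -> usize {
        self.entries.len()
    }

    /// Production entry point: delegates with the real clock.
    pub fn evict_stale(&mut self) {
        self.evict_stale_at(Instant::now());
    }

    /// Testable core: the caller supplies `now`, so no sleeping is needed.
    pub(crate) fn evict_stale_at(&mut self, now: Instant) {
        let ttl = self.ttl;
        self.entries
            .retain(|_, inserted| now.saturating_duration_since(*inserted) <= ttl);
    }
}
```

A test can then pass `start + Duration::from_secs(60)` as `now` and assert on the result immediately, which removes the dependence on scheduler timing.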

Match surrounding let-Ok-else style: log + return instead of unwrap, so non-window contexts (e.g. worker harness) do not panic. Refs #543
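The let-else pattern can be illustrated without a browser. In this sketch `Window` is a hypothetical stand-in for `web_sys::Window`, the `Option` argument plays the role of the `web_sys::window()` result, and pushing to `log` stands in for `tracing::error!`; `on_user_gesture` is an illustrative name.

```rust
/// Hypothetical stand-in for `web_sys::Window`, so the sketch compiles natively.
pub struct Window;

/// Returns `true` if the handler got far enough to use the window.
pub fn on_user_gesture(window: Option<Window>, log: &mut Vec<String>) -> bool {
    // An `unwrap()` here would panic in a context that has no window.
    let Some(_window) = window else {
        log.push("getUserMedia: no window available".to_string());
        return false;
    };
    // The real handler would go on to request media at this point.
    true
}
```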

No production or test code in willow-identity uses tokio. `grep -rn "tokio::\|#\[tokio::" crates/identity/src` returns only a doc-comment reference at lib.rs:99. Lib crates must stay tokio-free per CLAUDE.md; dev-dep was dead weight. Refs #537

Replaces a blind 3s sleep before curl with a 15-attempt loop (2s backoff). The relay can take longer than 3s to bind after a restart, and combined with StrictHostKeyChecking=no a slow bind could mask a silent deploy failure as a passing job. Refs #531
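The actual change is a shell curl loop in the deploy workflow; the same bounded-poll logic is restated here in Rust purely as an illustration. `poll_until` and its parameters are hypothetical names, and the probe closure stands in for the curl call.

```rust
use std::thread::sleep;
use std::time::Duration;

/// Calls `probe` up to `attempts` times, sleeping `delay` between failed tries.
/// Returns the 1-based attempt that succeeded, or `None` if all attempts failed.
pub fn poll_until(
    attempts: u32,
    delay: Duration,
    mut probe: impl FnMut() -> bool,
) -> Option<u32> {
    for attempt in 1..=attempts {
        if probe() {
            return Some(attempt);
        }
        // No point sleeping after the final failure.
        if attempt < attempts {
            sleep(delay);
        }
    }
    None
}
```

With `attempts = 15` and `delay = Duration::from_secs(2)` the worst case is roughly 30s, as in the commit, while a service that binds quickly passes on the first or second probe instead of always waiting a fixed interval.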

Permission enum (crates/state/src/event.rs:46-69) defines exactly: SyncProvider, ManageChannels, ManageRoles, SendMessages, CreateInvite (plus __UnknownLegacy sentinel). Admin status and member kicks live on the ProposedAction + vote path by design — see docs/specs/2026-04-01-per-author-merkle-dag-state-design.md:295 ("the Permission enum does not contain Administrator"). The type system enforces the governance path; granting admin via GrantPermission is structurally impossible. CLAUDE.md and the agentic-peer-api spec listed KickMembers and Administrator as valid permission values — pure doc drift. Realign both with the enum and clarify trust_peer/untrust_peer route through the vote path, not GrantPermission. Refs #521
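To make the drift concrete, here is a sketch of an enum with the five variants listed above (the `__UnknownLegacy` sentinel is omitted) and a hypothetical `parse_permission` helper. The helper and the idea of parsing from the variant name are assumptions for illustration; the real crate may validate permission strings differently.

```rust
/// Variants as listed in the commit message; the legacy sentinel is left out.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Permission {
    SyncProvider,
    ManageChannels,
    ManageRoles,
    SendMessages,
    CreateInvite,
}

/// Hypothetical helper: maps a documented permission name to a variant.
pub fn parse_permission(name: &str) -> Option<Permission> {
    match name {
        "SyncProvider" => Some(Permission::SyncProvider),
        "ManageChannels" => Some(Permission::ManageChannels),
        "ManageRoles" => Some(Permission::ManageRoles),
        "SendMessages" => Some(Permission::SendMessages),
        "CreateInvite" => Some(Permission::CreateInvite),
        // "KickMembers" and "Administrator" have no variant, so they fall through.
        _ => None,
    }
}
```

Under this sketch the two names the docs used to list simply fail to parse, which is why documenting them as valid values was misleading.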

Same-day audit runs yield ~zero already-fixed sweep hits — note explicitly so future runs skip the deeper search. Document the ambiguous-fix-path pattern (audit premise real, fix is a design call w/ ≥2 valid approaches): skip from dispatch queue without closing, surface in Lessons Learned. Surfaced this run by #535 (Arc<Mutex> in fsm.rs is already in mod tests; audit's "move to mod tests" prescription doesn't apply — fix is either external test file or glob update, design call).

claude added 11 commits May 3, 2026 00:28

chore: open auto-fix batch claude/friendly-maxwell-UlJEd

18d059b

fix(scripts): use npm ci in setup-e2e

8089a62

fix(web): handle missing window in getUserMedia path

4a19bc6

chore(identity): drop unused tokio dev-dep

2363f99

intendednull merged commit 150df97 into main May 3, 2026
8 checks passed

intendednull deleted the claude/friendly-maxwell-UlJEd branch May 3, 2026 08:55

intendednull mentioned this pull request May 3, 2026

auto-fix batch claude/friendly-maxwell-M5xB6 2026-05-03 #566

Merged

6 tasks

intendednull merged 11 commits into main from claude/friendly-maxwell-UlJEd
