fix(sandbox): TAP-668 — HEALTHCHECK readability + HOME env for ralph user by wtthornton · Pull Request #3 · wtthornton/ralph-claude-code

wtthornton · 2026-05-02T22:31:31Z

Summary

Adds ENV HOME=/home/ralph so npm/gh/claude can write state dirs as the dropped user.
HEALTHCHECK now uses test -r (not test -f) and emits a stderr cause line — docker inspect shows the actual reason instead of opaque exit 1.
Comment block on bind-mount UID alignment requirement.

Test plan

docker build -f Dockerfile.sandbox . succeeds
docker inspect --format='{{json .State.Health}}' includes the new error message on a deliberately broken mount

🤖 Generated with Claude Code

…user Three concrete bugs in Dockerfile.sandbox: 1. ENV HOME=/home/ralph is not auto-exported when USER drops to ralph. npm/gh/claude config writes silently failed under the dropped user because $HOME defaulted to "/". 2. HEALTHCHECK used `test -f` which doesn't distinguish missing-file from permission-denied. Switched to `test -r` (readable). On a bind-mounted .ralph/ owned by a different host UID, the old check reported "unhealthy" with no actionable cause. 3. Failure now emits a stderr line including the file path and current user, so `docker inspect --format='{{json .State.Health}}'` shows the cause instead of an opaque exit 1. Also documented the bind-mount UID alignment requirement in a comment above WORKDIR. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* test+ci: enforce hook+workflow invariants instead of brittle counts Five tests were exposed once the bats count mismatch was fixed (PR #8). Three of the five were brittle assertions that locked out legitimate plugin ecosystems; two were a real workflow gap. Long-term right fix is to encode the actual invariants and close the gap, not silence the tests. Workflow gap fixed: - .github/workflows/codeql-analysis.yml now pins `defaults.run.shell: bash` per the TAP-667 standard. Previously the only hand-authored workflow without this. Two tests (#3, #6) were really one root cause. Test invariants tightened (instead of "exactly N" counts): - HOOKS-2: `bash <path>` commands must reference EITHER .ralph/hooks/ OR .claude/hooks/. The original test rejected .claude/hooks/ entries and broke as soon as tapps-mcp registered hooks there. - "all hook commands start with bash": now also accepts the bare .claude/hooks/<name>.sh form that tapps-mcp emits when registering Linear MCP governance hooks. Catches garbage paths / tool names (Write/Edit) without policing plugin command-emission style. - "PreToolUse has exactly two entries": rewritten to verify Ralph's two safety hooks (Bash → validate-command.sh, Edit|Write → protect-ralph-files.sh) are present AND wired to the right scripts. Plugin-injected entries are allowed; what's protected is removal or rewiring of Ralph's own defenses. Local: 1455/1455 unit tests passing, no warnings. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * test(integration): replace dead ALLOWED_TOOLS asserts with negative invariant Two integration tests asserted that setup.sh-generated .ralphrc shipped with `ALLOWED_TOOLS="..."` containing Bash(npm *)/Bash(pytest), but ADR-0006 deleted the legacy `-p` mode and the ALLOWED_TOOLS allowlist along with it — tool surface now lives in .claude/agents/ralph.md (`tools:` allowlist + `disallowedTools:` blocklist). Replaced both with a single negative assertion: if a future change re-introduces `ALLOWED_TOOLS=` to .ralphrc, this test fires so we don't silently split the tool-surface contract across two files again. The positive invariant (tool surface defined in agent file) is already covered by HOOKS-6 in tests/unit/test_hooks.bats. Local: full integration suite (203 tests) passing. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * test(evals): split .ralphrc protection into create-allowed vs edit-blocked The eval test "FILE PROTECTION: blocks edit to .ralphrc" asserted the hook returns exit 2 when no .ralphrc exists in the test fixture dir. But the hook's actual contract (HOOKS-5 in tests/unit/test_hooks.bats) is: ALLOW creating a new .ralphrc when absent, BLOCK editing once it exists. The test was asserting the wrong half of the contract — it went red the moment the eval suite started running end-to-end (post PR #8 / #9 fixes that unmasked the eval step). Split into two tests that match the real invariant: 1. Edit on EXISTING .ralphrc → blocked (touch then test) 2. Create on ABSENT .ralphrc → allowed (HOOKS-5 already covers this for the hook script directly; this is the eval-level mirror) Local: 69/69 deterministic evals passing. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

wtthornton merged commit 64df537 into main May 3, 2026
2 checks passed

wtthornton deleted the tap-668-dockerfile-sandbox-healthcheck branch May 3, 2026 00:01

wtthornton mentioned this pull request May 3, 2026

test+ci: enforce hook+workflow invariants instead of brittle counts #9

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(sandbox): TAP-668 — HEALTHCHECK readability + HOME env for ralph user#3

fix(sandbox): TAP-668 — HEALTHCHECK readability + HOME env for ralph user#3
wtthornton merged 1 commit into
mainfrom
tap-668-dockerfile-sandbox-healthcheck

wtthornton commented May 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

wtthornton commented May 2, 2026

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant