Add web4-governance plugin for AI governance with R6 workflow by dp-web4 · Pull Request #20448 · anthropics/claude-code

dp-web4 · 2026-01-23T20:23:44Z

Web4 Governance Plugin for Claude Code
Lightweight AI governance with T3 trust tensors, entity witnessing, and R6 audit trails.

Note: "web4" = trust-native internet infrastructure for the AI agent era (cryptographic provenance, verifiable
accountability). Generic descriptor, not a trademark claim.

R6 = Rules + Role + Request + Reference + Resource → Result (structured audit record format)

Features
-Entity Trust - T3/V3 tensors (6D each) for MCP servers, agents, references
-Witnessing - Bidirectional trust flow through observation
-R6 Workflow - Formal intent→action→result with hash-linked provenance
-Rust Backend - (auto Python fallback)
-Trust Decay - Unused entities decay toward neutral over time

Components
governance/ - Trust tensors, witnessing, R6 ledger, session management
hooks/ - session_start, pre/post_tool_use, heartbeat
web4-trust-core/ - Rust crate with PyO3 + WASM bindings

Test Plan
Entity trust + witnessing (12 tests passing)
Rust backend verification + Python fallback
Real session integration
See README.md for full documentation.

dp-web4 · 2026-01-31T22:06:55Z

Comment for PR #20448

Clarification: Scope, Foundations, and Positioning

Thanks to everyone reviewing this PR. Based on feedback from external reviewers, I wanted to clarify a few points about what this plugin is (and isn't), and where it fits in the broader landscape.

What This Is

The core contribution isn't any single element (audit logs, policy gates, trust metrics), but the combination of:

Pre-action gating (not just after-the-fact logging)
Hash-linked provenance (tamper-evident audit chain)
Structured intent capture (R6 workflow formalism)

...implemented as a developer-portable, hook-based plugin rather than a platform-locked or enterprise-only system.

What This Isn't

To be explicit about scope:

This doesn't make agents "safe" or "correct" — only inspectable, accountable, and governable
T3 trust tensors are operational heuristics for permissioning, not epistemic confidence or alignment signals
Completeness is bounded by the host's hook surface — we can only govern what the hooks expose

We're building governance infrastructure, not claiming to solve alignment.

Foundational Research

This plugin implements concepts from the Web4 trust-native architecture. For deeper context on trust tensors, entity witnessing, coherence metrics, and the broader theoretical framework, see:

Web4 Whitepaper: https://dp-web4.github.io/web4/

The whitepaper covers:

Linked Context Tokens (LCT) — unforgeable entity identity
T3/V3 Trust and Value Tensors — multi-dimensional trust mechanics
R6 Workflow Formalism — structured intent capture
Markov Relevancy Horizons (MRH) — context boundaries
ATP/ADP Economics — attention allocation

How This Fits the Big Picture

Web4 Architecture provides the theoretical foundation — trust-native societies for humans and AI.

Governance Tiers define implementation depth:

Tier	Name	Capabilities
1	Observational	R6 audit, hash chain, soft LCT
1.5	Policy	Rules, presets, rate limiting ← This PR
2	Authorization	Full T3/ATP, hardware LCT
3	Training	Meta-cognitive, developmental

Runtime Implementations demonstrate portability:

Runtime	Implementation
Claude Code	This plugin (`hooks/`) ← This PR
Moltbot	`extensions/web4-governance/`
Hardbound	Full Rust implementation (Tier 2)

Competitive Context

For reviewers familiar with the space:

Alternative	Comparison
Jackson et al. (policy engines)	Strong theory, less developer-portable
AWS Bedrock AgentCore	Similar gates, but AWS-native, not intent-aware
Enterprise audit tooling	Good logs, weak agent semantics

Our lane: lightweight, open, agent-native, intent-aware.

Summary

This is missing infrastructure, not speculative architecture. Happy to address specific questions or concerns.

Related: A parallel implementation exists for Moltbot using the same R6 framework, demonstrating portability across runtimes.

…1-4) Web4 governance plugin for Claude Code hooks — structured audit trails, trust tensors, entity witnessing, policy gating, and event streaming. Tiers: observational audit (T1), policy presets and rate limiting (T1.5), signing and persistent witnesses (T2), multi-target extraction (T3), event stream monitoring (T4). See plugins/web4-governance/README.md for full documentation. PR: anthropics#20448 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

A corrupt session file (e.g. invalid control characters from an interrupted write) caused json.load() to raise JSONDecodeError on every subsequent tool call. Claude Code reported each as 'PreToolUse:Bash hook error / Failed with non-blocking status code: Traceback...' — noisy and wasted tokens. Patch: catch JSONDecodeError + OSError, rename the bad file to *.json.corrupt for forensics, fall through to lazy session re-init. Future corruptions self-heal in one tool call instead of polluting every subsequent invocation. Also removed the deprecated 'warn-git-push-no-pat' rule from ~/.web4/policies/ — PAT auth is deprecated, all dp-web4 remotes are SSH. The warning was firing on every git push command; signal-to-noise was zero. Kept the other 5 safety rules intact. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

…bust too Earlier patch only covered pre_tool_use.py's load path. Two more leaks: 1. post_tool_use.py had the same unprotected json.load(session_file) -> crashed and reported 'PostToolUse:Bash hook error' on every tool call when the session file was bad. 2. Both hooks did non-atomic 'with open(f, w)' + json.dump. If hook A is reading while hook B is writing, the reader can see a half-written file -> JSONDecodeError -> bad-file quarantine. THIS WAS THE ACTUAL ROOT CAUSE OF THE ORIGINAL CORRUPTION. Fixing the read path stopped the crash but the corruption-on-write race kept generating new bad files. Patch: add try/except (json.JSONDecodeError, OSError) -> quarantine to post_tool_use.load_session. Make all session writes atomic (write .tmp, os.replace -> session.json) in both hooks. The race is closed at the source, and any pre-existing corrupt files self-heal via quarantine. Quarantined two more pre-existing corrupt session files manually. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

When a Web4 host LCT exists at ~/.web4/{hostname}/lct.json (bootstrapped via web4_fleet_bootstrap.py), each session records a host_lct_witness entry on session-start. This is the reverse direction of the fleet bootstrap's own scan — the bootstrap records sibling identity systems present on the host; sessions record the host LCT they started under. Bidirectional witness graph for multi-factor identity. Witness != vouch. The session is recording observation, not endorsement. Cross-system convergence is the trust signal; divergence between sessions on the same host is diagnostic. Changes: - WitnessRecord gains optional host_lct_fingerprint + salience_axis fields. Old records load without them (backward-compat). - PolicyRegistry.witness_host_lct(session_id, host_lct_id, fingerprint, salience_axis) — new method. Persists host_lct_witness records to ~/.web4/witnesses.jsonl alongside the existing session_witness and decision_witness types. - session_start hook discovers ~/.web4/{hostname}/lct.json. If found, calls witness_host_lct and embeds {lct_id, fingerprint, machine, entity_type, observed_at} into the session JSON's host_lct_witness field. If not found (most installs), the field is None and sessions proceed normally. Salience-aware fingerprinting: the host LCT fingerprint is computed salience-side by web4_fleet_bootstrap.py over the host LCT's identity-stable fields. Routine ticks don't drift; only real identity changes do. Each witness record carries salience_axis documenting what the fingerprint hashes over, so the witness graph is self-describing. Tests: 27/27 pass (25 existing + 2 new — test_witness_host_lct and test_witness_host_lct_multiple_sessions). Validated end-to-end against CBP's real host LCT 83810b44-2289-4c14-854f-ae5114f747cf. Plugin manifest 1.0.0 → 1.1.0 (additive feature, semver minor bump). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

- Author email normalized to dp@metalinxx.io (single canonical contact) - README: replace "no external dependencies / no network calls" with the accurate version (cryptography for Ed25519 signing; opt-in git fetch during git-push divergence checks) - Add requirements.txt declaring cryptography>=41.0 (plugin still gracefully degrades if missing, but the dependency is now explicit) - Remove meta-process markdown files that don't belong in the plugin: - web4-governance-issue.md (was the original issue body, in repo root) - plugins/web4-governance/PR_DESCRIPTION.md - plugins/web4-governance/FEATURE_REQUEST.md - plugins/web4-governance/HOOK_STDERR_NOTE.md - Plugin documentation files retained: README, EVENT_STREAM_API, PRESETS, docs/RUST_CORE_PROPOSAL — those are real reference docs Test status: 91 passed, 0 failed against fresh ledger.db (pre-existing schema drift in long-lived local ~/.web4/ledger.db can cause 4 failures on legacy installs; not affected by this cleanup). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

dp-web4 · 2026-04-30T00:58:52Z

Refresh: rebased + cleaned for review

Rebased against current main and cleaned up for review. The diff is now plugin-only — 44 files, 13,064 additions, 0 deletions. The earlier 15K-line / 65-file appearance was stale-base artifact from upstream workflow refactors that landed since this PR was opened in January; that's resolved by the rebase.

Cleanup in 36b8983:

Author email normalized to a single canonical contact
README dependency claim corrected (Ed25519 signing requires cryptography; git fetch runs during opt-in git-push divergence checks — both now stated honestly)
requirements.txt added (plugin still gracefully degrades without cryptography, but the dependency is now explicit)
Meta-process files removed from the plugin (PR_DESCRIPTION.md, FEATURE_REQUEST.md, HOOK_STDERR_NOTE.md) and from the repo root (web4-governance-issue.md)
Real plugin documentation retained (README.md, EVENT_STREAM_API.md, PRESETS.md, docs/RUST_CORE_PROPOSAL.md)

Test status: 91 passed, 0 failed against a fresh ~/.web4/ledger.db.

Two contextual data points worth reading alongside this PR

1. ARC-AGI-3 result. Same Claude Opus 4.6 anyone can rent today, given a Web4-style governance frame (identity + scoped authority + real-time policy evaluation + cryptographic audit), scored 94.85% on ARC-AGI-3 — a benchmark where the same model in default context scores 0%. Public scorecard. No fine-tuning, no weight changes. The structure made the difference. This PR is the developer-portable surface of that same structure — the part that lets any Claude Code session capture R6 records and apply policy gates without hardware bindings.

2. Framing relative to the Microsoft Agent Governance Toolkit (April 2026). That toolkit is runtime policy enforcement — governance for what agents do. This PR is the upstream identity and accountability ontology — governance for what agents are. They are complementary, not competing. A runtime governance toolkit consumes an identity ontology underneath; that's the layer this PR contributes to Claude Code specifically.

Happy to decompose this into smaller PRs by capability (R6 audit logging, policy hooks, hash-chain provenance, MCP witnessing) if that helps reviewability — say the word and I'll split it.

dp-web4 force-pushed the add-web4-governance-plugin branch 4 times, most recently from 7a344de to 1408cf8 Compare January 24, 2026 08:56

dp-web4 marked this pull request as draft January 24, 2026 16:09

dp-web4 marked this pull request as ready for review January 24, 2026 17:58

dp-web4 mentioned this pull request Jan 29, 2026

[FEATURE] Governance infrastructure for AI agent accountability #21794

Closed

2 tasks

dp-web4 force-pushed the add-web4-governance-plugin branch from b0e3d68 to b590d13 Compare February 8, 2026 17:50

github-actions Bot mentioned this pull request Feb 26, 2026

📊 AI CLI 工具社区动态日报 2026-02-26 duanyytop/agents-radar#11

Closed

github-actions Bot mentioned this pull request Mar 26, 2026

📊 AI CLI 工具社区动态日报 2026-03-26 gsscsd/big_model_radar#96

Open

dp-web4 force-pushed the add-web4-governance-plugin branch from 5709a7c to 8fb33f6 Compare March 26, 2026 18:27

github-actions Bot mentioned this pull request Apr 18, 2026

📊 AI CLI 工具社区动态日报 2026-04-18 gsscsd/big_model_radar#203

Open

dp-web4 and others added 5 commits April 29, 2026 17:55

dp-web4 force-pushed the add-web4-governance-plugin branch from 6e41d8a to 36b8983 Compare April 30, 2026 00:58

This was referenced Apr 30, 2026

📊 AI CLI 工具社区动态日报 2026-04-30 gsscsd/big_model_radar#269

Open

📊 AI CLI 工具社区动态日报 2026-04-30 borq168/big_model_radar#79

Open

📊 AI CLI Tools Digest 2026-04-30 borq168/big_model_radar#82

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add web4-governance plugin for AI governance with R6 workflow#20448

Add web4-governance plugin for AI governance with R6 workflow#20448
dp-web4 wants to merge 5 commits intoanthropics:mainfrom
dp-web4:add-web4-governance-plugin

dp-web4 commented Jan 23, 2026 •

edited

Loading

Uh oh!

dp-web4 commented Jan 31, 2026

Uh oh!

dp-web4 commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

dp-web4 commented Jan 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dp-web4 commented Jan 31, 2026

Comment for PR #20448

Clarification: Scope, Foundations, and Positioning

What This Is

What This Isn't

Foundational Research

How This Fits the Big Picture

Competitive Context

Summary

Uh oh!

dp-web4 commented Apr 30, 2026

Refresh: rebased + cleaned for review

Two contextual data points worth reading alongside this PR

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

dp-web4 commented Jan 23, 2026 •

edited

Loading