
# AZIMUTH

A Claude Code skill that pressure-tests decisions before you commit to them.

Run `/azimuth [your decision]` before you greenlight the rewrite, the hire, the launch, or the bet.



## The Problem

Plans look fine until they don't. The risks that sink projects are usually the ones nobody questioned — the assumption holding everything together, the dependency nobody secured, the failure mode that's common in decisions like this one but invisible from inside it.

AZIMUTH runs the structured pressure-test before you're committed.


## What you get

- **A verdict with a rationale.** Not just "risky" — a specific recommendation: proceed, pilot first, reduce scope, delay, or reject, with the structural reason why.
- **Assumption audit.** Every assumption classified as strong, partial, unsupported, or contradicted — plus a falsifier for each: the specific observable evidence that would prove it wrong.
- **Failure path analysis.** The most plausible ways this fails, traced trigger → cascade → business cost. Pair-interaction analysis shows where two risks together produce a worse outcome than either alone.
- **Incentive scan.** Who proposed this, who benefits, who absorbs the downside if it fails, and whether dissent was heard. Structured into the verdict — not an afterthought.
- **Dependency fragility map.** What's a single point of failure, what's secured vs. assumed, and what the lead time is to replace what isn't.
- **Structural mitigations only.** Generic advice is rejected. "Communicate better" and "monitor closely" don't appear in the output.
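To make the assumption-audit deliverable concrete, here is a minimal sketch of its shape as a data structure. The class and field names are hypothetical illustrations, not part of the skill's actual output format; only the four support levels and the falsifier concept come from the description above.

```python
from dataclasses import dataclass
from enum import Enum

class Support(Enum):
    """The four classification levels the assumption audit assigns."""
    STRONG = "strong"
    PARTIAL = "partial"
    UNSUPPORTED = "unsupported"
    CONTRADICTED = "contradicted"

@dataclass
class Assumption:
    claim: str        # the assumption as stated in the plan
    support: Support  # how well the available evidence backs it
    falsifier: str    # observable evidence that would prove it wrong

# A hypothetical audit with one entry, echoing the billing-rewrite example.
audit = [
    Assumption(
        claim="One engineer fully understands the billing edge cases",
        support=Support.UNSUPPORTED,
        falsifier="A second engineer reproduces edge-case behavior within 2 weeks",
    ),
]

# Unsupported or contradicted assumptions are the ones to validate first.
at_risk = [a for a in audit if a.support in (Support.UNSUPPORTED, Support.CONTRADICTED)]
```

The point of the structure: every classification travels with its falsifier, so "validate this assumption" always has a concrete, observable test attached.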


## Quick start

```bash
npx skills add https://github.com/MrBinnacle/azimuth
```

Then invoke on any decision:

```
/azimuth We're planning to rewrite the legacy billing service in Q3
/azimuth Should we make this hire?
/azimuth We're launching next week — is the plan sound?
/azimuth Build vs. buy vs. partner for this capability?
/azimuth Pressure test our Q3 timeline
```

## How to know it's working

- Assumptions you treated as given are being validated, not accepted
- The verdict surprises you — or confirms what you suspected but couldn't articulate
- The failure paths describe something that has actually happened in similar decisions
- The incentive scan names a conflict you hadn't explicitly surfaced
- You change the plan before committing, not after

## Example output

**Legacy billing rewrite — PILOT FIRST verdict**

Input: "We're planning to rewrite our legacy billing service in Q3. 8 weeks, 2 engineers."

Output (abbreviated):

```
## Azimuth Verdict
High-risk. Scope and timeline are inconsistent with known base rates for legacy rewrites.
Do not proceed without scope reduction and a validated rollback strategy.

## Recommended Decision
PILOT FIRST — Rewrite one isolated billing module with full rollback. Validate
assumptions about coupling before committing full scope.

## Confidence Level
High — base rate for legacy rewrites exceeding estimate is well-documented.
No evidence present that shifts this.

## Critical Risks
1. Integration tax — Parallel-running old and new systems historically extends to 3–5×
   estimated cutover time. No hard deprecation date defined.
2. Knowledge concentration — Single-person domain knowledge on billing logic creates
   SPOF. No fallback owner named.
3. Scope creep under deadline — "While we're at it" rewrites reliably overload scope.
   No change control mechanism defined.

## Falsifiers
- Knowledge concentration: A second engineer can document and reproduce billing edge
  case behavior independently within 2 weeks — or the SPOF is real.
- Timeline estimate: Comparable module rewrite completed within 2 weeks in a spike —
  or the 8-week total is unsupportable.

## Interaction Effects
- Integration tax + deadline pressure: When cutover extends beyond week 6, the
  remaining 2 weeks compress QA and rollback validation simultaneously — neither
  gets adequate time, and the failure modes compound rather than queue.

## Likely Failure Paths
- Billing edge cases surface in testing → scope expands → 8 weeks becomes 20 →
  old system maintenance + new system debt → both teams overloaded → defects in prod
```

## Domains

Works on any initiative-level decision with real downside: product launches, service rewrites, key hires, partnerships and M&A, build vs. buy decisions, org changes, PE secondaries, and timeline commitments. Domain-specific templates load automatically.


## Verdicts

**Full verdict taxonomy**

| Verdict | When it fires |
|---|---|
| PROCEED | Evidence supports moving forward; risks are manageable |
| PROCEED WITH SAFEGUARDS | Proceed only if specific structural changes are made |
| PILOT FIRST | Validate the highest-risk assumption before committing full scope |
| REDUCE SCOPE | Current scope is not supportable; a smaller version may be |
| DELAY PENDING EVIDENCE | Decision is premature; specific information is needed |
| REJECT | Evidence or structure does not support proceeding |
| INSUFFICIENT SIGNAL | Input is too sparse, vague, or contradictory to ground analysis |
| WRONG TOOL | Input is not a pre-commitment decision question |
| RESIDUAL-RISK-REGISTER | Decision already made — produces a forward-looking residual risk register (leading indicators, escalation triggers, owners), not a go/no-go verdict |

Three verdict categories: Action verdicts (PROCEED through REJECT) are go/no-go positions. Refusal verdicts (INSUFFICIENT SIGNAL, WRONG TOOL) mean analysis cannot be grounded in the input. RESIDUAL-RISK-REGISTER is an alternative-deliverable verdict — it produces useful analysis for a closed decision, not a refusal.
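The three-way split can be sketched as a small lookup. This is an illustrative model, not code shipped with the skill; the verdict strings come from the taxonomy above, while the function and set names are hypothetical.

```python
# Verdict names from the taxonomy, grouped into the three categories.
ACTION = {"PROCEED", "PROCEED WITH SAFEGUARDS", "PILOT FIRST",
          "REDUCE SCOPE", "DELAY PENDING EVIDENCE", "REJECT"}
REFUSAL = {"INSUFFICIENT SIGNAL", "WRONG TOOL"}
ALTERNATIVE_DELIVERABLE = {"RESIDUAL-RISK-REGISTER"}

def verdict_category(verdict: str) -> str:
    """Classify a verdict as action, refusal, or alternative-deliverable."""
    if verdict in ACTION:
        return "action"
    if verdict in REFUSAL:
        return "refusal"
    if verdict in ALTERNATIVE_DELIVERABLE:
        return "alternative-deliverable"
    raise ValueError(f"unknown verdict: {verdict}")
```

The distinction matters downstream: an action verdict is a position you can act on, a refusal means the input needs rework before any analysis is valid, and the alternative deliverable still produces usable output even though the go/no-go question is moot.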


## What's inside

File tree:

```text
azimuth/
├── SKILL.md                              # Core skill — intake routing + 10-module analysis engine
├── gotchas.md                            # 8 structural failure patterns that evade standard checklists
├── references/
│   ├── base-rates.md                     # Historical failure rates: software, startups, launches, hiring, M&A, org change
│   ├── startup-failures.md               # 8 startup-specific failure patterns with diagnostic questions
│   ├── software-failure-patterns.md      # 10 engineering failure patterns with azimuth questions
│   ├── launch-risks.md                   # Pre/during/post launch risk zones with signal and mitigation
│   ├── ma-partnership-patterns.md        # 8 M&A and partnership failure patterns with diagnostic questions
│   └── org-change-patterns.md            # 6 org change and restructure failure patterns
├── diagnostics/
│   ├── assumption-audit.md               # 5-step: extract → classify → risk-score → validate → gate
│   ├── dependency-map.md                 # Full inventory, assessment matrix, concentration risk
│   ├── incentive-conflicts.md            # 7 conflict categories, severity classification
│   └── fragility-scan.md                 # 6 structural fragility indicators → LOW/MEDIUM/HIGH/CRITICAL
├── templates/
│   ├── executive-azimuth.md              # 1-page format for leadership briefings
│   ├── codebase-azimuth.md               # Refactor/migration/rewrite template
│   ├── product-launch-azimuth.md         # Launch readiness gate matrix + rollback protocol
│   ├── hiring-azimuth.md                 # Role definition audit + candidate failure path analysis
│   ├── partnership-azimuth.md            # M&A, acquisitions, strategic partnerships, vendor relationships
│   ├── secondaries-ic-azimuth.md         # PE secondaries IC recommendation template
│   ├── org-change-azimuth.md             # Restructure, consolidation, role elimination, leadership transition
│   └── build-buy-partner-azimuth.md      # Path selection: build vs. buy vs. partner with domain handoff
└── examples/
    ├── case-study-healthcare-gov.md      # Healthcare.gov DEEP mode run — 5/6 recall, 0 false positives
    └── case-study-open-source-launch-timing.md  # STANDARD mode — solo dev timing a repo launch during job search
```

## Limitations

AZIMUTH stress-tests the decision as framed. It cannot tell you whether the framing is correct. In long sessions where prior conversation history is large (above ~150K–177K tokens), SKILL.md may load incompletely and some analysis hooks may not fire. Fresh sessions and short sessions are unaffected.


## Contributing

Issues and PRs welcome. Priority areas: additional domain templates, base rate data improvements with primary source citations, and domain-specific gotchas grounded in documented failure cases.


## License

MIT