Skip to content

feat(skills): align YAML frontmatter to agentskills.io spec + add eva…#5

Merged
jazz127 merged 1 commit intojazz127:devfrom
sytone:squad/skills-schema-alignment-v2
Apr 6, 2026
Merged

feat(skills): align YAML frontmatter to agentskills.io spec + add eva…#5
jazz127 merged 1 commit intojazz127:devfrom
sytone:squad/skills-schema-alignment-v2

Conversation

@jazz127
Copy link
Copy Markdown
Owner

@jazz127 jazz127 commented Apr 6, 2026

…l framework

Migrates all 34 skills across .squad/skills/, .copilot/skills/, and templates/skills/ to the agentskills.io specification schema:

  • name, description, license as top-level fields (spec required)
  • domain, confidence, source, triggers, roles, compatibility in metadata
  • tools arrays converted to allowed-tools strings
  • Skills without frontmatter get complete --- blocks added
  • Body content of all SKILL.md files unchanged (frontmatter only)

Adds three-phase eval framework for skill trigger quality testing:

  • Phase 1 (run-evals.mjs): keyword matching, 88.9% baseline, CI-ready
  • Phase 2 (run-llm-evals.mjs): LLM trigger + execution evals via Copilot
  • Phase 3 (optimize-description.mjs): iterative description optimization
  • Schema validator, 31 trigger fixtures, 10 execution fixtures
  • CONTRIBUTING.md, skill-review-checklist, eval README

Adds SPAN (Skill Curator) team member for skill quality gating.

Spec: https://agentskills.io/specification

What

Why

How


⚠️ Quick Check

  • If SDK/CLI source files changed: completed the applicable Changeset step below (npx changeset add / .changeset/*.md, direct CHANGELOG.md entry for maintainers, or skip-changelog label for no user-facing changes)

PR Readiness Checklist

The PR readiness bot will validate these automatically after push.
Check each item before requesting review. See CONTRIBUTING.md for full details.

Branch & Commit

  • Branch created from dev (not main)
  • Branch is up to date with dev (git fetch upstream && git rebase upstream/dev)
  • Verified diff contains only intended changes (git diff --cached --stat)
  • PR is not in draft mode (mark ready when checks pass)
  • Commit history is clean (squash fixups before review)

Build & Test

  • npm run build passes
  • npm test passes (all tests green)
  • npm run lint passes (type check clean)
  • npm run lint:eslint passes
  • For migration PRs (>20 files): include test output summary in PR description

Changeset

  • Changeset added via npx changeset add (if packages/squad-sdk/src/ or packages/squad-cli/src/ changed)
  • Or direct CHANGELOG.md entry (maintainers only — write-protected for external contributors)
  • Or skip-changelog label applied (if no user-facing changes)

Docs

  • README section updated (if new feature/module)
  • Docs feature page (if new user-facing capability)

Exports

  • package.json subpath exports updated (if new module)

Breaking Changes

Waivers

…l framework

Migrates all 34 skills across .squad/skills/, .copilot/skills/, and
templates/skills/ to the agentskills.io specification schema:
- name, description, license as top-level fields (spec required)
- domain, confidence, source, triggers, roles, compatibility in metadata
- tools arrays converted to allowed-tools strings
- Skills without frontmatter get complete --- blocks added
- Body content of all SKILL.md files unchanged (frontmatter only)

Adds three-phase eval framework for skill trigger quality testing:
- Phase 1 (run-evals.mjs): keyword matching, 88.9% baseline, CI-ready
- Phase 2 (run-llm-evals.mjs): LLM trigger + execution evals via Copilot
- Phase 3 (optimize-description.mjs): iterative description optimization
- Schema validator, 31 trigger fixtures, 10 execution fixtures
- CONTRIBUTING.md, skill-review-checklist, eval README

Adds SPAN (Skill Curator) team member for skill quality gating.

Spec: https://agentskills.io/specification

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
@github-actions
Copy link
Copy Markdown

github-actions Bot commented Apr 6, 2026

⚠️ Squad File Leakage Detected

The following .squad/ files were modified in this PR:

  • .squad/agents/span/charter.md
  • .squad/agents/span/history.md
  • .squad/casting/registry.json
  • .squad/routing.md
  • .squad/skill.md
  • .squad/skills/CONTRIBUTING.md
  • .squad/skills/cross-machine-coordination/SKILL.md
  • .squad/skills/cross-squad/SKILL.md
  • .squad/skills/economy-mode/SKILL.md
  • .squad/skills/evals/README.md
  • .squad/skills/evals/agent-collaboration.eval.yaml
  • .squad/skills/evals/agent-conduct.eval.yaml
  • .squad/skills/evals/architectural-proposals.eval.yaml
  • .squad/skills/evals/ci-validation-gates.eval.yaml
  • .squad/skills/evals/ci-validation-gates.exec-eval.yaml
  • .squad/skills/evals/cli-wiring.eval.yaml
  • .squad/skills/evals/client-compatibility.eval.yaml
  • .squad/skills/evals/cross-machine-coordination.eval.yaml
  • .squad/skills/evals/cross-squad.eval.yaml
  • .squad/skills/evals/distributed-mesh.eval.yaml
  • .squad/skills/evals/economy-mode.eval.yaml
  • .squad/skills/evals/economy-mode.exec-eval.yaml
  • .squad/skills/evals/external-comms.eval.yaml
  • .squad/skills/evals/external-comms.exec-eval.yaml
  • .squad/skills/evals/fact-checking.eval.yaml
  • .squad/skills/evals/gh-auth-isolation.eval.yaml
  • .squad/skills/evals/gh-auth-isolation.exec-eval.yaml
  • .squad/skills/evals/git-workflow.eval.yaml
  • .squad/skills/evals/git-workflow.exec-eval.yaml
  • .squad/skills/evals/github-multi-account.eval.yaml
  • .squad/skills/evals/history-hygiene.eval.yaml
  • .squad/skills/evals/humanizer.eval.yaml
  • .squad/skills/evals/init-mode.eval.yaml
  • .squad/skills/evals/model-selection.eval.yaml
  • .squad/skills/evals/model-selection.exec-eval.yaml
  • .squad/skills/evals/nap.eval.yaml
  • .squad/skills/evals/optimize-description.mjs
  • .squad/skills/evals/personal-squad.eval.yaml
  • .squad/skills/evals/pr-screenshots.eval.yaml
  • .squad/skills/evals/ralph-two-pass-scan.eval.yaml
  • .squad/skills/evals/release-process.eval.yaml
  • .squad/skills/evals/release-process.exec-eval.yaml
  • .squad/skills/evals/reskill.eval.yaml
  • .squad/skills/evals/reviewer-protocol.eval.yaml
  • .squad/skills/evals/reviewer-protocol.exec-eval.yaml
  • .squad/skills/evals/rework-rate.eval.yaml
  • .squad/skills/evals/run-evals.mjs
  • .squad/skills/evals/run-llm-evals.mjs
  • .squad/skills/evals/secret-handling.eval.yaml
  • .squad/skills/evals/secret-handling.exec-eval.yaml
  • .squad/skills/evals/session-recovery.eval.yaml
  • .squad/skills/evals/session-recovery.exec-eval.yaml
  • .squad/skills/evals/squad-conventions.eval.yaml
  • .squad/skills/evals/validate-schema.mjs
  • .squad/skills/evals/versioning-policy.eval.yaml
  • .squad/skills/external-comms/SKILL.md
  • .squad/skills/gh-auth-isolation/SKILL.md
  • .squad/skills/humanizer/SKILL.md
  • .squad/skills/model-selection/SKILL.md
  • .squad/skills/personal-squad/SKILL.md
  • .squad/skills/pr-screenshots/SKILL.md
  • .squad/skills/ralph-two-pass-scan/SKILL.md
  • .squad/skills/release-process/SKILL.md
  • .squad/skills/session-recovery/SKILL.md
  • .squad/skills/versioning-policy/SKILL.md
  • .squad/team.md
  • .squad/templates/skill-review-checklist.md
  • .squad/templates/skill.md

These files affect team routing, agent charters, and decisions.
If intentional, ensure approval from the team lead.

@github-actions
Copy link
Copy Markdown

github-actions Bot commented Apr 6, 2026

🛫 PR Readiness Check

ℹ️ This comment updates on each push. Last checked: commit 8e4a56e

PR Scope: 🔧 Infrastructure

⚠️ 3 item(s) to address before review

Status Check Details
Single commit 1 commit — clean history
Not in draft Ready for review
Branch up to date dev is 39 commit(s) ahead — rebase recommended
Copilot review No Copilot review yet — it may still be processing
Changeset present No source files changed — changeset not required
Scope clean ⚠️ PR includes 68 .squad/ file(s) — ensure these are intentional
No merge conflicts No merge conflicts
Copilot threads resolved No Copilot review threads
CI passing 16 check(s) still running

Files Changed (88 files, +6424 −1001)

File +/−
.copilot/skills/agent-collaboration/SKILL.md +8 −3
.copilot/skills/agent-conduct/SKILL.md +8 −3
.copilot/skills/architectural-proposals/SKILL.md +9 −10
.copilot/skills/ci-validation-gates/SKILL.md +8 −3
.copilot/skills/cli-wiring/SKILL.md +13 −0
.copilot/skills/client-compatibility/SKILL.md +8 −3
.copilot/skills/distributed-mesh/SKILL.md +8 −3
.copilot/skills/git-workflow/SKILL.md +8 −3
.copilot/skills/github-multi-account/SKILL.md +11 −5
.copilot/skills/history-hygiene/SKILL.md +10 −5
.copilot/skills/init-mode/SKILL.md +9 −7
.copilot/skills/model-selection/SKILL.md +8 −3
.copilot/skills/release-process/SKILL.md +8 −3
.copilot/skills/reskill/SKILL.md +8 −3
.copilot/skills/reviewer-protocol/SKILL.md +8 −3
.copilot/skills/secret-handling/SKILL.md +10 −5
.copilot/skills/squad-conventions/SKILL.md +8 −3
.squad/agents/span/charter.md +38 −0
.squad/agents/span/history.md +23 −0
.squad/casting/registry.json +7 −0
.squad/routing.md +1 −0
.squad/skill.md +13 −9
.squad/skills/CONTRIBUTING.md +263 −0
.squad/skills/cross-machine-coordination/SKILL.md +13 −0
.squad/skills/cross-squad/SKILL.md +10 −11
.squad/skills/economy-mode/SKILL.md +9 −4
.squad/skills/evals/README.md +307 −0
.squad/skills/evals/agent-collaboration.eval.yaml +46 −0
.squad/skills/evals/agent-conduct.eval.yaml +46 −0
.squad/skills/evals/architectural-proposals.eval.yaml +46 −0
.squad/skills/evals/ci-validation-gates.eval.yaml +46 −0
.squad/skills/evals/ci-validation-gates.exec-eval.yaml +66 −0
.squad/skills/evals/cli-wiring.eval.yaml +46 −0
.squad/skills/evals/client-compatibility.eval.yaml +46 −0
.squad/skills/evals/cross-machine-coordination.eval.yaml +46 −0
.squad/skills/evals/cross-squad.eval.yaml +46 −0
.squad/skills/evals/distributed-mesh.eval.yaml +46 −0
.squad/skills/evals/economy-mode.eval.yaml +46 −0
.squad/skills/evals/economy-mode.exec-eval.yaml +63 −0
.squad/skills/evals/external-comms.eval.yaml +46 −0
.squad/skills/evals/external-comms.exec-eval.yaml +64 −0
.squad/skills/evals/fact-checking.eval.yaml +46 −0
.squad/skills/evals/gh-auth-isolation.eval.yaml +46 −0
.squad/skills/evals/gh-auth-isolation.exec-eval.yaml +51 −0
.squad/skills/evals/git-workflow.eval.yaml +46 −0
.squad/skills/evals/git-workflow.exec-eval.yaml +51 −0
.squad/skills/evals/github-multi-account.eval.yaml +46 −0
.squad/skills/evals/history-hygiene.eval.yaml +46 −0
.squad/skills/evals/humanizer.eval.yaml +46 −0
.squad/skills/evals/init-mode.eval.yaml +46 −0
... +38 more files

Total: +6424 −1001


This check runs automatically on every push. Fix any ❌ items and push again.
See CONTRIBUTING.md and PR Requirements for details.

@github-actions
Copy link
Copy Markdown

github-actions Bot commented Apr 6, 2026

🔴 Impact Analysis — PR #5

Risk tier: 🔴 CRITICAL

📊 Summary

Metric Count
Files changed 88
Files added 50
Files modified 38
Files deleted 0
Modules touched 3

🎯 Risk Factors

  • 88 files changed (>50 → CRITICAL)
  • 3 modules touched (2-4 → MEDIUM)

📦 Modules Affected

copilot-config (17 files)
  • .copilot/skills/agent-collaboration/SKILL.md
  • .copilot/skills/agent-conduct/SKILL.md
  • .copilot/skills/architectural-proposals/SKILL.md
  • .copilot/skills/ci-validation-gates/SKILL.md
  • .copilot/skills/cli-wiring/SKILL.md
  • .copilot/skills/client-compatibility/SKILL.md
  • .copilot/skills/distributed-mesh/SKILL.md
  • .copilot/skills/git-workflow/SKILL.md
  • .copilot/skills/github-multi-account/SKILL.md
  • .copilot/skills/history-hygiene/SKILL.md
  • .copilot/skills/init-mode/SKILL.md
  • .copilot/skills/model-selection/SKILL.md
  • .copilot/skills/release-process/SKILL.md
  • .copilot/skills/reskill/SKILL.md
  • .copilot/skills/reviewer-protocol/SKILL.md
  • .copilot/skills/secret-handling/SKILL.md
  • .copilot/skills/squad-conventions/SKILL.md
root (3 files)
  • templates/skills/nap/SKILL.md
  • templates/skills/rework-rate/SKILL.md
  • templates/skills/squad-conventions/SKILL.md
squad-state (68 files)
  • .squad/agents/span/charter.md
  • .squad/agents/span/history.md
  • .squad/casting/registry.json
  • .squad/routing.md
  • .squad/skill.md
  • .squad/skills/CONTRIBUTING.md
  • .squad/skills/cross-machine-coordination/SKILL.md
  • .squad/skills/cross-squad/SKILL.md
  • .squad/skills/economy-mode/SKILL.md
  • .squad/skills/evals/README.md
  • .squad/skills/evals/agent-collaboration.eval.yaml
  • .squad/skills/evals/agent-conduct.eval.yaml
  • .squad/skills/evals/architectural-proposals.eval.yaml
  • .squad/skills/evals/ci-validation-gates.eval.yaml
  • .squad/skills/evals/ci-validation-gates.exec-eval.yaml
  • .squad/skills/evals/cli-wiring.eval.yaml
  • .squad/skills/evals/client-compatibility.eval.yaml
  • .squad/skills/evals/cross-machine-coordination.eval.yaml
  • .squad/skills/evals/cross-squad.eval.yaml
  • .squad/skills/evals/distributed-mesh.eval.yaml
  • .squad/skills/evals/economy-mode.eval.yaml
  • .squad/skills/evals/economy-mode.exec-eval.yaml
  • .squad/skills/evals/external-comms.eval.yaml
  • .squad/skills/evals/external-comms.exec-eval.yaml
  • .squad/skills/evals/fact-checking.eval.yaml
  • .squad/skills/evals/gh-auth-isolation.eval.yaml
  • .squad/skills/evals/gh-auth-isolation.exec-eval.yaml
  • .squad/skills/evals/git-workflow.eval.yaml
  • .squad/skills/evals/git-workflow.exec-eval.yaml
  • .squad/skills/evals/github-multi-account.eval.yaml
  • .squad/skills/evals/history-hygiene.eval.yaml
  • .squad/skills/evals/humanizer.eval.yaml
  • .squad/skills/evals/init-mode.eval.yaml
  • .squad/skills/evals/model-selection.eval.yaml
  • .squad/skills/evals/model-selection.exec-eval.yaml
  • .squad/skills/evals/nap.eval.yaml
  • .squad/skills/evals/optimize-description.mjs
  • .squad/skills/evals/personal-squad.eval.yaml
  • .squad/skills/evals/pr-screenshots.eval.yaml
  • .squad/skills/evals/ralph-two-pass-scan.eval.yaml
  • .squad/skills/evals/release-process.eval.yaml
  • .squad/skills/evals/release-process.exec-eval.yaml
  • .squad/skills/evals/reskill.eval.yaml
  • .squad/skills/evals/reviewer-protocol.eval.yaml
  • .squad/skills/evals/reviewer-protocol.exec-eval.yaml
  • .squad/skills/evals/rework-rate.eval.yaml
  • .squad/skills/evals/run-evals.mjs
  • .squad/skills/evals/run-llm-evals.mjs
  • .squad/skills/evals/secret-handling.eval.yaml
  • .squad/skills/evals/secret-handling.exec-eval.yaml
  • .squad/skills/evals/session-recovery.eval.yaml
  • .squad/skills/evals/session-recovery.exec-eval.yaml
  • .squad/skills/evals/squad-conventions.eval.yaml
  • .squad/skills/evals/validate-schema.mjs
  • .squad/skills/evals/versioning-policy.eval.yaml
  • .squad/skills/external-comms/SKILL.md
  • .squad/skills/gh-auth-isolation/SKILL.md
  • .squad/skills/humanizer/SKILL.md
  • .squad/skills/model-selection/SKILL.md
  • .squad/skills/personal-squad/SKILL.md
  • .squad/skills/pr-screenshots/SKILL.md
  • .squad/skills/ralph-two-pass-scan/SKILL.md
  • .squad/skills/release-process/SKILL.md
  • .squad/skills/session-recovery/SKILL.md
  • .squad/skills/versioning-policy/SKILL.md
  • .squad/team.md
  • .squad/templates/skill-review-checklist.md
  • .squad/templates/skill.md

This report is generated automatically for every PR. See #733 for details.

@github-actions
Copy link
Copy Markdown

github-actions Bot commented Apr 6, 2026

🏗️ Architectural Review

⚠️ Architectural review: 1 warning(s), 1 info.

Severity Category Finding Files
ℹ️ info template-sync Template files changed in templates/ but not in other template locations. If these templates should stay in sync, consider updating the others too. Changed: templates/, Unchanged: .squad-templates/, packages/squad-cli/templates/, .github/workflows/
🟡 warning sweeping-refactor This PR touches 88 files (88 modified/added, 0 deleted). Large PRs are harder to review — consider splitting if possible.

Automated architectural review — informational only.

@jazz127 jazz127 merged commit 18e6a37 into jazz127:dev Apr 6, 2026
19 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants