[copilot-cli-research] Copilot CLI Deep Research - 2026-03-19 #21858
Replies: 4 comments
-
|
🤖 Beep boop! The smoke test agent was here! Testing all systems... ✅ Running checks on PR #21752. Stay tuned for the full report! Note 🔒 Integrity filtering filtered 2 itemsIntegrity filtering activated and filtered the following items during workflow execution.
|
Beta Was this translation helpful? Give feedback.
-
|
🎉 The smoke test robot stopped by for a visit!
11/12 tests passed in run §23318554061. The only hiccup? Serena MCP isn't around at the party 🎈. Everything else is rocking! 🤖💚 Note 🔒 Integrity filtering filtered 2 itemsIntegrity filtering activated and filtered the following items during workflow execution.
|
Beta Was this translation helpful? Give feedback.
-
|
💥 WHOOSH! 🦸♂️ KAPOW! The smoke test agent bursts onto the scene! ⚡ ZAP! Claude Engine ACTIVATED — Run 23318554140 is NOMINAL! "With great automation comes great responsibility!" — The Smoke Test Agent 🌟 BOOM! All systems GO! The agentic workflows are ALIVE and KICKING! 🚀 Note 🔒 Integrity filtering filtered 1 itemIntegrity filtering activated and filtered the following item during workflow execution.
|
Beta Was this translation helpful? Give feedback.
-
|
This discussion has been marked as outdated by Copilot CLI Deep Research Agent. A newer discussion is available at Discussion #22031. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Analysis Date: 2026-03-19
Repository: github/gh-aw
Triggered by:
@pelikhanScope: 175 total workflows, 83 using Copilot engine (47%)
📊 Executive Summary
This first comprehensive analysis of Copilot CLI usage across 175 agentic workflows reveals a significant gap between available features and actual adoption. Of 83 Copilot-engine workflows, only 12 (14%) use the AWF sandbox firewall, while 7 of 9 custom agent files go unused. Key advanced features like
max-continuations,startup-timeout,engine.env, and granular GitHub tool permissions are almost entirely absent despite being fully supported.The most impactful opportunity is broader AWF sandbox adoption — 71 Copilot workflows currently run with full network access, creating unnecessary security exposure. The second biggest opportunity is leveraging existing custom agent files (
.github/agents/*.agent.md) to provide specialized personas and behavioral instructions to specific workflow types.Primary Recommendation: Systematically enable AWF sandbox for security-sensitive workflows and wire up unused custom agent files to appropriate workflows.
🔴 Critical Findings
Security: 71 of 83 Copilot Workflows Run Without AWF Sandbox
Most Copilot workflows execute with unrestricted network access. Only 12 workflows use
sandbox: agent: awf. This is the highest-impact opportunity — especially for workflows that process untrusted input (issues, PRs, comments).Underutilization: 7/9 Custom Agent Files Are Never Used
The repository has 9
.github/agents/*.agent.mdfiles, but onlyci-cleanerandtechnical-doc-writerare wired to workflows. The following agents are orphaned:agentic-workflows.agent.mdcontribution-checker.agent.mdcreate-safe-output-type.agent.mdcustom-engine-implementation.agent.mdgrumpy-reviewer.agent.mdinteractive-agent-designer.agent.mdw3c-specification-writer.agent.md🟡 Medium Priority Issues
max-continuationsNearly Absent: Only 1 workflow (smoke-copilot) usesmax-continuations: 2. Copilot CLI explicitly supports--autopilot --max-autopilot-continues, making this ideal for complex multi-step workflows. Complex workflows likeci-doctor,code-scanning-fixer, andhourly-ci-cleanercould benefit significantly.GitHub Tool Permissions Too Broad: 122 workflows use
toolsets: [default]which grants broad GitHub MCP access. Only 8 workflows define granularallowed:lists. For read-only workflows, specifyingtoolsets: [repos, issues]instead of[default]would reduce attack surface.startup-timeoutNever Used: Despite being a supported feature, no workflow setsstartup-timeout. This helps detect hung Copilot CLI processes that never initialize, avoiding wasted runner time.1️⃣ Current State Analysis
View Copilot CLI Capabilities Inventory
Available Engine Configuration Options
CLI Flags Auto-Generated by gh-aw
--add-dir(workspace, /tmp/gh-aw/, cache-memory dirs)--log-level all --log-dir (logs_folder)--disable-builtin-mcps--allow-tool(per configured tool)--autopilot --max-autopilot-continues N(when max-continuations > 1)--agent (id)(when engine.agent is set)Available Sandbox Options
Available Feature Flags
View Usage Statistics
Most Common Tools:
githubwithtoolsets: [default]— 122 usesgithubsimple (no toolsets) — ~8 usesbashwith specific commands — manybash: ["*"](allow all) — 10 workflowsweb-fetch— 20 workflowsEngine Distribution:
engine: copilot(simple) — 82 workflowsengine: {id: copilot, ...}(extended) — ~4 workflowsengine: claude/engine: codex— 20+ workflowsengine: gemini— handful2️⃣ Feature Usage Matrix
engine.modelengine.agentengine.versionengine.argsengine.envengine.api-targetsandbox.agent: awfsandbox.agent.mountssandbox.agent.memorymax-continuationsstartup-timeoutgithub.allowed(granular)block-domainsfeatures.copilot-requestsfeatures.mcp-gatewayfeatures.disable-xpia-promptfeatures.mcp-scriptscache-memorytool3️⃣ Missed Opportunities
🔴 High Priority Opportunities
Opportunity 1: Enable AWF Sandbox for Security-Sensitive Workflows
issues,pull_request,issue_comment, ordiscussioneventssandbox: agent: awfand ensurenetwork.allowedlists only required domainsCandidate workflows:
auto-triage-issues,ai-moderator,bot-detection,contribution-check,code-scanning-fixer,breaking-change-checker, and ~30 others triggered by user contentOpportunity 2: Wire Unused Custom Agent Files to Workflows
.github/agents/files are never referenced by workflowsagentic-workflows.agent.mdcreate-agentic-workflow.md,update-agentic-workflow.mdcontribution-checker.agent.mdcontribution-check.mdgrumpy-reviewer.agent.mdcode-simplifier.md,cli-consistency-checker.mdw3c-specification-writer.agent.mdweekly-blog-post-writer.md,technical-doc-writer.mdinteractive-agent-designer.agent.mdworkflow-generator.md,create-agentic-workflow.md🟡 Medium Priority Opportunities
Opportunity 3: Enable
max-continuationsfor Complex Workflowsmax-continuations, but Copilot CLI fully supports autopilot modeci-doctor.md,code-scanning-fixer.md,hourly-ci-cleaner.md,tidy.md,update-astro.mdNote from the codebase: This translates to
--autopilot --max-autopilot-continues 3.Opportunity 4: Add
startup-timeoutto All Copilot Workflowsstartup-timeoutdespite it being a supported fieldtimeout-minutesvalue. A startup timeout of 2-3 minutes catches initialization failures quickly.Opportunity 5: Use Granular GitHub Tool Permissions
toolsets: [default]which includes repos, issues, pull requests, code scanning, and more — even when only a subset is neededOpportunity 6: Pin Copilot CLI Version for Critical Workflows
engine: copilotwithout a version pin, meaning they always installlatesthourly-ci-cleaner,daily-news,daily-copilot-token-report,weekly-blog-post-writerOpportunity 7: Use
engine.envfor Workflow-Specific Configurationengine.envdespite it being available for all engines🟢 Low Priority Opportunities
Opportunity 8: Set
sandbox.agent.memoryfor Resource-Intensive WorkflowsOpportunity 9: Use
block-domainsfor Defense in DepthOpportunity 10: Enable
features.mcp-gatewayfor Better MCP Routingmcp-gatewayfeature (was tested in smoke workflows previously but removed)Opportunity 11: Fix
ci-coach.mdCache Path Bugci-coach.mdwrites to/tmp/cache-memory/ci-coach/last-analysis.jsonbut the correct path is/tmp/gh-aw/cache-memory/cache-memory: trueto the tools sectionThen change the agent prompt to use
/tmp/gh-aw/cache-memory/ci-coach/instead of/tmp/cache-memory/ci-coach/.Opportunity 12: Expand
max-continuationsAwareness in Commentsmax-continuations(maps to--max-autopilot-continues)hourly-ci-cleaner.mdcomment blockmax-continuationsas the Copilot equivalent4️⃣ Specific Workflow Recommendations
View High-Impact Workflow Changes
ci-doctor.mdgpt-5.1-codex-minimodel, no sandbox, no max-continuationsmax-continuations: 3to allow iterative diagnosis, andsandbox: agent: awffor safetyhourly-ci-cleaner.mdmax-continuationsmax-continuations: 3and remove the incorrect commentcontribution-check.mdengine: {id: copilot, agent: contribution-checker}to use the existingcontribution-checker.agent.mdcode-scanning-fixer.mdsandbox: agent: awf,max-continuations: 2ci-coach.md/tmp/cache-memory//tmp/gh-aw/cache-memory/and addcache-memory: trueto toolsworkflow-generator.mdassign-to-agentbut no specialized agent fileinteractive-agent-designer.agent.md5️⃣ Trends & Insights
View Historical Context (First Analysis)
This is the first comprehensive analysis of Copilot CLI usage in this repository. Future analyses will track:
max-continuationsgets adopted for complex workflowsstartup-timeoutgets standardizedBaseline established: 2026-03-19, Run §23317404420
6️⃣ Best Practice Guidelines
Based on this research, here are recommended best practices for Copilot engine workflows:
Always set
startup-timeout: Addstartup-timeout: 3to all Copilot workflows to catch hung initialization earlyUse AWF for untrusted input: Any workflow triggered by user-submitted content (issues, PRs, comments) should have
sandbox: agent: awfLeverage custom agents: Before writing long behavioral prompts, check if an existing
.github/agents/*.agent.mdmatches your use case — or create oneScope GitHub tool permissions: Use
toolsets: [repos, issues]instead oftoolsets: [default]when your workflow only reads issues and PRsUse
max-continuationsfor iterative tasks: CI fixers, code reviewers, and content generators benefit from multiple autonomous continuation loopsPin versions for critical scheduled workflows: Use
engine: {id: copilot, version: "X.Y.Z"}for high-frequency scheduled workflows to prevent surprise breakageUse correct cache-memory path: Always use
/tmp/gh-aw/cache-memory/(not/tmp/cache-memory/)7️⃣ Action Items
Immediate Actions (this week):
ci-coach.mdcache path bug (/tmp/cache-memory/→/tmp/gh-aw/cache-memory/)hourly-ci-cleaner.mdcomment about max-turns; addmax-continuations: 3contribution-checker.agent.mdtocontribution-check.mdShort-term (this month):
startup-timeout: 3as standard to all Copilot workflowsmax-continuations: 2-3toci-doctor,code-scanning-fixerLong-term (this quarter):
toolsets: [default]to specific toolsets where possiblefeatures.mcp-gatewayfor workflows with multiple MCP serversengine.envpattern for workflows with hardcoded configuration valuesView Supporting Evidence & Methodology
Research Methodology
pkg/workflow/copilot_*.go) to inventory available features and CLI flagsdocs/src/content/docs/reference/engines.mdfor documented capabilities.github/workflows/*.mdfiles for feature usage patternspkg/constants/constants.gofor feature flags and engine configurationcopilot_engine_execution.goto understand how features map to CLI flagsKey Files Analyzed
pkg/workflow/copilot_engine.go— Engine interface and capabilitiespkg/workflow/copilot_engine_execution.go— CLI argument constructionpkg/workflow/copilot_engine_tools.go— Tool permission logicpkg/workflow/copilot_mcp.go— MCP server configurationpkg/workflow/sandbox.go— Sandbox configuration structuresdocs/src/content/docs/reference/engines.md— Engine documentationData Sources
.github/workflows/*.md.github/agents/*.agent.mdReferences:
Beta Was this translation helpful? Give feedback.
All reactions