Agent Persona Exploration - 2026-04-29 #29038
Closed
Replies: 2 comments 1 reply
-
|
/plan |
Beta Was this translation helpful? Give feedback.
1 reply
-
|
This discussion has been marked as outdated by Agent Persona Explorer. A newer discussion is available at Discussion #29241. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Overview
This report analyzes the agentic-workflows custom agent (configured in
.github/aw/create-agentic-workflow.md) against 4 representative software worker personas and their automation scenarios. Since the custom agent is invoked via GitHub Copilot Chat (/agent agentic-workflows) and is not directly callable from a workflow environment, this analysis is documentation-based — evaluating how well the agent's instructions prepare it to handle each scenario..github/aw/create-agentic-workflow.md(59.5 KB)Key Findings
safe-outputsfor all write operations — well-documented with explicit ❌ / ✅ examplesdaily,weekly,daily on weekdays) is thoroughly documented with rationale (avoid load spikes, avoid Monday backlog)api.github.comdirect access won't work and that the GitHub MCP server must be used insteadworkflow_runtrigger (critical for DevOps monitoring scenarios) has only 4 mentions with no dedicated examplesadd-labelsafe-output documented; the closest options areupdate-issueandadd-commentScenario Results
View High-Quality Responses (S4, S5)
S4 — QA Tester: Test Coverage Analysis on PRs (avg 4.6/5)
The agent is well-prepared for this scenario. The
pull_requesttrigger is thoroughly covered,add-commentviasafe-outputsis the primary documented output pattern, and the GitHub MCP toolsets provide everything needed to read PR diffs and check runs. The main gap is no pre-built guidance on coverage tool output parsing (e.g., lcov, coverage.py), but the agent can infer this viabashsteps.S5 — Product Manager: Weekly Feature Digest (avg 5.0/5)
Perfect fit. The agent explicitly documents: fuzzy scheduling with
schedule: weekly, weekday-only preference to avoid Monday backlog,create-discussionas the recommended safe-output for digests/reports, and GitHub toolsets for reading closed PRs and issues. No gaps identified.View Areas for Improvement (S1, S3)
S1 — Backend Engineer: DB Schema Review (avg 4.4/5)
The
pull_requesttrigger with path filters (paths: ['db/migrations/*.sql']) is supported but the create-agentic-workflow.md does not include an example of path-filtered PR triggers. Additionally, applying a label (e.g.,needs-db-review) to a PR has no explicit safe-output — the agent would need to guide users toward theupdate-issuesafe-output or a custom safe-output job, which isn't obvious.S3 — DevOps Engineer: Deployment Monitoring (avg 3.4/5)
This is the weakest scenario. The
workflow_runtrigger is mentioned only 4 times with no complete examples. The agent may correctly identify the architectural constraint (can't wait for deployments) and redirect to simpler alternatives, but lacks concrete guidance onworkflow_run-based monitoring patterns. A DevOps engineer asking for "monitor failed deployments" may receive a vague or redirected response.Top Patterns
pull_request(PR automation) andschedule: weekly/daily(digests/monitoring) dominate agent guidancetoolsets: [default]) is the near-universal first tool;web-fetchandplaywrightfor external dataRecommendations
Add
workflow_runtrigger examples to.github/aw/create-agentic-workflow.md— This is the natural trigger for DevOps monitoring scenarios (react to failed CI/CD runs). Add 1-2 examples with the patternon: workflow_run: workflows: [...] types: [completed]showing how to accessgithub.event.workflow_runcontext for failure detection.Document
add-labelsafe-output capability in.github/aw/create-agentic-workflow.md— Ifadd-labelis a supported safe-output operation, it should appear in the safe-outputs guidance section. If it's not supported, add an explicit note directing users to useupdate-issuewith label arrays instead. This prevents confusion for Backend/QA scenarios where labeling is a natural output.Add a path-filtered PR trigger example — Include a frontmatter snippet showing `pull_request: paths: [...]" filtering in the agent file. Backend engineers routinely need schema change detection, and this is a common but non-obvious GitHub Actions pattern.
References:
Beta Was this translation helpful? Give feedback.
All reactions