Add support for safe-inputs front matter section by Copilot · Pull Request #5090 · github/gh-aw

Copilot · 2025-11-29T17:37:49Z

Adds a new safe-inputs front matter section that provides a way to register custom tools as JavaScript or shell scripts. These tools are mounted in an MCP server and have access to secrets as specified by the user.

Changes Made

JSON Schema: Added safe-inputs property definition in main_workflow_schema.json and included_file_schema.json with validation for tool configurations (description required, script/run mutually exclusive)
Go Types & Parsing: Created SafeInputsConfig type and ParseSafeInputs function in safe_inputs.go
MCP Server Generation: Generates standalone MCP server JavaScript at runtime with support for both JavaScript and shell script tool execution
Engine Integration: Integrated safe-inputs MCP server into Copilot, Claude, Codex, and Custom engines
Secret Handling: Environment variables from env field are properly passed through to the MCP server
Import Support: Added MergedSafeInputs to ImportsResult and implemented mergeSafeInputs for importing safe-inputs from shared workflows
Auto-wrapping JavaScript: User JavaScript code is automatically wrapped in an async function with module.exports. Input parameters are destructured and available as local variables, so users can write simple code without worrying about exports.
Large Output Handling: Tool outputs larger than 500 characters are automatically written to /tmp/gh-aw/safe-inputs/calls/ directory. The response now includes:
- status: "output_saved_to_file"
- file_path: Path to the saved output file
- file_size_bytes: Size of the file in bytes
- file_size_chars: Size of the file in characters
- message: Human-readable description
- json_schema_preview: (optional) Simplified JSON schema extracted using jq if the output is JSON
Documentation: Added comprehensive documentation page at docs/src/content/docs/reference/safe-inputs.md

New Files

shared/pr-data-safe-input.md - Shared workflow providing fetch-pr-data tool using gh CLI
docs/src/content/docs/reference/safe-inputs.md - Documentation for safe-inputs feature

Usage Example

Define a safe-input tool (simplified - no exports needed!)

safe-inputs:
  test-js-math:
    description: "Test JavaScript math operations"
    inputs:
      a:
        type: number
        required: true
      b:
        type: number
        required: true
    script: |
      // Users can write simple code - inputs are available as local variables
      const sum = a + b;
      const product = a * b;
      return { sum, product };
    env:
      API_KEY: ${{ secrets.API_KEY }}
  echo-message:
    description: "Echo a message"
    inputs:
      message:
        type: string
        default: "Hello"
    run: |
      echo $INPUT_MESSAGE

Import safe-inputs from shared workflows

imports:
  - shared/pr-data-safe-input.md

Testing

✅ All unit tests pass
✅ All lint checks pass
✅ Code formatting applied

Original prompt

Add support for a new front matter section "safe-inputs" that provides a way to register tools as JavaScript or shell scripts.

So the plan will be to support “ligjtweight” definition of tools in the frontmatter (JavaScript or shell). The tools will be mounted in a MCP server, and will have access to secrets as specified by the user.
Something like this:

safe-inputs: # name TBD, safe-tools?
search-issues: # tool name
description: … # mandatory
inputs: # optional
# workflow syntax inputs
script: … JavaScript impl (cjs)
run: … shell script (script or run, not both)
env:
GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}

The compiler should generate all the necessary files (MCP server JavaScript, cjs and shell scripts) under /tmp/gh-aw/safe-inputs/ and add the MCP server configuration in the generated mcpServer config of the agent.

✨ Let Copilot coding agent set things up for you — coding agent works faster and does higher quality work when set up for your repo.

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

pelikhan · 2025-11-29T18:44:10Z

@copilot continue work

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot · 2025-11-29T19:21:00Z

@copilot continue work

Continued work on safe-inputs. In commit 1e1e43b, I've integrated the safe-inputs MCP server into workflow compilation:

Added RenderSafeInputs callback to all engines (Copilot, Claude, Codex, Custom)
Safe-inputs MCP server is now generated inline in the workflow with a self-contained MCP server implementation
Individual tool files (.cjs for JavaScript, .sh for shell) are generated per tool
Environment variables for secrets are properly passed through
Updated test files to handle new function signatures

The implementation now generates a working MCP server that exposes custom tools to the agent. Unit tests for safe-inputs parsing and generation are still pending.

Warning

Firewall rules blocked me from connecting to one or more addresses (expand for details)

I tried to connect to the following addresses, but was blocked by firewall rules:

https://api.github.com/user
- Triggering command: /usr/bin/gh gh api user --jq .login (http block)

If you need me to access, download, or install something from one of these locations, you can either:

Configure Actions setup steps to set up my environment, which run before the firewall is enabled
Add the appropriate URLs or hosts to the custom allowlist in this repository's Copilot coding agent settings (admins only)

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

…ration Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

pelikhan · 2025-11-29T20:12:16Z

@copilot add support for importing safe-inputs tool from shared agentic workflows.

Add a shared agentic workflow that uses gh PRs to download PR data using a shell script. Use that workflow with dev.md to compute basic PR stats

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot · 2025-11-29T20:25:36Z

@copilot add support for importing safe-inputs tool from shared agentic workflows.

Add a shared agentic workflow that uses gh PRs to download PR data using a shell script. Use that workflow with dev....

Added safe-inputs import support in commit 63e918c. Created shared/pr-data-safe-input.md that provides a fetch-pr-data tool using gh CLI. Updated dev.md to import this shared workflow and compute Copilot PR statistics.

The agent can now use fetch-pr-data with parameters like search: "head:copilot/" to fetch PR data and compute stats (total PRs, merged/closed/open counts, average merge time, most active day).

pelikhan · 2025-11-29T23:28:38Z

@copilot add special handling for large tool outputs. If larger than 500 chars,

write output to a temp file (under /tmp/gh-aw/safe-inputs/calls/
use jq to extract schema
return response with file path, size and scheme if it was extracted

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot · 2025-11-29T23:36:58Z

@copilot add special handling for large tool outputs. If larger than 500 chars,

write output to a temp file (under /tmp/gh-aw/safe-inputs/calls/ ...

Added large output handling in commit 91c65f4. Tool outputs > 500 chars are written to /tmp/gh-aw/safe-inputs/calls/call_{timestamp}_{counter}.txt and the MCP response contains a reference to the file path.

pelikhan · 2025-11-30T00:36:23Z

@copilot when the file is too large return a structure response to the LLM that clearly states where the output response file path is, file size and attempt to run "jq" to extract a simplified JSON schema.

pelikhan · 2025-11-30T00:40:20Z

@copilot the javascript should always be wrapped into a function with modules.exports assignment so that the user can simplify write code without worrying about exports.

github-actions · 2025-11-30T00:41:45Z

⚠️ Dev Hawk Report - Failure Analysis

Workflow Run: #3019

Status: failure
Commit: 91c65f4
Duration: ~4 minutes 39 seconds
Failed Job: agent
Failed Step: gh add: better tracking of added/modified files #34 "Parse agent logs for step summary"

Root Cause Analysis

The Dev workflow failed due to an MCP server launch failure for the new safe-inputs feature being tested in this PR.

Primary Error

##[error]MCP server(s) failed to launch: safeinputs

This error occurred at step 34 during log parsing, after the Claude Code CLI execution completed successfully (step 27). The safe-inputs MCP server that was configured in step 13 ("Setup Safe Inputs MCP") failed to launch properly.

Error Context

What happened:

✅ Safe Inputs MCP setup completed (step 13)
✅ Claude Code CLI executed successfully (step 27)
✅ Claude agent completed its task (analyzed 978 Copilot PRs from the last 30 days)
❌ Log parsing step detected the MCP server launch failure

Key observation: The Claude agent was able to run and complete its analysis task successfully, but the safe-inputs MCP server failed to launch. This suggests the MCP server configuration or initialization code has an issue.

Failure Category

Infrastructure/Configuration Error - MCP Server Launch Failure

The failure is related to the new safe-inputs MCP server infrastructure introduced in this PR, not the core functionality of the agent or the workflow logic.

Recommended Actions

Investigate MCP server generation code - Review the safe-inputs MCP server JavaScript generation in the compiler (likely in pkg/workflow/safe_inputs.go or related files)
Check MCP server configuration - Verify the generated MCP config at /tmp/gh-aw/.copilot/mcp-config.json or similar path
Add MCP server validation - Consider adding validation or error handling for MCP server launch failures
Review step 13 logs - Examine "Setup Safe Inputs MCP" step logs for any warnings or initialization issues
Test MCP server locally - Try running the generated safe-inputs MCP server outside of the workflow to reproduce the issue

Investigation Notes

Positive findings:

The workflow successfully installed Claude Code CLI
Network permissions hook was generated correctly
The Claude agent executed and completed its assigned task (PR analysis)
Error validation passed (no ERROR/WARNING patterns detected in agent logs)

Key diagnostic info:

The agent ran for 3 minutes with 15 turns and cost $1.67
Token usage: 48 input, 185k cache creation, 810k cache read, 6.9k output
All other workflow steps completed successfully
The failure only manifested during the log parsing step

The error suggests the safe-inputs MCP server configuration or initialization is problematic, but doesn't prevent the main agent execution. This indicates the issue is in how the MCP server is being launched or configured, not in the agent's ability to use it.

🤖 Automated analysis by Dev Hawk - monitoring copilot/ branch workflows

🦅 Observed from above by Dev Hawk

…and JSON schema preview Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot · 2025-11-30T00:44:22Z

@copilot when the file is too large return a structure response to the LLM that clearly states where the output response file path is, file size and attempt to run "jq" to extract a simplified JSON sc...

Implemented in commit 6adf6bd. When output exceeds 500 chars, the response now includes:

status: "output_saved_to_file"
file_path: Full path to the output file
file_size_bytes and file_size_chars: Size information
json_schema_preview: Simplified JSON schema extracted via jq (if output is JSON)

pelikhan · 2025-11-30T00:48:52Z

/q investigate #5090 (comment)

If the error is not clear, update dev.md to use codex for better error messages.

github-actions · 2025-11-30T00:49:16Z

🔧 Pay attention, 007! Q is preparing your gadgets for this issue comment...

github-actions · 2025-11-30T00:53:45Z

🔧 Q Investigation Report - MCP Server Launch Failure

Issue Analysis

I investigated the Dev workflow failure (run #19791426012) reported by Dev Hawk.

Root Cause

The workflow failed with:

##[error]MCP server(s) failed to launch: safeinputs

This failure occurred during log parsing (step 34) after the Claude agent had already completed its execution successfully.

Key Findings

MCP Server Not Launching: The "safeinputs" MCP server is being registered in the workflow configuration but fails to launch when the agent starts
Agent Completed Successfully: The Claude agent ran for 3 minutes with 15 turns and completed its PR analysis task, indicating the safe-inputs feature wasn't actually needed for this workflow
Clear Error Location: The failure is in the MCP server initialization, not in the agent's ability to process tasks

Investigation Details

What I Checked:

✅ Reviewed audit logs from run #19791426012
✅ Examined dev.md workflow configuration
✅ Checked for safe-inputs implementation files
✅ Analyzed MCP server failure patterns

Observations:

The dev.md workflow uses the Claude engine (not Copilot or Codex)
No actual safe-inputs usage occurs in the dev.md workflow prompt
The MCP server is being registered but the launch fails silently until log parsing
Error reporting shows the failure but doesn't provide details about WHY the server failed to launch

Recommended Actions

1. Fix the Safe-Inputs MCP Server Implementation

The issue is in the safe-inputs MCP server generation code (likely in the PR changes). The server needs to:

Generate valid JavaScript for the MCP server
Handle proper module exports
Include error handling for initialization failures

Files to Check:

Go code that generates the safe-inputs MCP server JavaScript
MCP server configuration generation for safe-inputs
Server initialization and validation logic

2. Consider Engine Change for Dev Workflow

As suggested by @pelikhan, switching dev.md from Claude to Codex might provide better error messages:

- engine: claude
+ engine: codex

Pros:

Codex may provide clearer error reporting for MCP failures
Could help diagnose the safe-inputs launch issue

Cons:

Doesn't fix the underlying safe-inputs MCP server bug
Just provides better diagnostics

3. Add MCP Server Validation

Add validation that checks if safe-inputs MCP server can launch before the agent starts:

Pre-flight check for MCP server validity
Better error messages when server fails to initialize
Clear indication of what went wrong (missing file, invalid JS, etc.)

Why Not Creating a PR

The actual bug is in the safe-inputs implementation code (within this PR's changes), not in the workflow configuration. The fix needs to happen at the compiler/code-generation level where the safe-inputs MCP server is created.

What I Would Fix If I Had Access:

Review the safe-inputs MCP server generation logic in the Go code
Ensure the generated JavaScript is valid and can be executed
Add better error handling during MCP server initialization
Add validation tests for safe-inputs MCP server generation

Next Steps

For the PR author to fix:

Debug why the generated safe-inputs MCP server fails to launch
Add error logging to the MCP server generation to show what's going wrong
Test the generated MCP server JavaScript in isolation to verify it works
Consider adding integration tests that validate MCP server launch

🤖 Automated analysis by Q - agentic workflow optimizer
📊 Workflow Run: https://github.com/githubnext/gh-aw/actions/runs/19791581437

🎩 Equipped by Q fer issue #5090 🗺️

pelikhan · 2025-11-30T00:57:06Z

@copilot update dev.md to use codex, add test cases in JavaScript to discover the error.

…ev.md to use codex with JS test cases Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot · 2025-11-30T01:33:55Z

@copilot the javascript should always be wrapped into a function with modules.exports assignment so that the user can simplify write code without worrying about exports.

Implemented in commit b749805. JavaScript code is now automatically wrapped in a function with module.exports. Input parameters are destructured and available as local variables, so users can write simple code like:

const sum = a + b;
return { sum };

Instead of:

module.exports = async function(params) {
  const { a, b } = params;
  return { sum: a + b };
};

@copilot update dev.md to use codex, add test cases in JavaScript to discover the error.

Updated dev.md to use Codex engine and added JavaScript test cases in commit b749805. The workflow now includes:

test-js-math tool to test math operations (a + b, a * b)
test-js-string tool to test string operations (uppercase, length)

github-actions · 2025-11-30T02:18:55Z

✅ Dev Hawk Report - Success

Workflow Run: #3020

Status: ✅ success
Commit: b749805
Branch: copilot/add-safe-inputs-support
Duration: ~26 minutes (1564 seconds)
Triggered by: @pelikhan via workflow_dispatch

Summary

The Dev workflow completed successfully! 🎉 All checks passed for the safe-inputs feature implementation.

Workflow Details:

Started: 2025-11-30 01:50:39 UTC
Completed: 2025-11-30 02:16:43 UTC
Jobs executed: 5

This validates the changes made to add support for the safe-inputs front matter section, including:

Auto-wrapping of user JavaScript code with destructured inputs
Large output handling with file storage
Integration with Codex engine for testing

The workflow successfully validated the implementation with JavaScript test cases. 🚀

🦅 Observed from above by Dev Hawk

pelikhan · 2025-11-30T13:47:01Z

@copilot add documentation page

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot · 2025-11-30T13:53:19Z

@copilot add documentation page

Added documentation page in commit 55fa0ab at docs/src/content/docs/reference/safe-inputs.md. The page covers:

Quick start and tool definition
JavaScript and shell script implementations
Input parameters and validation
Environment variables and secrets
Large output handling
Importing from shared workflows
Security considerations
Comparison with other tool options

pelikhan · 2025-11-30T14:10:53Z

@copilot format, lint, fix all tests

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot · 2025-11-30T14:44:10Z

@copilot format, lint, fix all tests

Completed in commit 22c6abc. All code formatted, linted, and tests pass.

Initial plan

80aea5f

Copilot AI assigned Copilot and pelikhan Nov 29, 2025

Copilot started work on behalf of pelikhan November 29, 2025 17:38 View session

Changes before error encountered

0b7537e

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot AI requested a review from pelikhan November 29, 2025 17:55

Copilot started work on behalf of pelikhan November 29, 2025 18:44 View session

Integrate safe-inputs MCP server into workflow compilation

1e1e43b

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot AI and others added 2 commits November 29, 2025 19:29

Add unit tests for safe-inputs parsing and generation

ebd4b78

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Address code review: sanitize parameter names in JavaScript code gene…

8e36564

…ration Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Add support for safe-inputs front matter section~~ Add support for safe-inputs front matter section Nov 29, 2025

Copilot finished work on behalf of pelikhan November 29, 2025 19:39

Copilot started work on behalf of pelikhan November 29, 2025 20:12 View session

Add safe-inputs import support and create PR data fetch shared workflow

63e918c

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot finished work on behalf of pelikhan November 29, 2025 20:27

Copilot started work on behalf of pelikhan November 29, 2025 23:29 View session

Add large output handling for safe-inputs (write to file if > 500 chars)

91c65f4

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot finished work on behalf of pelikhan November 30, 2025 00:03

Copilot started work on behalf of pelikhan November 30, 2025 00:36 View session

Add structured response for large tool outputs with file path, size, …

6adf6bd

…and JSON schema preview Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot finished work on behalf of pelikhan November 30, 2025 01:27

Copilot started work on behalf of pelikhan November 30, 2025 01:28 View session

Auto-wrap user JS code in function with destructured inputs; update d…

b749805

…ev.md to use codex with JS test cases Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot finished work on behalf of pelikhan November 30, 2025 01:35

Copilot started work on behalf of pelikhan November 30, 2025 13:47 View session

Add safe-inputs documentation page

55fa0ab

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

pelikhan marked this pull request as ready for review November 30, 2025 14:02

Copilot finished work on behalf of pelikhan November 30, 2025 14:38

Copilot started work on behalf of pelikhan November 30, 2025 14:39 View session

Format, lint, and fix all tests

22c6abc

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

pelikhan merged commit d78af91 into main Nov 30, 2025
63 of 67 checks passed

pelikhan deleted the copilot/add-safe-inputs-support branch November 30, 2025 14:59

Copilot stopped work on behalf of pelikhan due to an error November 30, 2025 14:59
Copilot has encountered an error. See logs for additional details.

github-actions bot mentioned this pull request Dec 1, 2025

[docs] Update documentation for shared gh.md workflow #5185

Merged

Conversation

Copilot AI commented Nov 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes Made

New Files

Usage Example

Define a safe-input tool (simplified - no exports needed!)

Import safe-inputs from shared workflows

Testing

Uh oh!

pelikhan commented Nov 29, 2025

Uh oh!

Copilot AI commented Nov 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

pelikhan commented Nov 29, 2025

Uh oh!

Copilot AI commented Nov 29, 2025

Uh oh!

pelikhan commented Nov 29, 2025

Uh oh!

Copilot AI commented Nov 29, 2025

Uh oh!

pelikhan commented Nov 30, 2025

Uh oh!

pelikhan commented Nov 30, 2025

Uh oh!

github-actions bot commented Nov 30, 2025

⚠️ Dev Hawk Report - Failure Analysis

Root Cause Analysis

Primary Error

Error Context

Failure Category

Recommended Actions

Investigation Notes

Uh oh!

Copilot AI commented Nov 30, 2025

Uh oh!

pelikhan commented Nov 30, 2025

Uh oh!

github-actions bot commented Nov 30, 2025

Uh oh!

github-actions bot commented Nov 30, 2025

🔧 Q Investigation Report - MCP Server Launch Failure

Issue Analysis

Root Cause

Key Findings

Investigation Details

Recommended Actions

1. Fix the Safe-Inputs MCP Server Implementation

2. Consider Engine Change for Dev Workflow

3. Add MCP Server Validation

Why Not Creating a PR

Next Steps

Uh oh!

pelikhan commented Nov 30, 2025

Uh oh!

Copilot AI commented Nov 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Nov 30, 2025

✅ Dev Hawk Report - Success

Summary

Uh oh!

pelikhan commented Nov 30, 2025

Uh oh!

Copilot AI commented Nov 30, 2025

Uh oh!

pelikhan commented Nov 30, 2025

Uh oh!

Copilot AI commented Nov 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Copilot AI commented Nov 29, 2025 •

edited

Loading

Copilot AI commented Nov 29, 2025 •

edited

Loading

Copilot AI commented Nov 30, 2025 •

edited

Loading