Multi agent orchestrator by MasterMind7777777 · Pull Request #5190 · openai/codex

MasterMind7777777 · 2025-10-14T23:59:59Z

Summary

add the delegate_agent tool so the main Codex assistant can launch sub-agents without user prompts or MCP plumbing
keep the multi-agent design notes in ai-temp/ temporarily while the architecture settles (they will move or vanish before GA)
ship a read-only sample Codex home plus ai-temp/example-conversation.md to demonstrate the ideas-provider → critic workflow end-to-end

Why

Deep, specialized operations exhaust the primary context or dilute instructions. By shunting targeted work to sub-agents, we:

prevent large intermediate artifacts (e.g., heavy JSON, security checklists) from bloating the main conversation
unlock tailored system prompts, models, and reasoning effort per task without manual toggles
keep the UI and toolset identical to native Codex: the user still sees the same transcript while the orchestrator manages contexts/config under the hood
support heterogeneous models/efforts inside a single flow—simple steps on smaller models, heavy audits on higher-effort variants—without manual switching

Status

Work in progress. Delegation already functions for read-only demos, but polished instructions, diagnostics, and more sampling are still ahead.

Usage

# from the repo root
cd codex-rs
env RUSTFLAGS="" cargo build -p codex-cli

# launch the CLI against the sample multi-agent home
CODEX_HOME="$(git rev-parse --show-toplevel)/ai-temp/example-codex-home" target/debug/codex

See ai-temp/example-conversation.md for a complete transcript exercising ideas-provider → critic delegation.

Testing

cargo build -p codex-cli
manual CLI session with CODEX_HOME=…/ai-temp/example-codex-home target/debug/codex

alexx-ftw · 2025-10-15T00:09:14Z

Please make it be able to use a different model than its parent

MasterMind7777777 · 2025-10-15T00:13:59Z

Please make it be able to use a different model than its parent

Will do. My idea is that agents should have its own config.toml to configure them in exactly same way as base codex.

alexx-ftw · 2025-10-15T00:17:37Z

Please make it be able to use a different model than its parent

Will do. My idea is that agents should have its own config.toml to configure them in exactly same way as base codex.

Sounds good. We should be able to even set a different Base URL, API key and model for the subagents, to combine different providers and have a powerful orchestration system

towry · 2025-10-15T03:37:42Z

Please make it be able to use a different model than its parent

Will do. My idea is that agents should have its own config.toml to configure them in exactly same way as base codex.

subagent can just use a different profile. Currently I am using Codex as mcp server in claude code with different profile for different models and config.

…strain example prompts

MasterMind7777777 · 2025-10-15T23:26:53Z

Made a feature that allows switching into a child agent and working in it as in usual Codex, then returning to the main agent with the context collected in the sub agent. For fine-grained control over sub agents. To use, run /agent and select the agent to enter in to.

alexx-ftw · 2025-10-15T23:45:03Z

Made a feature that allows switching into a child agent and working in it as in usual Codex, then returning to the main agent with the context collected in the sub agent. For fine-grained control over sub agents. To use, run /agent and select the agent to enter in to.

That is a novelty. Haven't seen any other tool do that. Really interesting

alexx-ftw · 2025-10-16T11:40:10Z

i really hope all this work is appreciated and considered to be merged by the OpenAI Codex team

alexx-ftw · 2025-10-16T22:32:52Z

@codex review

chatgpt-codex-connector · 2025-10-16T22:41:36Z

Codex Review: Didn't find any major issues. Already looking forward to the next diff.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting

@codex fix this CI failure
@codex address that feedback

bogzbonny · 2025-10-18T17:35:44Z

Made a feature that allows switching into a child agent and working in it as in usual Codex, then returning to the main agent with the context collected in the sub agent. For fine-grained control over sub agents. To use, run /agent and select the agent to enter in to.

@MasterMind7777777 & @alexx-ftw. Wouldn't it be typical that one wouldn't want this? I could very well me missing something, but as I understand it, one of the core benefits of subagents is you keep the context paired down and clean in the parent agent, it doesn't receive the entire context of the subagent but only a summary which the subagent generates. Alternative to a subagent summary, I could see a highly compacted context also be useful (which effectively acts as a summary). What is the scenario which one would want the full context of the subagent? I suppose if the context is small enough or the LLM is large and good enough it wouldn't matter.

I also wanted to tag a previous discussion I've had on subagents for a different project (bosun-ai/kwaak#352) @MasterMind7777777 maybe you will find some of that discussion useful or relevant

alexx-ftw · 2025-10-18T17:38:54Z

Made a feature that allows switching into a child agent and working in it as in usual Codex, then returning to the main agent with the context collected in the sub agent. For fine-grained control over sub agents. To use, run /agent and select the agent to enter in to.

@MasterMind7777777 & @alexx-ftw. Wouldn't it be typical that one wouldn't want this? I could very well me missing something, but as I understand it, one of the core benefits of subagents is you keep the context paired down and clean in the parent agent, it doesn't receive the entire context of the subagent but only a summary which the subagent generates. Alternative to a subagent summary, I could see a highly compacted context also be useful (which effectively acts as a summary). What is the scenario which one would want the full context of the subagent? I suppose if the context is small enough or the LLM is large and good enough it wouldn't matter.

I also wanted to tag a previous discussion I've had on subagents for a different project (bosun-ai/kwaak#352) @MasterMind7777777 maybe you will find some of that discussion useful or relevant

Main Agent/Orchestrator still gets a summary. But this solution allows the human to control the subagent when wanted the same as it does with the main Agent, and then go back

MasterMind7777777 · 2025-10-18T17:56:10Z

Made a feature that allows switching into a child agent and working in it as in usual Codex, then returning to the main agent with the context collected in the sub agent. For fine-grained control over sub agents. To use, run /agent and select the agent to enter in to.

@MasterMind7777777 & @alexx-ftw. Wouldn't it be typical that one wouldn't want this? I could very well me missing something, but as I understand it, one of the core benefits of subagents is you keep the context paired down and clean in the parent agent, it doesn't receive the entire context of the subagent but only a summary which the subagent generates. Alternative to a subagent summary, I could see a highly compacted context also be useful (which effectively acts as a summary). What is the scenario which one would want the full context of the subagent? I suppose if the context is small enough or the LLM is large and good enough it wouldn't matter.

I also wanted to tag a previous discussion I've had on subagents for a different project (bosun-ai/kwaak#352) @MasterMind7777777 maybe you will find some of that discussion useful or relevant

Sub agents have their own context that they maintain (separate from parent agent) they only give last thing they produced as per request of main agent. But if you have seen their response of sub agent is bad, not what you expected then you might want to enter that agent to correct it by sending one or multiple clarifications so result that this sub agent provides to main agent is what you been looking for, and main agent using this updated response of subagent can handle final problem.

bogzbonny · 2025-10-18T18:24:38Z

OHHH I actually misunderstood what was being talked about (whoops!). Basically this is a feature with human interaction inside of the subagent... sounds useful. So is would the workflow as I imagine it:

main agent calls subagent
subagent produces subpar output, passing its summary back to main agent
maybe the main agent begins to work a bit
human halts the main agent (ctrl-c)
human enters subagent and sends additional prompts to improve the response of the subagent
codex starts up again with the new prompt, it starts working from the subagent
the subagent sends an additional response to the main agent
- QUESTION does the main agent still have all of its context (aka context genereted in (3.) OR does the main agent roll back its context to before it got its first response from the subagent (1.)

Is that workflow correct? what happens to the main context on subagent re-prompt?

You know what would be interesting too, to be able to have the main agent reprompt the subagent directly if it notices somethings is amiss with the subagent output. For a future PR though.

john-says-hi · 2025-10-19T04:27:30Z

nice contribution. This sounds like an exciting feature. Hope it works out good.

MasterMind7777777 · 2025-10-19T10:52:51Z

OHHH I actually misunderstood what was being talked about (whoops!). Basically this is a feature with human interaction inside of the subagent... sounds useful. So is would the workflow as I imagine it:

main agent calls subagent

subagent produces subpar output, passing its summary back to main agent

maybe the main agent begins to work a bit

human halts the main agent (ctrl-c)

human enters subagent and sends additional prompts to improve the response of the subagent

codex starts up again with the new prompt, it starts working from the subagent

the subagent sends an additional response to the main agent

QUESTION does the main agent still have all of its context (aka context genereted in (3.) OR does the main agent roll back its context to before it got its first response from the subagent (1.)

Is that workflow correct? what happens to the main context on subagent re-prompt?

You know what would be interesting too, to be able to have the main agent reprompt the subagent directly if it notices somethings is amiss with the subagent output. For a future PR though.

Yah that is how it works. Except context of parent agent is not cleared/altered when you come back from sub agent. Just not sure yet how will it look; if you enter agent after chatting with main agent for long time. Should we really wipe main context to that point where you last used that specific sub agent?
Mb add it as separate opt in feature to replace context instead of add when coming back from sub agent to main agent.

alexx-ftw · 2025-10-19T13:38:07Z

OHHH I actually misunderstood what was being talked about (whoops!). Basically this is a feature with human interaction inside of the subagent... sounds useful. So is would the workflow as I imagine it:

main agent calls subagent

subagent produces subpar output, passing its summary back to main agent

maybe the main agent begins to work a bit

human halts the main agent (ctrl-c)

human enters subagent and sends additional prompts to improve the response of the subagent

codex starts up again with the new prompt, it starts working from the subagent

the subagent sends an additional response to the main agent

QUESTION does the main agent still have all of its context (aka context genereted in (3.) OR does the main agent roll back its context to before it got its first response from the subagent (1.)

Is that workflow correct? what happens to the main context on subagent re-prompt?

You know what would be interesting too, to be able to have the main agent reprompt the subagent directly if it notices somethings is amiss with the subagent output. For a future PR though.

Yah that is how it works. Except context of parent agent is not cleared/altered when you come back from sub agent. Just not sure yet how will it look; if you enter agent after chatting with main agent for long time. Should we really wipe main context to that point where you last used that specific sub agent? Mb add it as separate opt in feature to replace context instead of add when coming back from sub agent to main agent.

To me it makes more sense to wipe the entire subagent chat history and reload the main agent chat, just like we do now when resuming a conversation/session

MasterMind7777777 · 2025-10-19T13:46:19Z

OHHH I actually misunderstood what was being talked about (whoops!). Basically this is a feature with human interaction inside of the subagent... sounds useful. So is would the workflow as I imagine it:

main agent calls subagent

subagent produces subpar output, passing its summary back to main agent

maybe the main agent begins to work a bit

human halts the main agent (ctrl-c)

human enters subagent and sends additional prompts to improve the response of the subagent

codex starts up again with the new prompt, it starts working from the subagent

the subagent sends an additional response to the main agent

QUESTION does the main agent still have all of its context (aka context genereted in (3.) OR does the main agent roll back its context to before it got its first response from the subagent (1.)

Is that workflow correct? what happens to the main context on subagent re-prompt?

You know what would be interesting too, to be able to have the main agent reprompt the subagent directly if it notices somethings is amiss with the subagent output. For a future PR though.

Yah that is how it works. Except context of parent agent is not cleared/altered when you come back from sub agent. Just not sure yet how will it look; if you enter agent after chatting with main agent for long time. Should we really wipe main context to that point where you last used that specific sub agent? Mb add it as separate opt in feature to replace context instead of add when coming back from sub agent to main agent.

To me it makes more sense to wipe the entire subagent chat history and reload the main agent chat, just like we do now when resuming a conversation/session

If we do that, there will be no way to bring context back from sub agent in to parent agent. So essentially work user done in sub agent will be lost and will not affect what happens in main agent. It might be useful in some use cases, but in others you might want to bring something back from sub agent to main agent.

MasterMind7777777 · 2025-10-19T13:53:44Z

Another option would be to set up tool for sub agent to communicate to parent agent explicitly. So it would be up to model what to serve upstream. Than you would be able to chat to sub agent without worrying about what context you want to be sent back to parent. But that will lose out on control over agents.

alexx-ftw · 2025-10-19T13:53:50Z

OHHH I actually misunderstood what was being talked about (whoops!). Basically this is a feature with human interaction inside of the subagent... sounds useful. So is would the workflow as I imagine it:

main agent calls subagent

subagent produces subpar output, passing its summary back to main agent

maybe the main agent begins to work a bit

human halts the main agent (ctrl-c)

human enters subagent and sends additional prompts to improve the response of the subagent

codex starts up again with the new prompt, it starts working from the subagent

the subagent sends an additional response to the main agent

QUESTION does the main agent still have all of its context (aka context genereted in (3.) OR does the main agent roll back its context to before it got its first response from the subagent (1.)

Is that workflow correct? what happens to the main context on subagent re-prompt?

You know what would be interesting too, to be able to have the main agent reprompt the subagent directly if it notices somethings is amiss with the subagent output. For a future PR though.

Yah that is how it works. Except context of parent agent is not cleared/altered when you come back from sub agent. Just not sure yet how will it look; if you enter agent after chatting with main agent for long time. Should we really wipe main context to that point where you last used that specific sub agent? Mb add it as separate opt in feature to replace context instead of add when coming back from sub agent to main agent.

To me it makes more sense to wipe the entire subagent chat history and reload the main agent chat, just like we do now when resuming a conversation/session

If we do that, there will be no way to bring context back from sub agent in to parent agent. So essentially work user done in sub agent will be lost and will not affect what happens in main agent. It might be useful in some use cases, but in others you might want to bring something back from sub agent to main agent.

Maybe I didn't explain myself properly:
I would treat the subagent conversation as a normal conversation internally, but when going back to the parent conversation, summarize it and provide as context to the main agent.
There could be a slash command for going back to main agent chat that summarizes or whatever exit process is done when finishing with the subagent.
This would also allow for the same subagent to be later invoked again if wanted, thus saving up time and tokens, by the main agent

alexx-ftw · 2025-10-19T13:57:15Z

Another option would be to set up tool for sub agent to communicate to parent agent explicitly. So it would be up to model what to serve upstream. Than you would be able to chat to sub agent without worrying about what context you want to be sent back to parent. But that will lose out on control over agents.

Subagents could be running in the background performing their tasks, and the main agent could probe their progress at any point, or trigger them to provide a summary of the work done so far and whats missing, send them corrective instructions, and also the main agent receive a trigger once the subagent has marked its work as 100% done.
The main agent AND the user should have the capability of forcefully killing a subagent at any point.
Another setting could be how many levels deep can subagents be spawned, defaulted to only 1.

MasterMind7777777 · 2025-10-19T14:02:19Z

OHHH I actually misunderstood what was being talked about (whoops!). Basically this is a feature with human interaction inside of the subagent... sounds useful. So is would the workflow as I imagine it:

main agent calls subagent

subagent produces subpar output, passing its summary back to main agent

maybe the main agent begins to work a bit

human halts the main agent (ctrl-c)

human enters subagent and sends additional prompts to improve the response of the subagent

codex starts up again with the new prompt, it starts working from the subagent

the subagent sends an additional response to the main agent

QUESTION does the main agent still have all of its context (aka context genereted in (3.) OR does the main agent roll back its context to before it got its first response from the subagent (1.)

Is that workflow correct? what happens to the main context on subagent re-prompt?

You know what would be interesting too, to be able to have the main agent reprompt the subagent directly if it notices somethings is amiss with the subagent output. For a future PR though.

Yah that is how it works. Except context of parent agent is not cleared/altered when you come back from sub agent. Just not sure yet how will it look; if you enter agent after chatting with main agent for long time. Should we really wipe main context to that point where you last used that specific sub agent? Mb add it as separate opt in feature to replace context instead of add when coming back from sub agent to main agent.

To me it makes more sense to wipe the entire subagent chat history and reload the main agent chat, just like we do now when resuming a conversation/session

If we do that, there will be no way to bring context back from sub agent in to parent agent. So essentially work user done in sub agent will be lost and will not affect what happens in main agent. It might be useful in some use cases, but in others you might want to bring something back from sub agent to main agent.

Maybe I didn't explain myself properly:

I would treat the subagent conversation as a normal conversation internally, but when going back to the parent conversation, summarize it and provide as context to the main agent.

There could be a slash command for going back to main agent chat that summarizes or whatever exit process is done when finishing with the subagent.

This would also allow for the same subagent to be later invoked again if wanted, thus saving up time and tokens, by the main agent

That is how it works right now. So when main agent invokes sub agent it prompts it, sub agent thinks, does its thing, than finally provides response(final result) we take that final result and feed it in to main agent and main agent continues working.
Later we have option to enter that sub agent run and prompt it more, than it will do it's thinking and produce result
When we done with sub agent and want to go back to main agent we take only user requests and final agent responses to each user request(without intermediate thinking)

MasterMind7777777 · 2025-10-19T14:11:51Z

Another option would be to set up tool for sub agent to communicate to parent agent explicitly. So it would be up to model what to serve upstream. Than you would be able to chat to sub agent without worrying about what context you want to be sent back to parent. But that will lose out on control over agents.

Subagents could be running in the background performing their tasks, and the main agent could probe their progress at any point, or trigger them to provide a summary of the work done so far and whats missing, send them corrective instructions, and also the main agent receive a trigger once the subagent has marked its work as 100% done. The main agent AND the user should have the capability of forcefully killing a subagent at any point. Another setting could be how many levels deep can subagents be spawned, defaulted to only 1.

Not shure if its possible to let model to terminate/correct in-flight agent, it would need to wait for agent turn to finish and than act on it, by accepting result and proceeding or asking follow up to agent.

As per what agents can can spawn
via config.toml

[multi_agent]
agents = ["request_summarizer", "ideas_provider", "critic"]

than sub agent ideas_provider config.toml would have its own agent set up

[multi_agent]
agents = ["creative_ideas", "conservative_ideas"]

so you can set up what sub agents can each agent invoke.

MasterMind7777777 · 2025-10-19T14:17:44Z

Allthough i did add capability to run detached agent that is not awaited by parent agent. So it is none blocking agent run that you may or may not join(only final responce, no intermidiate thinking) in main agent context.

bogzbonny · 2025-10-31T17:57:13Z

Well apparently new features aren't been considered by openai (see the other multi-agent PR). time 2 ✌️

etraut-openai · 2025-10-31T22:41:59Z

Thanks for the contribution, and apologies for the slow response. We've received many PRs, and we don't have the bandwidth on the codex team to review all of them.

We've updated our contribution guidelines to clarify that we're currently accepting contributions for bugs and security fixes, but we're not generally accepting new features at this time. We need to make sure that all new features compose well with both existing and upcoming features and fit into our roadmap. If you would like to propose a new feature, please file or upvote an enhancement request in the issue tracker. We will generally prioritize new features based on community feedback.

towry · 2025-11-01T03:08:49Z

Wasting other people's time

John0x · 2025-11-01T04:06:41Z

It's probably best to create a fork :)
I would argue that sub-agents are a pretty good selling point.

The "just-every/code" fork is doing pretty good too

MasterMind7777777 added 5 commits October 14, 2025 16:15

initial proto commit

78853ae

explicit agent call implementation

725f60f

finalised docs for agent invoke aproach

d894ca8

integrate sub-agent as tool call

542f590

update example codex-home to be simpler

23c6370

MasterMind7777777 marked this pull request as draft October 15, 2025 00:00

MasterMind7777777 mentioned this pull request Oct 15, 2025

feat: implement (multi) subagent orchestration system #3655

Closed

MasterMind7777777 added 2 commits October 15, 2025 14:51

doc: note delegate transition silence

525671b

Merge branch 'origin/main' into multi-agent-orchestrator

4329bdc

MasterMind7777777 force-pushed the multi-agent-orchestrator branch from c38c76d to 4329bdc Compare October 15, 2025 16:10

MasterMind7777777 added 4 commits October 15, 2025 17:06

plan doc for agent-switching feat

172f9f4

Tighten example agent prompts for lean delegation

fee47b6

Improve agent switching flow, show active delegate indicator, and con…

410e940

…strain example prompts

Remove inline #agent autocomplete from TUI

46a2eea

MasterMind7777777 added 2 commits October 16, 2025 11:36

Merge remote-tracking branch 'origin/main'

1a35a07

Apply clippy cleanups after merge

f59baab

MasterMind7777777 added 3 commits October 16, 2025 13:33

Enable nested delegation and update docs

521bed6

Introduce parallel delegate execution with batching

2da4372

Support detached agents with optional context reattach

738873e

Merge origin/main into multi-agent-orchestrator

5450d29

jiaqiwang969 added a commit to jiaqiwang969/codex-with-gemini-integration that referenced this pull request Oct 18, 2025

添加: /agent 参考于PR openai#5190

611c598

plan shadow clinet for agents to interacte with

baa22c6

MasterMind7777777 added 2 commits October 19, 2025 19:11

organize agent-to-agent streams and cache transcripts for clearer output

75ff775

Added follow-up delegation support

864461c

etraut-openai closed this Oct 31, 2025

github-actions Bot locked and limited conversation to collaborators Oct 31, 2025

openai unlocked this conversation Oct 31, 2025

Conversation

MasterMind7777777 commented Oct 14, 2025

Summary

Why

Status

Usage

Testing

Uh oh!

alexx-ftw commented Oct 15, 2025

Uh oh!

MasterMind7777777 commented Oct 15, 2025

Uh oh!

alexx-ftw commented Oct 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

towry commented Oct 15, 2025

Uh oh!

MasterMind7777777 commented Oct 15, 2025

Uh oh!

alexx-ftw commented Oct 15, 2025

Uh oh!

alexx-ftw commented Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alexx-ftw commented Oct 16, 2025

Uh oh!

chatgpt-codex-connector Bot commented Oct 16, 2025

Uh oh!

bogzbonny commented Oct 18, 2025

Uh oh!

alexx-ftw commented Oct 18, 2025

Uh oh!

MasterMind7777777 commented Oct 18, 2025

Uh oh!

bogzbonny commented Oct 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

john-says-hi commented Oct 19, 2025

Uh oh!

MasterMind7777777 commented Oct 19, 2025

Uh oh!

alexx-ftw commented Oct 19, 2025

Uh oh!

MasterMind7777777 commented Oct 19, 2025

Uh oh!

MasterMind7777777 commented Oct 19, 2025

Uh oh!

alexx-ftw commented Oct 19, 2025

Uh oh!

alexx-ftw commented Oct 19, 2025

Uh oh!

MasterMind7777777 commented Oct 19, 2025

Uh oh!

MasterMind7777777 commented Oct 19, 2025

Uh oh!

MasterMind7777777 commented Oct 19, 2025

Uh oh!

bogzbonny commented Oct 31, 2025

Uh oh!

etraut-openai commented Oct 31, 2025

Uh oh!

towry commented Nov 1, 2025

Uh oh!

John0x commented Nov 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

alexx-ftw commented Oct 15, 2025 •

edited

Loading

alexx-ftw commented Oct 16, 2025 •

edited

Loading

bogzbonny commented Oct 18, 2025 •

edited

Loading