Agent Lab

A team of AI scientist colleagues — not assistants who execute, but peers who think, argue, and push back. Each has distinct values and taste. The value is in the dialectic.

Built on Claude Code Agent Teams. Lead is the team lead; other personas are teammates. You (the PI) steer through Lead. Coordination, messaging, and task management are handled by Agent Teams natively.

Key principle: Surface tensions, don't force consensus. Disagreements are logged, not resolved. You break ties.

Quick Start

Have Claude Code? Run this in Claude in any repo:

Set up the agent lab in this repo! https://github.com/harangju/agent-lab/

Claude will clone the template, copy the skills and config into your project, and walk you through setup.

Manual Setup

This is a GitHub template repo. Click "Use this template" to create your own copy, then:

Set your field. Open CLAUDE.md and replace [field] on line 1 with your research area (e.g., "computational neuroscience", "mechanism design", "synthetic biology").
Set your direction. In CLAUDE.md under "Current Direction", add 1–2 lines describing what the lab is working on now.
(Optional) Add a Semantic Scholar API key. The Semantic Scholar MCP server is pre-configured in .claude/settings.json. Replace your-api-key-here with your key, or remove the env block to use the free tier. Free tier is 100 requests / 5 min — an /explore can burn through this quickly. Get a key at semanticscholar.org/product/api.
Run Claude Code from the repo root. The skills will be available as slash commands.

How to Use

Pick the skill that matches your stage:

You have...	Run...
A topic, no idea yet	`/explore [topic]`
An idea or design	`/plan [idea/design]`
Results	`/analyze [results]`
Notes, need prose	`/draft [section]`
A draft or talk	`/review [path]`
Feedback to address	`/revise [feedback]`
Ready to start	`/team`
End of session	`/save`

Let them argue. Steer the debate as PI — break ties, redirect, ask follow-ups.

Personas

MECE decomposition of research critique. Each persona owns one dimension. Lead is the team lead (presents, defends, handles persistence); others are teammates. Spawn prompts live in personas.md.

Persona	Dimension	Asks	Prevents
Lead (team lead)	Presentation	Steel-mans the argument, defends it, handles persistence	Weak framing, unclear narrative
Skeptic	Novelty	"Has this been done? What's actually new?"	Wasting time on existing work
Methodologist	Rigor	"Is this sound? Would it generalize?"	Publishing claims you can't defend
Visionary	Impact	"So what? Who cares? Think bigger."	Working on incremental stuff that won't matter
Newcomer	Clarity	"You lost me. Explain it simply."	Jargon-heavy work that nobody reads
Connector	Integration	"How does this relate to X in [other field]?"	Missing complementary work, thinking in silos

Natural tensions:

Visionary wants to swing big → Methodologist asks if the methods support it
Visionary says "Nature material" → Skeptic asks what's actually new
Methodologist wants more experiments → Visionary asks if it's worth the time
Connector draws links to adjacent fields → Skeptic asks if those links are real
Newcomer says the framing is unclear → Lead defends, others weigh in on whether the confusion reveals a real gap

Workflows

Skills map to the research pipeline. Each skill is a stage where the group argues about your work.

explore → plan → [execute] → analyze → draft → review
                                                  ↓
                                                revise ←→ (loops back to any stage)

Bootstrapping: explore → plan. Revision can loop back to earlier stages.

Explore

Parallel exploration. No presenter — everyone searches independently and synthesizes. /explore [topic]

Plan

Present → critique → debate. Covers both early ideas (go/no-go) and concrete designs (methodology review). /plan [idea or design]

Analyze

Present results → argue over interpretation → build narrative. The group debates what the results mean, not just whether they're correct. /analyze [results]

Draft

Generate prose from notes using rhetorical structure templates. Each section has a template encoding the standard moves — Abstract (Nature summary paragraph format), Introduction (CARS model), Results (figure-driven blocks), Discussion (inverted funnel), Methods (technique-organized). /draft [section]

Review

Phased critique of any written artifact — manuscript, talk, poster. Phase 1: extract argument + search literature. Phase 2: evaluate by dimension. Phase 3: debate readiness. /review [path]

Revise

Triage incoming feedback — reviewer comments, collaborator notes, PI comments — and plan revision. The group debates what to concede vs. push back on, then produces a revision plan. /revise [feedback]

Reflect

Meta-learning. Synthesizes PI feedback from log/ into candidate lab norms and persona calibration suggestions. /reflect

Team

Spawn the full research group as a persistent agent team. Run this at the start of a session to bring all personas online. /team

Save

Checkpoint the session — Lead appends a log entry to log/ and rewrites context.md with compacted state. /save

Context Management

State persists across sessions through three layers.

`CLAUDE.md`	`context.md`	`log/`
How to behave	What matters now	Everything that happened
Stable	Rewritten every session	Append-only
You maintain	Lead maintains	Lead maintains

Session flow

Session starts  → agents read context.md
During session  → agents work and return results, no file writes
Session ends    → Lead appends session log to log/
                → Lead rewrites context.md via compaction

Agents don't write files during the session — avoids conflicts when running in parallel. All persistence happens at the end through Lead.

Learning loop

The lab improves by learning from your feedback. Two levels:

1. Lab norms → CLAUDE.md (all personas read)

Your steering — tie-breaks, corrections, quality signals — gets logged to log/ with everything else. Periodically run /reflect to synthesize recurring patterns into Lab Norms.

2. Persona tuning → personas.md (per-persona)

When a specific persona is consistently miscalibrated, update its spawn prompt in personas.md directly.

File Structure

├── CLAUDE.md                        # lab rules + session protocol (auto-loaded)
├── personas.md                      # spawn prompts (read by Lead on demand)
├── context.md                       # compacted state (Lead maintains)
├── log/                             # session logs (Lead appends)
│   └── YYYY-MM-DD-HHmm.md
└── .claude/
    ├── settings.json                # MCP config, permissions
    └── skills/
        ├── explore/SKILL.md
        ├── plan/SKILL.md
        ├── analyze/SKILL.md
        ├── draft/SKILL.md
        ├── review/SKILL.md
        ├── revise/SKILL.md
        ├── reflect/SKILL.md
        ├── team/SKILL.md
        └── save/SKILL.md

Vocabulary

Useful words when talking to Claude:

connective tissue — prose that links sections or ideas (drafting)
load-bearing — assumptions or components critical to the argument (ideation)

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.claude		.claude
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
abstract.png		abstract.png
context.md		context.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agent Lab

Quick Start

Manual Setup

How to Use

Personas

Workflows

Explore

Plan

Analyze

Draft

Review

Revise

Reflect

Team

Save

Context Management

Session flow

Learning loop

File Structure

Vocabulary

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Agent Lab

Quick Start

Manual Setup

How to Use

Personas

Workflows

Explore

Plan

Analyze

Draft

Review

Revise

Reflect

Team

Save

Context Management

Session flow

Learning loop

File Structure

Vocabulary

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages