2b

A modular AI agent framework built with Bun. Features a plugin-based architecture, persistent semantic memory, and a rich tool set — designed to run against a local LLM via LM Studio.

Requirements

Bun v1.3.9+
LM Studio running locally with a model loaded (default: qwen/qwen3.5-35b-a3b)

Optional (enables specific plugins):

ffmpeg in PATH — FFmpegPlugin, YtDlpPlugin, microphone input
yt-dlp in PATH — YtDlpPlugin (video clip downloads)
whisper.cpp server — microphone transcription
Docker or Apple Container (macOS) — CodeSandboxPlugin (isolated Python execution)
TMDB API key — TMDBPlugin (movie/people lookup)

Setup

bun install

The agent will run without a .env file using defaults. To customise, create a .env at the project root:

MODEL=your-model-name
LM_STUDIO_URL=ws://127.0.0.1:1234
TMDB_API_KEY=your_key_here

Running

Interactive terminal UI (recommended):

bun src/ui/terminal/run.tsx

Renders a full Ink-based terminal chat UI with inline permission prompts, streaming output, and dynamic agent observability.

One-shot CLI mode — pass a message as an argument or pipe stdin:

bun run index.ts "what time is it?"
echo "summarize this" | bun run index.ts --quiet
cat error.log | bun run index.ts "explain this error"

Flags (both modes):

Flag	Description
`--model <name>`	Override the model (e.g. `qwen3:8b`)
`-m, --model <name>`	Override the model (CLI mode only)
`-q, --quiet`	Output response text only, no labels or colors (CLI mode only)
`--no-reasoning`	Suppress `<think>` reasoning output (CLI mode only)
`-t, --tools`	Print tool calls to stderr as they happen (CLI mode only)
`-h, --help`	Show help (CLI mode only)

Memory subcommands (CLI mode):

bun run index.ts memory list
bun run index.ts memory search <query>
bun run index.ts memory clear

With debug logging:

LOG_LEVEL=DEBUG bun src/ui/terminal/run.tsx

Architecture

src/ui/terminal/run.tsx
  └── CortexAgent (wraps BaseAgent + memory)
        ├── CortexMemoryPlugin   (auto — long-term semantic memory)
        ├── ThoughtPlugin        (auto — captures <think> blocks)
        ├── MetacognitionPlugin  (auto — cognitive state tracking)
        ├── explore_codebase     → SubAgentPlugin [CodeReaderAgent, own LLM]
        ├── DynamicAgentPlugin   (create_agent, call_agent, list_agents)
        │     ├── preset: media  → HeadlessAgent [YtDlp, FFmpeg, ImageVision, Download]
        │     └── preset: info   → HeadlessAgent [TMDB, Weather, Wikipedia, RSS]
        ├── FileSystemPlugin     (sandboxed to cwd)
        ├── ShellPlugin          (read-only shell commands)
        ├── MinimalTools         (get_current_time, echo)
        ├── ScratchPlugin        (session scratch pad)
        ├── MemoryPlugin         (short-term conversation history)
        └── LMStudioProvider     (LLM backend, shared with sub-agents)

The central BaseAgent manages two input queues:

Direct — requires a response (addDirect())
Ambient — background context; agent can reply [IGNORE] to skip (addAmbient())

Each tick, the agent collects history and context from all plugins, assembles a fresh system prompt, calls the LLM, and emits the response via a "speak" event.

CortexAgent wraps BaseAgent and automatically registers CortexMemoryPlugin (long-term semantic memory), ThoughtPlugin (reasoning capture), and MetacognitionPlugin (cognitive state tracking and tool saturation detection).

DynamicAgentPlugin replaces the old static sub-agent roster. Two preset agents (media, info) are created at startup and are immediately callable. The orchestrator can also spawn new HeadlessAgent or CortexSubAgent instances at runtime via create_agent using any combination of capability plugins. Sub-agent tool calls bubble up as subagent_tool_call events, surfaced in the terminal UI.

explore_codebase remains a static SubAgentPlugin because its CodeReaderAgent instantiates its own LLM connection (a code-specific model) that cannot be replicated through the generic capability system.

Terminal UI is built with Ink. Ink owns stdin — there is no CLIInputSource in this mode. InkPermissionManager prompts inline when tools require permission.

Plugins

Plugins implement the AgentPlugin interface and contribute system prompt fragments, per-turn context, tool definitions, and lifecycle hooks. All hooks are optional — implement only what you need.

Plugin	Description
`CortexMemoryPlugin`	Persistent semantic memory with embedding-based search — auto-registered by `CortexAgent`; tools: `search_memory`, `save_memory`, `save_behavior`, `save_procedure`, `edit_memory`, `delete_memory`, and more
`ThoughtPlugin`	Auto-captures `<think>` reasoning blocks as thought memories — auto-registered by `CortexAgent`
`MetacognitionPlugin`	Tracks cognitive state per turn; detects tool saturation; tools: `introspect`, `memory_status`, `show_active_rules`, `list_registered_plugins`, `list_available_tools`, `get_system_prompt` — auto-registered by `CortexAgent`
`MemoryPlugin`	Short-term conversation history (max 15 messages, auto-summarize)
`SubAgentPlugin`	Wraps a `HeadlessAgent` as a single callable tool on the orchestrator
`DynamicAgentPlugin`	Spawns and calls sub-agents at runtime from a capability registry; preset agents (`media`, `info`) are created at startup; tools: `create_agent`, `call_agent`, `list_agents`, `list_capabilities`
`ScratchPlugin`	Session-scoped scratch pad in `/tmp/agent-{sessionId}/`; tools: `scratch_write`, `scratch_read`, `scratch_list`, `scratch_delete`
`FileSystemPlugin`	File read/write/copy/move/delete tools sandboxed to cwd
`ImageVisionPlugin`	Image analysis via a local vision model; tools: `analyze_image_url`, `analyze_image_file`
`WebSearchPlugin`	Web search via DuckDuckGo instant answers
`WebReaderPlugin`	Fetch and extract readable content from HTTPS pages
`WikipediaPlugin`	Search and fetch Wikipedia articles (no API key required)
`RSSPlugin`	Fetch and parse RSS/Atom feeds
`TMDBPlugin`	Movie and people lookup via TMDB API; requires `TMDB_API_KEY`
`WeatherPlugin`	Current weather via Open-Meteo (no API key required)
`NotesPlugin`	Persistent markdown notes saved to `notes/`
`ShellPlugin`	Read-only shell command execution (ls, git, cat, grep, etc.)
`ClipboardPlugin`	macOS clipboard read/write
`YtDlpPlugin`	Download video clips from Twitch, YouTube, etc. via yt-dlp
`FFmpegPlugin`	Video editing: trim, convert, crop, resize, extract audio, merge, and more
`CodeSandboxPlugin`	Execute Python 3.11 in an isolated container; a coding model generates code from a plain-language task description
`AudioPlugin`	Classifies microphone speech as direct/ambient via intent detection

Memory

Long-term memory is stored in SQLite at ~/.local/share/2b/data/2b.cortex.sqlite (respects XDG_DATA_HOME). The CortexMemory system stores embeddings alongside text and retrieves relevant memories via cosine similarity on each turn.

Four memory types:

factual — specific facts and decisions
thought — internal reasoning (auto-captured from <think> blocks)
behavior — persistent behavioral rules injected into the system prompt every turn
procedure — step-by-step instructions for recurring tasks

Environment Variables

Variable	Default	Description
`MODEL`	`qwen/qwen3.5-35b-a3b`	Chat model name as shown in LM Studio
`LM_STUDIO_URL`	`ws://127.0.0.1:1234`	LM Studio WebSocket endpoint
`CODE_MODEL`	`qwen2.5-coder-7b-instruct-mlx`	Model used by `CodeSandboxPlugin` to generate Python
`VISION_MODEL`	`google/gemma-3-4b`	Model used by `ImageVisionPlugin` for image analysis
`WHISPER_ENDPOINT`	`http://localhost:8080/inference`	whisper.cpp HTTP endpoint for microphone transcription
`TMDB_API_KEY`	—	Required for `TMDBPlugin`
`LOG_LEVEL`	`OFF`	`DEBUG` / `INFO` / `WARN` / `ERROR` / `OFF`

Extending

Add a plugin — implement AgentPlugin (see src/plugins/CLAUDE.md for the full lifecycle reference) and register it in src/ui/terminal/run.tsx.

Add a capability — add a new entry to the capability registry in DynamicAgentPlugin so the agent can include it when spawning runtime sub-agents.

Add a preset sub-agent — add an entry to the presets map in DynamicAgentPlugin in src/ui/terminal/run.tsx with a system prompt and capability list.

Add a static sub-agent (when it needs its own LLM or specialized setup) — create a factory in src/agents/sub-agents/, then register it via SubAgentPlugin in run.tsx. See src/agents/sub-agents/CLAUDE.md.

Swap the LLM backend — implement LLMProvider from src/providers/llm/LLMProvider.ts and pass it to BaseAgent or CortexAgent.

Testing

bun test

Tests use bun:test. Memory tests pass :memory: as the memoryDbPath to avoid touching the filesystem.

Name		Name	Last commit message	Last commit date
Latest commit History 337 Commits
.claude		.claude
docs		docs
prompts		prompts
scripts		scripts
src		src
.gitignore		.gitignore
2b.ts		2b.ts
CLAUDE.md		CLAUDE.md
LICENSE.md		LICENSE.md
README.md		README.md
bun.lock		bun.lock
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

2b

Requirements

Setup

Running

Architecture

Plugins

Memory

Environment Variables

Extending

Testing

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

2b

Requirements

Setup

Running

Architecture

Plugins

Memory

Environment Variables

Extending

Testing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages