Conductor — cockpit, memory, and MCP for Claude Code, Codex, and agent fleets

The problem

You can't leave an agent alone — so you babysit ten

Agentic coding made it cheap to run five, ten, fifteen autonomous sessions at once. But each one can quietly deploy, send, delete, or spend the moment it decides to — so you can't actually walk away. You sit there alt-tabbing, ready to yank the wheel, doing the one job a fleet was supposed to free you from. The bottleneck isn't compute. It's that you don't trust them to run unwatched.

Autopilot, not hands-on

Hand off the cruise; keep takeoff and landing

Conductor lets the reversible work run end-to-end — edits, tests, commits, the back-and-forth a window can answer for itself. It reads the live .jsonl trails your agent CLIs already write, from ~/.claude/projects/ to ~/.codex/sessions/, so it knows what every agent is doing without you watching.

→ Auto-continues the safe, reversible steps so a window never stalls waiting on a routine "yes".
→ Floats the one window that actually needs you — wedged, done, or stopped at the gate — to the top.
→ Supervise by exception: Working now / Waiting / Open / Idle, problems first, one row per window.

append-only trails: ~/.claude/projects + ~/.codex/sessions

# discover → liveness → parse → group → status → sort
$ conductor
# one row per live window, problems first

  ● Build SOAG grid          claude · 4s
  ● Patch Codex labels       codex  · 11s
  ● Deploy to prod?          GATED · 1m
  ● Site design review       codex  · 17h
          

The safety model

The autopilot flies the cruise. You own the landing.

This is the whole product. The fleet runs reversible work on its own — but the moment a window's next step would do something irreversible, Conductor physically stops and hands that one decision back to you. Auto-continue is commodity; auto-continue plus a gate you can't cross without a human thumb is the part you can trust. You can look away because it can't ship a deploy or move funds without you.

✓ AUTO-CONTINUE

“Tests pass — shall I update the README and commit?”

reversible → sends “continue”, work keeps moving

⛔ GATED → HUMAN

“Ready to deploy to prod / send the email / spend 0.4 SOL?”

touches deploy·send·delete·spend → refuses, returns reason

The bias is to stop when unsure: a false gate costs one manual reply; a false pass can ship a bad deploy or move real funds. The irreversibility gate lives in policy.js — auditable, no model in the loop.

Three surfaces, one engine

The cockpit is where you look when the fleet needs you

The cockpit isn't the product — the bounded autonomy is. These are just the three ways to keep one eye on a fleet that's running itself, and to land the calls only you can make.

📊

The cockpit

Run conductor up for a live web view. Every window a card, color-coded by status, auto-refreshing — so a glance tells you the fleet's still flying and nothing's hit the gate. When one stops for a yes/no, it surfaces with one-tap Yes / No / Continue / Review.

🔌

The MCP server

Conductor speaks the Model Context Protocol over stdio. An orchestrator agent calls list_sessions, whats_left, pending_questions and the gated control tools to run the fleet for you — and still can't push past the irreversibility gate.

⌘

The Codex adapter

Codex sessions get the same cockpit treatment: project labels, intent, last action, waiting state, and managed replies. Launch codex-conductor up when your fleet is Codex-first.

🎛️

Opt-in control

Launch managed windows through tmux, or adopt an existing one (forked, history intact) so Conductor can drive it. Even then, control stops at the gate — and read-only stays read-only.

The abstraction

If it narrates itself to disk, Conductor can watch it

Conductor is supervisory awareness over a fleet of semi-autonomous workers that already emit an append-only trail. Claude Code and Codex are adapters. Swap the adapter, keep the engine — grouping, status, and all three surfaces come free.

Adapter

Reads

Control

claude-code

Your ~/.claude/projects transcripts; liveness from a live claude process

tmux send-keys (managed)

codex-code

Your ~/.codex/sessions transcripts plus session_index.jsonl; infers project context even when Codex starts from home

tmux send-keys (managed)

fleet

Trading-bot events.jsonl; derives wedged orders & drawdown, session PnL

pause · resume · flatten

mev-searcher

Searcher event logs; derives feed-dead, losing-every-race, bleeding-after-gas

pause · kill · unwind (gated)

validator-fleet

The chain itself — one batched RPC poll learns delinquency, catchup, skip-rate

observe-only by default

A domain fits when it has all four: many units with intent · a trail that already exists · a liveness signal · supervise-by-exception status. Write one file at adapters/<name>.js and every surface works with --adapter <name>.

Codex support

Codex gets the same supervisory layer

Codex writes durable local transcripts under ~/.codex/sessions. Conductor turns those trails into the same status model: what is working now, what is open, what is waiting, and which project a session actually belongs to.

Classic Conductor for Claude Code

Use this when the active fleet is Claude Code windows, or when you want the broader adapter set for bots, validators, MEV searchers, and sales agents.

$ npm install -g conductor-cli
$ conductor up
$ conductor mcp

Codex Conductor for Codex sessions

Use this when the work is happening in Codex. It reads Codex transcripts, resolves project labels, and exposes the same cockpit and MCP control plane.

$ npm install -g @yksanjo/codex-conductor
$ codex-conductor up
$ codex-conductor mcp

Unified memory · companion MCP

Give the fleet one explicit memory vault

Conductor tells you what every agent is doing. The companion memory MCP gives those agents one shared place to store durable context: preferences, project decisions, recurring workflows, known pitfalls, and team rules that should not disappear when a window closes.

One local vault, many MCP clients

Memory stays in a user-controlled vault under ~/.unified-memory-mcp. Codex connects over stdio. ChatGPT apps and remote agents can use the same tools through the optional Streamable HTTP endpoint when you put it behind HTTPS and auth.

Codex and Conductor sessions search first Relevant preferences and project context come back as explicit records, not vague recall.

Agents propose stable memories Store only things worth reusing: decisions, workflows, conventions, and durable context.

You can inspect and delete the source of truth The vault is plain JSON plus an audit log. It is not hidden model memory.

The memory tool surface

The MCP exposes small, boring tools. That keeps orchestration predictable and lets each client decide when it should read, write, or forget context.

remember

search_memory

list_memories

get_memory

update_memory

forget_memory

# clone the companion memory MCP
$ git clone https://github.com/yksanjo/unified-memory-mcp ~/unified-memory-mcp
$ cd ~/unified-memory-mcp && npm install

# add it to Codex
$ codex mcp add unified_memory \
      --env MEMORY_MCP_HOME=$HOME/.unified-memory-mcp \
      -- node ~/unified-memory-mcp/src/server.js

Use memory for context, not authority. Required repo rules still belong in AGENTS.md or checked-in docs. Never store secrets, tokens, private keys, or raw sensitive personal data in the vault.

Get Memory MCP →

Conductor V2 · swarms

Design the formation, then fire it

V1 watches windows you opened by hand. Its sibling Conductor V2 flips the order: pick a formation, set one purpose, press FIRE — and a fleet of Claude Code windows launches into tmux and coordinates through two dumb, reliable channels: a shared swarm directory for artifacts and a per-swarm swarm-say helper for one-line handoffs. The formation decides who talks to whom and who starts.

🏛

Hierarchical

Orchestrator → workers

      ┌─ ORC ─┐
  ┌───┼───┬───┼───┐
  w1  w2  w3  w4

One orchestrator decomposes the mission, delegates a task per worker, collects reports, and synthesizes. Best when the work splits into independent chunks.

⛓

Pipeline

Sequential stages

s1 ─▶ s2 ─▶ s3 ─▶ s4
 each stage hands
 off to the next

Stages run in order; each consumes the previous stage's output and hands off. Best for a natural assembly line — recon → audit → verify → report.

🕸

Mesh

Peer-to-peer

  p1 ─── p2
   │ ╲ ╱ │
   │ ╱ ╲ │
  p3 ─── p4

Equal peers self-organize: each claims a distinct angle, works it, and broadcasts findings to the rest. Best for breadth — sweep a space from several directions at once.

Fire it from any agent. The V2 MCP exposes list_formations, plan_swarm (dry-run), fire_swarm, list_swarms, and stop_swarm — plus presets for deep research, market-bot sweeps, and web3 security reviews.

Explore swarms & formations →

# register the V2 swarm MCP
$ claude mcp add conductor2 --scope user -- node ~/conductor-v2/mcp.js

# then, from any session:
› plan a mesh swarm of 4 to research x402, then fire it

Conductor · cameras

Point the same cockpit at a camera

V1 watches your Claude windows; V2 fires swarms. The newest sibling, Conductor Camera Monitor, watches cameras. Each RTSP feed becomes an AI detective — a Claude session with a one-line mandate. It samples frames, Claude's own vision does the seeing (no model to train — it watches anything you can describe), a motion gate keeps 24/7 watch cheap, and every finding lands in a hash-chained alert ledger you can verify.

🔒

Security

front door · after dark

watch front_door:
 alert when a
 person appears
 after 10pm

A person enters frame at night → alert. The same person still standing there next frame stays quiet — dedup fires only on change.

🌱

Farming

greenhouse · crop health

watch greenhouse:
 warn when a
 crop bed looks
 dry or wilted

No ML model, no labels. If you can describe what "dry" looks like, the detective can flag it — and tell you which bed, with the frame as evidence.

🏭

Production

packing line · uptime

watch line_1:
 critical if the
 conveyor is empty
 for over 30s

Object counts, no-go zones, defects, stoppages — any rule that's easier to say than to train. Escalates honestly: info → warn → alert → critical.

One prompt onboards a detective. The MCP exposes add_feed, grab_frame (returns the frame as an image to look at + a motion score), emit_alert (hash-chained, auto-deduped), and recent_alerts — plus a live web cockpit: a wall of camera tiles with a color-coded alert stream.

Get Camera Monitor →

# register the camera MCP, then add a feed
$ claude mcp add camera-monitor -- node ~/conductor-camera-monitor/bin/ccm.mjs serve
$ ccm add front_door "rtsp://…" --mandate "alert when a person appears"

# onboard the detective (prints the one prompt to paste) + open the cockpit
$ ccm detective front_door
$ ccm dashboard          # → localhost:4055

Local-first by design

It only reads your own machine

Watching is read-only observation of trails that already exist — nothing leaves your laptop. The cockpit binds to 127.0.0.1 only.

✓ Reads your own ~/.claude and ~/.codex trails — never another user's transcripts.
✓ Unified memory is opt-in and explicit — a local vault, not hidden cross-product model memory.
✓ State-changing requests require a local origin + CSRF header.
✓ Destructive control (flatten / broadcast / close) needs a confirm token.
✓ No destructive broadcast — ever. The desk-wide button is always a safe stop.

honest limits — v1

Control is managed-only. Plain terminal windows you opened yourself stay read-only — there's no reliable way to inject input into them.

“What's left” is inferred from the transcript, not a real todo list. Best-effort, honestly labelled.

“Live” = recently touched. Per-row time always shows true last activity.

Get started

Run Conductor where your agents already work

Free · open source

Add the MCP server

Wire Conductor into Claude Code or Codex and make it available wherever you orchestrate. Run an orchestrator that flies the fleet — triaging windows, continuing the reversible work, and stopping dead at the gate on anything that deploys, sends, deletes, or spends.

# clone once
$ git clone https://github.com/yksanjo/conductor ~/conductor

# Claude Code fleet
$ claude mcp add conductor --scope user \
      -- node ~/conductor/mcp.js

# Codex fleet
$ codex-conductor mcp

Get the MCP →

Visual · 1 command

Launch the cockpit

Prefer to watch? npm link puts a global conductor on your PATH — no build, no dependencies. Use conductor up for Claude Code fleets or codex-conductor up for Codex fleets.

$ cd ~/conductor && npm link
$ conductor            # glance: table
$ conductor up         # Claude cockpit
$ codex-conductor up   # Codex cockpit

Read the docs →

FAQ

Frequently asked questions

Isn't this just a nicer view of my terminals?

No — a tidy dashboard is table stakes (tmux already gives you that). The point is that you can leave. The fleet runs the reversible work unattended, and a hard gate stops it before anything that can't be undone. The cockpit is just where you look when one window finally needs your call. You're buying the ability to look away, not a prettier grid.

Does it need me to instrument my agents?

No. It reads the .jsonl trails your tools already write, including ~/.claude/projects/ and ~/.codex/sessions/. Zero instrumentation, zero new infrastructure, zero dependencies. Other workers just need a small adapter file.

Can it ship something on its own?

Not anything irreversible. Ordinary work auto-continues, but the moment a window's question — or a proposed reply — touches deploy, send, delete, or spend, the gate in policy.js refuses and returns the reason so a human decides. Approving an irreversible action is always your call.

Is my code or my transcripts sent anywhere?

No. Everything is local-first. The cockpit binds to 127.0.0.1, reads only your own ~/.claude / ~/.codex, and state-changing requests require a local origin plus a CSRF header. Run it for yourself, not as a service.

Is unified memory the same as ChatGPT or Codex memory?

No. The memory MCP is an explicit vault you own and inspect. It gives MCP-aware clients a shared remember / search_memory surface, while built-in product memory remains separate. Use it for stable context and preferences; keep mandatory rules in checked-in docs.

What about windows I opened by hand?

Watched, always. Controlled, only if you adopt them — Conductor forks the session into a managed tmux window (full history intact) so it can inject replies. Plain terminals the OS won't let anything type into stay read-only, by design.