Code Orb is a CLI-first coding agent project with a future path to desktop.
The repository is now in an early runnable stage.
Today it provides:
- a one-shot CLI path with orb run "<task>"
- an interactive foreground CLI with orb chat
- local session artifacts under .orb/sessions/
- git-aware session reporting for pre-existing vs current-run changes
- benchmark-backed validation for key milestone paths
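To make the "pre-existing vs current-run" distinction concrete, here is a minimal sketch of one way such a report could be computed. It is not Code Orb's actual implementation: the `DirtySnapshot` and `ChangeReport` types, the `classifyChanges` function, and the idea of comparing content hashes taken before and after a run are all assumptions made for illustration.

```typescript
// Hypothetical snapshot: path -> content hash, for every file that differs from HEAD.
type DirtySnapshot = Record<string, string>;

interface ChangeReport {
  preExisting: string[]; // dirty before the run and left untouched by it
  currentRun: string[]; // created, modified again, or reverted during the run
}

// Compare a snapshot taken at session start with one taken at session end.
function classifyChanges(before: DirtySnapshot, after: DirtySnapshot): ChangeReport {
  const preExisting: string[] = [];
  const currentRun: string[] = [];

  for (const path of Object.keys(after)) {
    if (before[path] === after[path]) {
      preExisting.push(path); // same dirty content as before the run
    } else {
      currentRun.push(path); // new dirty file, or dirty content changed
    }
  }

  // A file that was dirty at the start but is clean at the end was changed by the run.
  for (const path of Object.keys(before)) {
    if (after[path] === undefined) currentRun.push(path);
  }

  return { preExisting: preExisting.sort(), currentRun: currentRun.sort() };
}

const report = classifyChanges(
  { "README.md": "a1", "src/app.ts": "b1" },
  { "README.md": "a1", "src/app.ts": "b2", "src/new.ts": "c1" }
);
// report.preExisting is ["README.md"]
// report.currentRun is ["src/app.ts", "src/new.ts"]
```

Comparing hashes rather than bare paths lets the sketch notice a file that was already dirty and was then edited again during the run.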
The current product shape is still intentionally narrow. The shell and runtime are real, but the CLI is still experimental and not yet a production-ready coding tool.
Release status:
- 0.4.0 Session And Git Awareness: released
- 0.5.0 Interactive CLI: released on 2026-04-08
- 0.6.0 Extensibility: released on 2026-04-08
- 0.7.0 Query Loop And Tool Runtime: released on 2026-04-08
- 0.8.0 Usable CLI: released on 2026-04-09
- 0.9.0 CLI UX And TUI: in progress as of 2026-04-09

- AGENTS.md
- docs/product/vision.md
- docs/product/v0-scope.md
- docs/architecture/overview.md
- docs/roadmap/roadmap.md
Install dependencies:
    pnpm install
Configure a real provider:
    cp .env.example .env.local
or:
    bash scripts/setup-local-env.sh
Then edit .env.local and set:
- OPENAI_API_KEY
- OPENAI_MODEL
- optionally OPENAI_BASE_URL for an OpenAI-compatible endpoint
Provider notes:
- Code Orb automatically loads .env and .env.local from the active working directory
- shell environment variables still override .env file values
- if no provider variables are set, Code Orb falls back to the mock model client
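The precedence described in the provider notes can be sketched as a small pure function. This is an illustration under stated assumptions, not the project's loader. The names `parseDotenv`, `resolveEnv`, and `chooseProvider` are hypothetical. The notes only say that shell variables override file values, so letting .env.local override .env is an assumption. Falling back to the mock client whenever the key or the model is missing is also an assumption, because the notes only cover the case where no provider variables are set.

```typescript
type Env = Record<string, string>;

// Parse simple KEY=VALUE lines; blank lines and "#" comments are skipped.
function parseDotenv(text: string): Env {
  const env: Env = {};
  for (const rawLine of text.split(/\r?\n/)) {
    const line = rawLine.trim();
    if (line === "" || line.charAt(0) === "#") continue;
    const eq = line.indexOf("=");
    if (eq <= 0) continue; // no key, or no "=" at all
    const key = line.slice(0, eq).trim();
    let value = line.slice(eq + 1).trim();
    const first = value.charAt(0);
    const last = value.charAt(value.length - 1);
    // Strip one pair of matching surrounding quotes.
    if (value.length >= 2 && (first === '"' || first === "'") && last === first) {
      value = value.slice(1, -1);
    }
    env[key] = value;
  }
  return env;
}

// Later sources win: .env, then .env.local (assumed order), then the shell.
function resolveEnv(dotenv: Env, dotenvLocal: Env, shell: Env): Env {
  return { ...dotenv, ...dotenvLocal, ...shell };
}

type ProviderChoice =
  | { kind: "openai-compatible"; apiKey: string; model: string; baseUrl?: string }
  | { kind: "mock" };

// Assumption: both the key and the model are needed for a real provider.
function chooseProvider(env: Env): ProviderChoice {
  const apiKey = env["OPENAI_API_KEY"];
  const model = env["OPENAI_MODEL"];
  if (!apiKey || !model) return { kind: "mock" };
  const baseUrl = env["OPENAI_BASE_URL"];
  return baseUrl
    ? { kind: "openai-compatible", apiKey, model, baseUrl }
    : { kind: "openai-compatible", apiKey, model };
}
```

Keeping the merge separate from file and process access makes the precedence easy to test: callers pass in the parsed file contents and the shell environment, and the function has no side effects.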
Interactive CLI:
    pnpm run cli:run -- chat
One-shot task execution:
    pnpm run cli:run -- run "<task>"
Run against another repository:
    pnpm run cli:run -- --cwd /path/to/repo chat
    pnpm run cli:run -- --cwd /path/to/repo run "<task>"
Session inspection:
    pnpm run cli:run -- sessions list
    pnpm run cli:run -- sessions show <session-id>
Interactive commands:
- /help
- /history
- /status
- /exit
- /quit
- exit
- quit
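As an illustration of how the interactive commands listed above might be told apart from ordinary task text, here is a hedged sketch. The `ChatInput` type and `parseChatInput` function are hypothetical names, not Code Orb APIs. Treating /quit, exit, and quit as aliases of /exit, and reporting other slash-prefixed input as an unknown command, are assumptions.

```typescript
type ChatInput =
  | { kind: "command"; name: "help" | "history" | "status" | "exit" }
  | { kind: "unknown-command"; raw: string }
  | { kind: "task"; text: string }
  | { kind: "empty" };

// Classify one line typed into the interactive session.
function parseChatInput(line: string): ChatInput {
  const text = line.trim();
  if (text === "") return { kind: "empty" };

  // Bare "exit" and "quit" are accepted alongside the slash forms.
  if (text === "/exit" || text === "/quit" || text === "exit" || text === "quit") {
    return { kind: "command", name: "exit" };
  }
  if (text === "/help") return { kind: "command", name: "help" };
  if (text === "/history") return { kind: "command", name: "history" };
  if (text === "/status") return { kind: "command", name: "status" };

  // Assumption: any other slash-prefixed input is not sent to the model.
  if (text.charAt(0) === "/") return { kind: "unknown-command", raw: text };

  return { kind: "task", text };
}

const parsedQuit = parseChatInput("  quit ");
// parsedQuit is { kind: "command", name: "exit" }
const parsedTask = parseChatInput("fix the failing test");
// parsedTask is { kind: "task", text: "fix the failing test" }
```

Returning a tagged union keeps the read loop simple: the caller switches on `kind` and only forwards `task` input to the agent.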
Primary benchmark path:
    pnpm run benchmark:failing-test-fix
Interactive CLI milestone benchmark:
    pnpm run benchmark:interactive-multi-turn
Repository instruction visibility benchmark:
    pnpm run benchmark:project-instructions
Code Orb is currently best suited for:
- fixture or small local-repository tasks
- explicit edit-and-verify requests
- interactive multi-turn iteration in one foreground session
- session-aware follow-up work and repository-state inspection
It is not yet a broadly capable general coding agent. The runtime is still narrow in task interpretation and tool selection. Repository-root AGENTS.md guidance is now loaded and surfaced during execution, but the broader instruction model remains intentionally limited.
apps/ App entry points such as the CLI and, later, desktop shells.
packages/ Shared runtime packages used by multiple apps.
docs/ Product, architecture, ADR, roadmap, engineering, and reference docs.
prompts/ Prompt assets managed as first-class project artifacts.
tests/ Unit, integration, end-to-end, and fixture repositories.
benchmarks/ Benchmark tasks and evaluation repositories for milestone validation.
scripts/ Automation scripts that support development and release work.
- contributor and agent rules: AGENTS.md
- product scope: docs/product/v0-scope.md
- product direction: docs/product/vision.md
- architecture overview: docs/architecture/overview.md
- execution model: docs/architecture/execution-model.md
- provider runtime: docs/architecture/provider-runtime.md
- tool system: docs/architecture/tool-system.md
- roadmap: docs/roadmap/roadmap.md
- 0.5.0 interactive CLI milestone: docs/roadmap/releases/0.5.0.md
- 0.7.0 query loop and tool runtime milestone: docs/roadmap/releases/0.7.0.md
- 0.7.0 execution checklist: docs/roadmap/releases/0.7.0-checklist.md
- 0.8.0 usable CLI milestone: docs/roadmap/releases/0.8.0.md
- 0.8.0 execution checklist: docs/roadmap/releases/0.8.0-checklist.md
- 0.9.0 CLI UX and TUI milestone: docs/roadmap/releases/0.9.0.md
- 0.9.0 execution checklist: docs/roadmap/releases/0.9.0-checklist.md

- AGENTS.md is the canonical guide for coding agents working in this repository.
- CLAUDE.md is intentionally thin and points back to AGENTS.md.
- Architecture changes should update the relevant docs in the same patch.
- ADRs are used for decisions that would be expensive to reverse later.
