Sunbelt Computer Software

Codebook

Codebook is a customized fork of DeusData/codebase-memory-mcp. Full credit to the original authors for the architecture, tree-sitter integration, and the core idea. This fork lives as a separate product with its own release identity and light UX changes, while keeping the underlying codebase broadly compatible upstream where practical.

Every time an AI agent explores your codebase, it burns thousands of tokens grepping through files, rebuilding the same understanding from scratch. This MCP server indexes your code into a persistent knowledge graph -- one query returns what would take dozens of file reads. 99% fewer tokens, sub-millisecond responses, survives session restarts.

Single Go binary. No Docker, no databases, no API keys.

Getting Started

Requires Go 1.25+ and a C compiler (xcode-select --install on macOS).

git clone https://github.com/mightycogs/codebook.git
cd codebook
make install
codebook install

Restart your editor. The install command auto-detects Claude Code, Codex CLI, Cursor, Windsurf, Gemini CLI, VS Code, and Zed -- registers the MCP server and installs task-specific skills.

Now open any project and say "Index this project". That's it. The graph persists in ~/.codebook/ and auto-syncs when files change.

Everyday Usage

Once indexed, just talk to your AI agent naturally. The graph tools are called automatically behind the scenes.

"What calls ProcessOrder?" -- traces inbound call chains across files and packages:

trace_call_path(function_name="ProcessOrder", direction="inbound", depth=3)

"Find dead code" -- functions with zero callers, excluding entry points:

search_graph(label="Function", relationship="CALLS", direction="inbound",
             max_degree=0, exclude_entry_points=true)

"What changed and what might break?" -- maps your git diff to affected symbols with risk labels:

detect_changes(scope="staged", depth=3)

"Show me the architecture" -- languages, packages, entry points, hotspots, clusters, all in one call:

get_architecture(aspects=["all"])

For complex structural queries, the agent writes Cypher on the fly:

query_graph(query="MATCH (a)-[r:HTTP_CALLS]->(b) WHERE r.confidence > 0.5
                   RETURN a.name, b.name, r.url_path ORDER BY r.confidence DESC")

Search is case-insensitive by default. The graph covers 64 languages -- from Python and Go to COBOL and Verilog.

MCP Tools

12 tools exposed over MCP:

Indexing: index_repository, list_projects, delete_project

Querying: search_graph, trace_call_path, detect_changes, query_graph, get_graph_schema, get_code_snippet, get_architecture, manage_adr

Text search: search_code -- grep-like search within indexed files

All search tools support pagination (limit/offset) and return has_more and total counts. Full parameter reference: docs/MCP_TOOLS.md. Query examples: docs/EXAMPLES.md. Cypher syntax: docs/CYPHER.md.

Development

make              # show all targets
make test         # run tests (PKG=./internal/store/ VERBOSE=1)
make coverage     # run tests + coverage gate (COVER_MIN=85)
make report       # generate JSON report, coverage.txt, coverage.html (used by CI)
make check        # lint + tests
make build        # build binary to bin/
make install      # build + copy to ~/.local/bin/

All test/coverage targets accept PKG=./internal/... to scope to a single package.

CLI Mode

Every tool works from the command line too -- no MCP client needed:

codebook cli search_graph '{"name_pattern": ".*Handler.*"}'
codebook cli trace_call_path '{"function_name": "main", "direction": "outbound"}'
codebook cli --raw query_graph '{"query": "MATCH (f:Function) RETURN f.name LIMIT 5"}' | jq

See docs/CLI.md for more.

Graph Model

The graph stores Function, Class, Module, Route, Package, File and other node types connected by edges like CALLS, HTTP_CALLS, IMPORTS, IMPLEMENTS, TESTS, and CONFIGURES. Edge properties carry metadata -- HTTP calls have confidence scores, routes have method and path.

Full schema: docs/GRAPH-MODEL.md

Excluding Files

Place a .cgrignore in your project root (one glob pattern per line) to skip directories or files during indexing. Common directories like .git, node_modules, vendor, __pycache__, dist, and build are always excluded.

Reference

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
.github/workflows		.github/workflows
cmd/codebook		cmd/codebook
docs		docs
internal		internal
scripts		scripts
testdata/smoke		testdata/smoke
tree-sitter-form		tree-sitter-form
tree-sitter-magma		tree-sitter-magma
.gitattributes		.gitattributes
.gitignore		.gitignore
.golangci.yml		.golangci.yml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
go.mod		go.mod
go.sum		go.sum
server.json		server.json

Sunbelt Computer Software

PL/B Language Development and Support

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Codebook

Getting Started

Everyday Usage

MCP Tools

Development

CLI Mode

Graph Model

Excluding Files

Reference

License

About

Uh oh!

Contributors

Uh oh!

Languages

Sunbelt Computer Software

PL/B Language Development and Support

Folders and files

Latest commit

History

Repository files navigation

Codebook

Getting Started

Everyday Usage

MCP Tools

Development

CLI Mode

Graph Model

Excluding Files

Reference

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages