Model selection matters less than the harness around the agent. Give Claude Code, Cursor, and Codex a centralized API and dependency graph, refreshed at development speed, so they reason about cross-repo change without anyone checking out the full codebase locally.
It works best when Claude has enough starting context to know where to look... Teams that invest in codebase setup see better results.
The same model that ships a clean PR on a 10k-line project struggles on a monorepo. The bottleneck moves from the model to the context layer around it. That layer is what an engineering org actually has to build.
The harness around the agent (context, permissions, tools, freshness) is what determines whether a competent model succeeds on a real codebase or wastes tokens on grep.
Conventions, repo orientation, "where the tests live," coding standards. These are prose guidance an agent reads once. They scale to setup, not to "which 14 services call this RPC."
Cross-repo consumer mapping. Field-level usage. Change-impact analysis. The questions an agent has to answer to safely modify a shared API in a system it didn't write.
Two deployment models. The local MCP server is free and runs against the developer's checkout. The Enterprise tier is centralized: cloud or on-prem, and serves context to your agents from a catalog no developer ever needs to clone.
The MCP server runs as a local process and indexes whatever code is checked out locally. No infrastructure to stand up. Free.
Cloud or on-premises. Connects directly to GitHub, Bitbucket, or GitLab. Auto-scans every selected repo from your SCM, no local checkout needed. Your developer's agent queries a centralized MCP server that already holds the answer.
See the engine generate the catalog, serve context to a coding agent, and reason about a cross-service change end to end.
Numbers below are typical patterns reported by enterprise pilot teams running AI coding agents across monorepos. Your codebase and workload will move them in either direction.
One structured catalog call replaces a fan-out of grep, read, and find calls that an agent uses to rebuild context it never had.
"Who calls this RPC?" returns in a single tool call instead of a minute of probabilistic scanning.
When the answer is in the catalog, the agent stops trying to derive it from rg, find, and Python one-liners.
Deterministic. No embeddings. Source code is processed only inside your environment: local on the developer's machine, in your CI, in your Enterprise tenant, or fully on-premises if required. Only the catalog itself is centrally queryable. The static analyzer is deterministic by construction, so the same commit always produces the same catalog. No LLM in the analysis loop.
Model quality is necessary but not sufficient. As you evaluate Claude Code, Cursor, and Codex for your codebase, look hard at how contextual information is generated, maintained, and delivered through the SDLC. The teams that invest in that layer, as Anthropic's own guidance puts it, are the ones that see better results.
Early access is rolling out now. Bring a real codebase. We will plug it into your agent the same day.