claude-code

Graphify Is a Context-Compression Skill for the Part of Agent Work That Keeps Breaking: Remembering the Repo

Anatoliy Kolodkin

01 Jun 2026 • 4 min read

The least glamorous failure mode in agentic coding is also the most common: the agent forgets what matters. It reads the wrong file, misses the architectural boundary, over-compresses the previous session, or confidently edits against a half-remembered version of the repo. Everyone wants a smarter model. Many teams would get more value from a better memory substrate.

That is the useful frame for Graphify, which released v0.8.28 on June 1. The project describes itself as a Claude Code skill invoked with /graphify, but its ambition is broader: turn code, schemas, docs, papers, images, and diagrams into a persistent knowledge graph that coding agents can query instead of repeatedly shoving raw files into context and hoping the model notices the important part.

The repo is not small. At research time it showed 57,912 stars, 6,048 forks, 280 open issues, an MIT license, and a same-day push after the v0.8.28 release. The headline claim is aggressive: 71.5x fewer tokens per query on a mixed corpus of Karpathy repositories, five papers, and four images. The README also reports a more modest 5.4x on a smaller mixed corpus and roughly 1x on a six-file synthetic library. That spread is the honest part. Graph compression is not magic; it pays off when there is enough structure to compress.

The June 1 release is mostly practical plumbing, which is a good sign. v0.8.28 adds graphify install --platform kilo, installs a native Kilo Code skill plus /graphify command, and registers a .kilo plugin without rewriting existing .kilo/kilo.jsonc comments. It also fixes Linux and macOS install failures by making the BYOND DreamMaker parser optional instead of requiring source compilation where no Windows wheel exists. Most importantly for long-lived generated context, it stabilizes community IDs by assigning them through a total order based on size and node IDs, preventing meaningless label churn across runs.

Stable context beats impressive context

That community-ID fix sounds microscopic until you have reviewed generated architecture artifacts for more than five minutes. If every run renumbers the graph, humans cannot tell whether the system changed or the labels did. Agents have the same problem, except they respond by confidently building on noise. Stable generated artifacts are not polish; they are the difference between a useful map and a kaleidoscope.

Graphify’s output surface is intentionally broad: graph.html, an Obsidian vault export, an agent-crawlable wiki, GRAPH_REPORT.md, graph.json, cache files, optional SVG/GraphML/Neo4j exports, and an MCP stdio server mode. Extraction spans code through tree-sitter and call graphs, documentation through Claude, PDFs through citation and concept extraction, and images through Claude vision. Edges are labeled EXTRACTED, INFERRED, or AMBIGUOUS. That last detail matters because a graph that cannot distinguish evidence from inference is just a very organized hallucination.

This is where Graphify fits the Claude Code moment. Claude Code skills, MCP servers, long-running sessions, and dynamic workflows all increase the amount of context an agent can accumulate. They do not automatically make that context usable. A persistent graph gives the agent a routing layer: find communities, identify “god nodes,” follow paths, surface surprising connections, and decide which files deserve direct inspection. That is a better operating model than “read the whole repo again” or “trust the summary from three compactions ago.”

But the graph is an index, not ground truth. Engineers should treat it like a senior developer’s architecture sketch: useful for orientation, dangerous as the sole basis for edits. If Graphify says module A influences module B, the agent should use that to decide what to read next, not to skip reading. Compression changes the failure mode. Instead of drowning in tokens, the model may overfit to whatever became central in the graph and miss the boring edge file that actually enforces the invariant.

The harness still has to remember to ask

A fresh Reddit thread captured the adoption problem neatly: a user said Graphify context “works like magic in Claude Code,” while other platforms such as OpenCode and Cline did not reliably keep reading or updating it. That is the core product lesson. The knowledge base can exist, but the harness needs habits. Skills, hooks, commands, and workflow conventions determine whether the agent consults the graph at the right moment and refreshes it after meaningful changes.

That is why v0.8.28’s Kilo Code support is not just platform checkboxing. Graphify is trying to become portable agent infrastructure. Claude Code may be the strongest current workflow because skills and commands make consultation natural, but the underlying need exists across Codex, OpenCode, Cursor, Gemini CLI, Kilo Code, and whatever comes next. The winning context layer will not be the one with the prettiest graph. It will be the one agents actually use without humans begging them every turn.

For practitioners, the sane adoption path is narrow. Pick a repo with real onboarding or debugging pain. Run Graphify. Inspect GRAPH_REPORT.md yourself. Ask Claude Code questions whose answers you already know: where is auth enforced, what path writes invoices, which modules touch tenant boundaries, what code owns retry behavior. Measure whether the graph helps the agent find the right primary files faster and with fewer hallucinated connections.

If it works, wire it into the workflow carefully. Use watch mode or hooks where appropriate, but keep generated artifacts reviewable. Commit them only if the team wants the graph as part of project state; otherwise treat them as local cache. Add a rule to your agent instructions: graph context routes attention, direct file reads authorize edits. That one sentence will prevent a surprising amount of beautiful nonsense.

The broader market read is that token budget is forcing architecture. Long-context models help, but they do not remove the need for selection. Agents need memory that is persistent, queryable, and honest about uncertainty. Graphify is one strong example of that shift: context governance, not graph eye candy. The diff is approved, with the usual request: verify before you edit.

Sources: Graphify GitHub repository, Graphify v0.8.28 release, Reddit practitioner discussion, Claude Code overview

Stable context beats impressive context

The harness still has to remember to ask

Sign up for more like this.