Graphify: Instant Knowledge Graph for Claude Code/Antigravity (FREE)
Decision Card
Effort: ~20 min to install and build your first graph on one project (uv tool install graphifyy && graphify install, then /graphify), plus ~12 min of unattended graph-building time per large repo as shown in the video.
Honest take: The presenter is refreshingly honest that his own test showed only ~7–8% token savings, not the headline “71.5x” — that benchmark compares Graphify against a strawman workflow (pasting all 52 files into context) that nobody actually uses, so the real win is answer quality and fewer wrong-direction reads, not raw token count.
Concrete next steps:
- Install on one mixed-content project (code + docs) and run
/graphify, then ask 2–3 architecture questions you’d normally ask cold (~20 min). (repo) - If you keep it, wire
graphify hook installso the graph rebuilds on commit/branch-switch (~2 min). - Skip if your projects are small, single-language, or you mostly run short one-off sessions — the savings “grow with session length” and project size, so small/short workflows see little benefit 09:08.
TL;DR
Graphify scans a project once, builds a queryable knowledge graph (code via local AST parsing, audio/video via local Whisper, docs via a one-time Claude pass), and injects a graph summary at the start of every AI session so the assistant makes a few targeted reads instead of re-reading everything. The presenter’s own A/B test found modest token savings (~7–8%) but noticeably better answer quality, and he debunks the repo’s “71.5x” claim as measuring an unrealistic baseline.
Key Points
- Without a persistent map, every new AI session rebuilds project understanding from scratch, re-reading the same files and burning the same tokens 00:38
- Graphify reads the project once, builds a knowledge graph of connections, and the assistant reads that graph at session start instead of re-reading files 00:54
- The system is three passes: local code parsing (no API/tokens), local audio/video transcription via Faster-Whisper, and a one-time Claude sub-agent pass over docs/PDFs/images 02:12
- Only pass three touches the Claude API and only runs once; afterward every session reads the cached graph for free 03:06
- In the presenter’s 10-question A/B test, token use was 120k (no Graphify) vs 113k (with) — under 8% difference, not the advertised figure 04:03
- Answer quality was visibly better with Graphify — e.g., it explained each of three phases in detail plus loop-detection/replanning, where the baseline just named functions 04:23
- The “71.5x” benchmark measures naive load-all-52-files (~123k tokens) vs Graphify’s ~1,700 tokens/query — a workflow the presenter says nobody actually uses 08:17
- Install is
pip install graphifyy(double-y) thengraphify install, thengraphify claude installto auto-load it every session 05:23 - Output lands in
graphify-out/withGRAPH_REPORT.mdand an interactivegraph.html; the demo found 4,041 nodes, 20,900 edges, 185 communities 06:24 - It works on non-code folders too — research notes, transcripts, PDFs, meeting recordings — making it useful for research vaults and planning folders 09:36
Notable Quotes
“Every session, it rebuilds the same understanding from scratch. It burns the same tokens, makes the same searches, re-reads the same files every time.” 00:46
“Pass three is the only part that touches Claude’s API and it only runs once. After that, every session reads the cached graph for free.” 03:06
“Nobody works by pasting 52 files into a Claude conversation… So that 71 times benchmark is comparing against a workflow most people don’t actually use.” 08:47
Verified Claims
Graphify is a real open-source tool with ~25,000 stars (at recording time). 02:04
- github.com/safishamsi/graphify — now shows ~55.7k stars
- Verdict: Confirmed (star count has since grown well past 25k)
Code is parsed locally via tree-sitter with no external API calls or tokens used. 02:27
- Graphify README: “Code is extracted locally with no API calls (AST via tree-sitter)”
- Verdict: Confirmed
Audio/video is transcribed locally and for free using Faster-Whisper. 02:47
- SYSTRAN/faster-whisper — open-source CTranslate2 reimplementation of Whisper, up to ~4x faster, runs locally
- Verdict: Confirmed
An algorithm groups related concepts into “neighborhoods”/communities. 03:17
- Graphify README: uses Leiden clustering to identify semantic communities (NetworkX + Leiden)
- Verdict: Confirmed
The 71.5x figure is measured against loading all 52 files (~123k tokens) directly, at ~1,700 tokens/query. 08:17
- Graphify README / repo reports 71.5x fewer tokens per query on a mixed corpus (Karpathy repos + papers + images)
- PyShine writeup repeats the 71.5x-vs-raw-files framing
- Verdict: Confirmed (the number is accurate; the presenter’s caveat that the baseline is unrealistic is fair)
Only pass three (docs/PDFs/images) uses the Claude API. 03:06
- Graphify README: “Docs, PDFs, and images require API calls to your assistant’s model,” while code and media transcription run locally
- Verdict: Confirmed
The tool was inspired by Andrej Karpathy’s “/raw folder” idea and shipped shortly after he described it. 02:04
- MindStudio writeup and themenonlab blog describe Graphify as Karpathy-inspired; the repo includes a
graphify clonefor his nanoGPT repo - Verdict: Inconclusive on the exact “48 hours after” timing, but the Karpathy-inspiration framing is widely corroborated
Install is pip install graphifyy (double-y) then graphify install.
05:23
- graphifyy on PyPI — package name is indeed
graphifyy;uv tool install graphifyy/pipx install graphifyyalso documented - Verdict: Confirmed
Tools, Papers & Standards Mentioned
- Graphify — github.com/safishamsi/graphify · graphify.net · PyPI package
graphifyy - Faster-Whisper — github.com/SYSTRAN/faster-whisper
- OpenAI Whisper (underlying model) — github.com/openai/whisper
- tree-sitter (AST parsing, used by Graphify per its README) — tree-sitter.github.io
- Leiden clustering / NetworkX (community detection) — referenced in the Graphify README
- Claude Code (host environment) — github.com/safishamsi/graphify lists supported assistants (Claude Code, Codex, OpenCode, Cursor, Gemini CLI)
Follow-up Questions
- In real day-to-day sessions (tagging a folder and asking questions, not pasting files), what is the measured token and quality delta across project sizes — does the benefit actually scale with file count as claimed?
- How stale does the graph get between rebuilds, and does the git-hook auto-rebuild add meaningful latency or token cost on large repos with frequent commits?
- For privacy-sensitive codebases, what exactly is sent to the Claude API during pass three (docs/PDFs/images), and can that pass be disabled while keeping the local code+media graph?
Sources
- https://github.com/safishamsi/graphify
- https://graphify.net/
- https://pypi.org/project/graphifyy/
- https://github.com/SYSTRAN/faster-whisper
- https://github.com/openai/whisper
- https://tree-sitter.github.io/tree-sitter/
- https://www.mindstudio.ai/blog/graphify-claude-code-knowledge-graph-large-codebase-70x
- https://themenonlab.blog/blog/graphify-knowledge-graph-claude-code-karpathy
- https://pyshine.com/Graphify-AI-Knowledge-Graph-From-Any-Codebase/