This Open Source Repo Just Solved Claude Code's #1 Problem
How Graphify turns any codebase into a queryable knowledge graph and cuts Claude Code's token bill by 60%.
June 5thHow to give your LightRAG knowledge graph the power to ingest PDFs, charts, and images without changing how you query it.
RAG Anything solves the plain-text ceiling of most knowledge graph systems by running a free local parser that converts any document type into the same vector database and entity graph LightRAG already uses.
Most RAG systems can only ingest plain text, which breaks the moment you feed them scanned PDFs, charts, or images. RAG Anything -- from the same team that built LightRAG -- fixes this with a local document parser called MinerU that classifies every element in a PDF (text, image, chart, LaTeX) and routes each through a specialized pipeline. The text bucket and image bucket each produce a vector database and a knowledge graph, which are merged together, then merged with your existing LightRAG instance. The output is structurally identical to a text-only setup, and querying through Claude Code works exactly the same way as before.
Sign in and you get 23 free chat messages on us — ask for the hook, quote a framework, find the exact transcript moment, generate a markdown action plan. Bring your own key when you want unlimited.
Create a free account →
Hook on the plain-text limitation of RAG systems. RAG Anything introduced as the fix. Prerequisites established: assumes LightRAG is already running.

High-level: RAG Anything is a multimodal wrapper for LightRAG from the same HKUDS team. Handles PDFs, images, charts. Brief sponsor mention (Chase AI+ masterclass). All output routes to the same knowledge graph.

Architecture walkthrough: MinerU runs locally and classifies PDF elements into text, images, charts, LaTeX. Text bucket -> PaddleOCR -> LLM -> embeddings + entities. Image bucket -> screenshot -> LLM (OCR + vision) -> embeddings + entities. Each path produces a vector DB and knowledge graph. All four are merged into one pair, then merged with the existing LightRAG instance.

One-shot Claude Code prompt installs RAG Anything, updates storage paths, swaps models to GPT-5.4 Nano, and patches the embedding double-wrap bug. Non-text ingestion requires a Python script wrapped into a Claude Code skill. Demo: querying a fake NovaTech SaaS PDF with a bar chart -- Claude Code returns monthly revenue data Jan-Sep 2025.

MinerU runs locally on CPU/GPU -- API cost only for the LLM embedding step. Free Skool community has the one-shot prompt and Claude Code skill.
The plain-text wall breaks most self-hosted RAG pipelines -- and the fix is a local parsing layer that is completely invisible at query time.
“Almost every RAG system suffers from the exact same problem. They can only handle text documents.”
“Why don't we just treat this entire thing as a screenshot? Because it's expensive and slow.”
“In the end, you didn't notice a dang thing. Again, as the user, all of this is invisible to you.”
See every word as it's spoken — crank it to 2× and still catch all of it. The same dual-channel trick behind Amazon's Kindle + Audible.
The ceiling of most RAG pipelines is not compute or cost -- it is document type. The moment a knowledge base encounters a scanned PDF or a chart embedded in a slide deck, the pipeline silently fails. This video is the fix.
MinerU splits any document into a text path and an image path before sending to an LLM. Each path produces its own vector DB and knowledge graph. All four artifacts are merged into one.
“Make sure to check out Chase AI plus if you wanna get your hands on that Claude Code masterclass”
Soft pitch at the very end after all value delivered; free community alternative offered throughout
00:00
00:16
00:40
00:48
01:09
01:19
01:39
01:51
02:01
02:19
02:33
02:47
03:01
03:15
03:22
03:48
03:57
04:12
04:26
04:36
04:55
05:12
05:26
05:38
05:55
06:07
06:21
06:35
06:50
07:03
07:19
07:33
07:47
08:02
08:16
08:30
08:45
08:59
09:13
09:28
09:39
09:57
10:11
10:25
10:40
10:47
11:08
11:23
11:37
11:51
12:06
12:20
12:29
12:49
13:03
13:18
13:33
13:47
14:02
14:17
14:25
14:46
15:01
15:15
15:30
15:45
16:06
16:14
16:29
16:42
16:58
17:13
17:27
17:46
17:49
18:11
18:19
18:44
18:54
19:12How Graphify turns any codebase into a queryable knowledge graph and cuts Claude Code's token bill by 60%.
June 5thA 19-minute screen-share walkthrough of the hybrid AI-image-plus-HTML approach that keeps social carousels from looking like everyone else's Claude Code output.
June 2ndEverything you need to know about Claude Code skills — what they are, how they load, how to trigger them, and how to build benchmarked custom ones — in under ten minutes.
March 16thHow a plain markdown vault with one index file replaces a vector database for most solo builders.
April 4thAn 8-minute first look at Anthropic’s new visual design tool — what it does, how it compares to Stitch and Lovable, and why the visual layer matters even when the code underneath is identical.
April 17thA 14-minute listicle that makes the case for CLIs over MCPs and hands you the stack to prove it.
March 21st