Code Repository Intelligence Embedding Tools
Tools for indexing, analyzing, and semantically searching code repositories using embeddings and AST parsing. Includes code understanding, commit message generation, and code-aware Q&A systems. Does NOT include general code review platforms, CI/CD tools, or non-semantic code search.
There are 126 code repository intelligence tools tracked. 9 score above 50 (established tier). The highest-rated is cocoindex-io/cocoindex at 65/100 with 6,438 stars. 3 of the top 10 are actively maintained.
Get all 126 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=code-repository-intelligence&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
cocoindex-io/cocoindex
Data transformation framework for AI. Ultra performant, with incremental... |
|
Established |
| 2 |
dtsola/xiaoyaosearch
小遥搜索,听懂你的话、看懂你的图,用AI找到本地任何文件。让搜索像聊天一样简单。XiaoyaoSearch: Understands your... |
|
Established |
| 3 |
Ryandonofrio3/osgrep
Open Source Semantic Search for your AI Agent |
|
Established |
| 4 |
justincasher/lean-explore
A search engine for Lean 4 declarations |
|
Established |
| 5 |
yoanbernabeu/grepai
Semantic Search & Call Graphs for AI Agents (100% Local) |
|
Established |
| 6 |
infinilabs/coco-server
🥥 Coco AI Server - Search, Connect, Collaborate, AI-powered Enterprise... |
|
Established |
| 7 |
probelabs/probe
AI-friendly semantic code search engine for large codebases. Combines... |
|
Established |
| 8 |
starthackHQ/Contextinator
Turning messy repos into weapons of mass structured context. |
|
Established |
| 9 |
scarletkc/vexor
A semantic search engine for files and code. |
|
Established |
| 10 |
revokslab/codecrawl
🌊 Turn entire codebases into LLM-ready data. Extract data, search, and... |
|
Emerging |
| 11 |
SireJeff/k0ntext
AI Context Engineering - Intelligent context for Claude, Copilot, Cline, and... |
|
Emerging |
| 12 |
M9nx/CodexA
Codexa is a local semantic code intelligence CLI designed to help AI... |
|
Emerging |
| 13 |
cyberytti/ToolHunt
This is a local search engine to search for cybersecurity tools. It has... |
|
Emerging |
| 14 |
tomohiro-owada/devrag
Markdown vector search MCP server for Claude Code. Natural language search... |
|
Emerging |
| 15 |
pwrdrvr/ghcrawl
Terminal UI and local CLI for crawling GitHub issues and pull requests,... |
|
Emerging |
| 16 |
auyelbekov/rawq
Context retrieval engine for AI agents — semantic + lexical search over codebases |
|
Emerging |
| 17 |
jwizenfeld04/Echo-Guard
Semantic linting CLI that detects codebase redundancy created by AI coding agents. |
|
Emerging |
| 18 |
ShunsukeHayashi/context-and-impact
Unified Context-to-Execution pipeline: Obsidian semantic search + GitNexus... |
|
Emerging |
| 19 |
chrisfentiman/claude-context-cli
Auto-indexing CLI for claude-context-mcp (npm: claude-context-cli) |
|
Emerging |
| 20 |
SylphxAI/coderag
Lightning-fast semantic code search with AST chunking (15+ languages) -... |
|
Emerging |
| 21 |
REASY/k8s-ariadne-rs
Query Kubernetes with natural language by compiling English to Cypher. No... |
|
Emerging |
| 22 |
definitive-io/code-indexer-loop
Code Indexer Loop is a Python library for indexing and retrieving source... |
|
Emerging |
| 23 |
XiaoConstantine/sgrep
CLI for semantic grep |
|
Emerging |
| 24 |
mvp-scale/aOa
Semantic compression Claude and Gemini. 5 angles of O(1) indexed search —... |
|
Emerging |
| 25 |
wjddusrb03/diffmind
AI Code Review Memory - learns from your team's bug history and warns when... |
|
Emerging |
| 26 |
rekal-dev/rekal-cli
Git-anchored decentralised intent(conversation) ledger for teams who build with AI |
|
Emerging |
| 27 |
sagarmk/beacon-plugin
Semantic code search plugin for Claude Code using hybrid vector search +... |
|
Emerging |
| 28 |
SecrinLabs/secrin
The living wiki that writes itself. |
|
Emerging |
| 29 |
billlzzz10/bl1nk-mcp-server
Modular, audit-ready memory system combining knowledge graph, semantic... |
|
Emerging |
| 30 |
chiraag-kakar/context-nexus
Context Nexus is an AI-native backend platform and SDK for managing,... |
|
Emerging |
| 31 |
iagocavalcante/claude-turbo-search
Optimized file search and semantic indexing for large codebases in Claude Code |
|
Emerging |
| 32 |
jamie8johnson/cqs
Code intelligence and RAG for AI agents. Semantic search, call graphs,... |
|
Emerging |
| 33 |
zircote/rlm-rs-plugin
Claude Code plugin for processing documents 100x larger than context limits... |
|
Experimental |
| 34 |
BjornMelin/codex-prompt-refinery
Local Codex CLI prompt refinery: ingest JSON/JSONL histories, dedupe, embed... |
|
Experimental |
| 35 |
jsbattig/code-indexer
Python application to index code locally and support running server with... |
|
Experimental |
| 36 |
dvcdsys/code-index
Semantic code search powered by embeddings. Search your codebase by meaning,... |
|
Experimental |
| 37 |
del-delprilia/mt5-device-fingerprint-tool
🔒 Generate unique device fingerprints for MetaTrader 5, ensuring secure... |
|
Experimental |
| 38 |
Tomcat132025/odino
🔍 Discover and access your code quickly with Odino, a fast local semantic... |
|
Experimental |
| 39 |
Carlosagamez2021/AI-Indexing
🚀 Simplify code indexing with structured prompts and testing, optimizing... |
|
Experimental |
| 40 |
lemoal-t/oriongraphdb
🚀 Optimize AI context retrieval with OrionGraphDB, a powerful engine that... |
|
Experimental |
| 41 |
karmaniverous/jeeves-watcher
Filesystem watcher that keeps a Qdrant vector store in sync with document... |
|
Experimental |
| 42 |
Rchrdgt/codex-prompt-refinery
🛠️ Refine and organize your OpenAI Codex CLI prompts effortlessly, using... |
|
Experimental |
| 43 |
CoderDayton/semantic-cache-mcp
MCP server that reduces LLM token usage by 80%+ through intelligent file... |
|
Experimental |
| 44 |
inanitionnn/tagall
Pet project: letterbox alternative |
|
Experimental |
| 45 |
evoleinik/claude-grep
Search Claude Code session history. Regex + semantic (vector) search. Single... |
|
Experimental |
| 46 |
sert-xx/unified-blueprint
Documentation-as-Code middleware — structure Markdown docs into a Document... |
|
Experimental |
| 47 |
Shun0212/Owl-CLI
Semantic code search using vector embeddings. Search your codebase with... |
|
Experimental |
| 48 |
Sharper-Flow/lgrep
Dual-engine code intelligence for OpenCode: semantic code search plus symbol... |
|
Experimental |
| 49 |
damiandelmas/flex
Local search and retrieval for AI Agents |
|
Experimental |
| 50 |
Aliipou/codebase_intelligence
AI-powered codebase intelligence: semantic search, dependency analysis, and... |
|
Experimental |
| 51 |
AndySze/session-seek
Search your AI coding agent session history with semantic + keyword search... |
|
Experimental |
| 52 |
dbhavery/mcplex
MCP server for local AI models — expose Ollama, embeddings, and vision... |
|
Experimental |
| 53 |
sarupurisailalith/codebase-cortex
AI-powered documentation autopilot — commit code, docs update themselves.... |
|
Experimental |
| 54 |
jsuppe/loom
🧵 Requirements traceability for AI-assisted development. Extract... |
|
Experimental |
| 55 |
The-Cloud-Clock-Work/agentibridge
MCP server that indexes Claude Code CLI transcripts and exposes them via 16... |
|
Experimental |
| 56 |
Eyram233/CodeRAG
Build semantic vector databases from code and docs to enable AI agents to... |
|
Experimental |
| 57 |
damartr23/FischROBLOX
Automate Roblox gameplay and improve scripting with FischROBLOX’s reliable... |
|
Experimental |
| 58 |
beeftapareseller/docs
🛠 Build powerful applications with Puter.js using comprehensive... |
|
Experimental |
| 59 |
Veddd27/MTPulse
🛠️ Manage MTProto proxies easily with MTPulse. Add, remove, and control... |
|
Experimental |
| 60 |
TheMailmans/deepindex
Local-first semantic code search + MCP tools for Claude Code. No cloud, no... |
|
Experimental |
| 61 |
usercise/spelunk
Local context engine for AI coding agents. Semantic code search, code graph,... |
|
Experimental |
| 62 |
tulinever/code-historian
🕵️♂️ Track code changes effortlessly with AI-powered history tools and... |
|
Experimental |
| 63 |
andycandy/CausewayAI
Next-gen semantic retrieval system. Combines the power of Qdrant vector... |
|
Experimental |
| 64 |
solomonneas/code-search-api
Local semantic code search with Ollama embeddings, SQLite, and hybrid... |
|
Experimental |
| 65 |
Aarchi-07/CausewayAI
Next-gen semantic retrieval system. Combines the power of Qdrant vector... |
|
Experimental |
| 66 |
siropkin/budi
Context buster for Claude Code. Local retrieval that injects the right code... |
|
Experimental |
| 67 |
ZAKZOUK406/claude-turbo-search
🔍 Streamline codebase navigation with fast file search and semantic indexing... |
|
Experimental |
| 68 |
quasilyte/vscode-gogrep
Structural, syntax-aware search for Go code for VS Code. |
|
Experimental |
| 69 |
Sammed101/FuzzAI
Intelligent fuzzing tool integrating LLM-driven wordlist selection,... |
|
Experimental |
| 70 |
sltnsrh/knowledge-base
Semantic search knowledge base with vector embeddings and Claude MCP integration |
|
Experimental |
| 71 |
maciek-O-digiaidev/CodeRAG
Intelligent codebase context engine for AI coding agents. Semantic RAG from... |
|
Experimental |
| 72 |
Ahmed-aleryani/claude-context-local-plugin
Claude Code plugin for semantic code search using claude-context-local MCP... |
|
Experimental |
| 73 |
usr-wwelsh/botdocs
Turn md into a pretty site with chatbot |
|
Experimental |
| 74 |
Pomilon/Kestr
High-performance daemon for real-time codebase indexing. Generates semantic... |
|
Experimental |
| 75 |
moabualruz/rice-search
A fully local, production-ready code search platform with hybrid BM25 +... |
|
Experimental |
| 76 |
ShadReyes/cortex-recall
Semantic code & git history search CLI — tree-sitter parsing, pluggable... |
|
Experimental |
| 77 |
souldriver007/karp-word-graph
AI-powered KJV Bible study companion for Claude Desktop. Semantic scripture... |
|
Experimental |
| 78 |
nshkrdotcom/portfolio_coder
Code Intelligence Platform: Repository analysis, semantic code search,... |
|
Experimental |
| 79 |
nshkrdotcom/portfolio_manager
AI-native personal project intelligence system - manage, track, and search... |
|
Experimental |
| 80 |
souldriver007/karp-bible-code
AI-assisted ELS (Equidistant Letter Spacing) Bible code research engine for... |
|
Experimental |
| 81 |
mvp-scale/aOa-legacy
5 angles. 1 attack. O(1) indexed search. Up to 95% fewer tokens per... |
|
Experimental |
| 82 |
jonwraymond/tooldiscovery
Tool registry, search, semantic indexing, and documentation |
|
Experimental |
| 83 |
Stahldavid/sensegrep
Semantic + structural code search for AI-native development |
|
Experimental |
| 84 |
AssahBismarkabah/42context
A Local Context Engine |
|
Experimental |
| 85 |
copyleftdev/tala
Intent-native narrative execution layer. Reimagines Linux shell history as a... |
|
Experimental |
| 86 |
PEACEBINFLOW/mindscript-search
Semantic & structural search engine for the MindScript ecosystem. Index... |
|
Experimental |
| 87 |
servesys-labs/oriongraphdb
A context database for AI agents. Multi-channel retrieval (semantic,... |
|
Experimental |
| 88 |
KirtiJha/code-historian
🕰️ AI-powered VS Code extension for code history tracking with RAG-based... |
|
Experimental |
| 89 |
ThinkerYzu/kb-indexer
LLM-powered knowledge base indexer that builds a growing semantic layer of... |
|
Experimental |
| 90 |
Siddhhhh/ai-code-intelligence
AI-powered platform that analyzes GitHub repositories using local LLM... |
|
Experimental |
| 91 |
Suh0161/CodeScope
Search deeper. Know your codebase. Intelligent codebase search and analysis... |
|
Experimental |
| 92 |
oroinc/documentation-markdown
Markdown variant for AI |
|
Experimental |
| 93 |
bhavesh-kalluru/genai-project-2026-03-26
AI-powered Python code review tool that detects anti-patterns using... |
|
Experimental |
| 94 |
bhavesh-kalluru/genai-project-2026-03-27
AI-powered CLI that analyzes git diffs and generates conventional commit... |
|
Experimental |
| 95 |
Sermilion/telegram-private-search
Local-first MCP server and Kotlin/JVM CLI for indexing private Telegram... |
|
Experimental |
| 96 |
moijafcor/glean
Ask plain-English questions about your projects — source code,... |
|
Experimental |
| 97 |
ftrou/Decodifier3.1
Deterministic method-first retrieval for AI coding agents. |
|
Experimental |
| 98 |
VenomEzra91/fisch-roblox-toolkit
Level Up Your Roblox Experience with FischROBLOX: Tips, Scripts, and Game Guides |
|
Experimental |
| 99 |
sathish-mass/codebase-intelligence-platform
AI-powered codebase intelligence platform with semantic search, grounded... |
|
Experimental |
| 100 |
pgib11/roblox-gameflow-toolkit
Ultimate Roblox Scripting Tools 2026 🚀 | Free Roblox Automation & Game Hacks... |
|
Experimental |
| 101 |
BenDavies1218/gitdex-semantic-search
Local code indexer with MCP-based semantic search. Parses Git repos with... |
|
Experimental |
| 102 |
Team-Indexa/indexa
Modern search engine focused on fast, structured, and AI-powered knowledge... |
|
Experimental |
| 103 |
parbhatkapila4/RepoDocs
An AI-Powered Code Documentation Platform Automated documentation engine... |
|
Experimental |
| 104 |
Agents365-ai/semanticscholar-skill
Claude Code skill for academic paper search using the Semantic Scholar API |
|
Experimental |
| 105 |
oceanremodeling/FischROBLOX
Automate Roblox game testing and development processes to improve efficiency... |
|
Experimental |
| 106 |
Dnzinnxz/binelek-vscode-extension
🔍 Explore and manage Binelek knowledge graphs and AI services with this VS... |
|
Experimental |
| 107 |
kuluruvineeth/openbeam
The open source Glean alternative. Enterprise search + AI agents across... |
|
Experimental |
| 108 |
PPierzc/hive
🐝🔍 Hive: A CLI Tool for Semantic Searching of Your Knowledge Base |
|
Experimental |
| 109 |
louisfghbvc/CppSeek
AI-Powered Semantic Search for C/C++ |
|
Experimental |
| 110 |
souldriver007/karp-graph-lite
"Personal knowledge graph for Claude Desktop — remember, recall, connect" |
|
Experimental |
| 111 |
tanuj077/codeatlas
CodeAtlas: AI-powered code search and chat system using AST parsing,... |
|
Experimental |
| 112 |
josehu07/codetective
Takes code, gives AI authorship detection in five clicks :mag_right: |
|
Experimental |
| 113 |
MohammedNasserAhmed/CodeXpert
CodeXpert: A cutting-edge AI-powered code analysis tool leveraging... |
|
Experimental |
| 114 |
Lukasdias/opencontext
Semantic code search with ranked file matches and contextual line snippets... |
|
Experimental |
| 115 |
davidteren/code_grasp
A CLI tool that uses the Qodo-Embed-1-1.5B embedding model to analyze code,... |
|
Experimental |
| 116 |
Clownstein/Insight-Investigator
OSINT Investigator is a Chrome Extension & Discord Bot that captures and... |
|
Experimental |
| 117 |
gantumurbattumur/Github-aware-RAG
Semantic search across your starred and own GitHub repos, right inside VS... |
|
Experimental |
| 118 |
luanvenancio/design-extractor
A self-hosted backend that captures websites, extracts design signals, and... |
|
Experimental |
| 119 |
jasjeev013/Git-Insight-Orchestrator-Agent
It is an AI-powered tool that clones any GitHub repository, chunks and... |
|
Experimental |
| 120 |
NeaByteLab/AI-Indexing
Code indexing examples for converting source code into structured repository... |
|
Experimental |
| 121 |
NeaByteLab/Dev-Knowledge
Build searchable knowledge bases by scraping developer documentation and... |
|
Experimental |
| 122 |
anasM0hammad/formAI-ext
FormAI Extension is a Chrome extension that revolutionizes the job... |
|
Experimental |
| 123 |
salman-aziz-4425/my-courser
An AI powered code assistant with an IDE-like web interface. Index your... |
|
Experimental |
| 124 |
The-Focus-AI/embeddings-search-skill
Claude Code plugin for hybrid document search (grep + semantic embeddings) |
|
Experimental |
| 125 |
Lioness100/decimeta
A website to help you find the correct Dewey Decimal number for any subject using AI. |
|
Experimental |
| 126 |
GAUTAMSINGH102/CodeGen
A Smart VS-Code Extension for Developers from all around the Globe!! |
|
Experimental |