Code Repository Intelligence Embedding Tools

Tools for indexing, analyzing, and semantically searching code repositories using embeddings and AST parsing. Includes code understanding, commit message generation, and code-aware Q&A systems. Does NOT include general code review platforms, CI/CD tools, or non-semantic code search.

There are 126 code repository intelligence tools tracked. 9 score above 50 (established tier). The highest-rated is cocoindex-io/cocoindex at 65/100 with 6,438 stars. 3 of the top 10 are actively maintained.

Get all 126 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=code-repository-intelligence&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
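The same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the function names `build_url` and `fetch_projects` are hypothetical, the endpoint and query parameters are copied from the curl command above, and the response schema is not described on this page, so the decoded JSON is returned unmodified. The curl example passes `limit=20`; whether the API accepts a larger value to return all 126 projects in one call is an assumption to check against the service.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=20):
    """Build the request URL; `limit` defaults to the value in the curl example."""
    params = {
        "domain": "embeddings",
        "subcategory": "code-repository-intelligence",
        "limit": limit,
    }
    return f"{BASE_URL}?{urlencode(params)}"


def fetch_projects(limit=20, timeout=30):
    """Send the keyless GET request and decode the JSON body.

    The response schema is not documented here, so the decoded value
    is returned as-is rather than unpacked into named fields.
    """
    with urlopen(build_url(limit), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects()` performs one unauthenticated request, which counts against the 100 requests/day keyless quota mentioned above.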

# Tool Score Tier
1 cocoindex-io/cocoindex

Data transformation framework for AI. Ultra performant, with incremental...

65
Established
2 dtsola/xiaoyaosearch

XiaoyaoSearch: understands what you say, understands your images, and uses AI to find any local file. Makes search as simple as chatting. XiaoyaoSearch: Understands your...

65
Established
3 Ryandonofrio3/osgrep

Open Source Semantic Search for your AI Agent

64
Established
4 justincasher/lean-explore

A search engine for Lean 4 declarations

63
Established
5 yoanbernabeu/grepai

Semantic Search & Call Graphs for AI Agents (100% Local)

59
Established
6 infinilabs/coco-server

🥥 Coco AI Server - Search, Connect, Collaborate, AI-powered Enterprise...

56
Established
7 probelabs/probe

AI-friendly semantic code search engine for large codebases. Combines...

56
Established
8 starthackHQ/Contextinator

Turning messy repos into weapons of mass structured context.

54
Established
9 scarletkc/vexor

A semantic search engine for files and code.

52
Established
10 revokslab/codecrawl

🌊 Turn entire codebases into LLM-ready data. Extract data, search, and...

47
Emerging
11 SireJeff/k0ntext

AI Context Engineering - Intelligent context for Claude, Copilot, Cline, and...

44
Emerging
12 M9nx/CodexA

Codexa is a local semantic code intelligence CLI designed to help AI...

43
Emerging
13 cyberytti/ToolHunt

This is a local search engine to search for cybersecurity tools. It has...

43
Emerging
14 tomohiro-owada/devrag

Markdown vector search MCP server for Claude Code. Natural language search...

42
Emerging
15 pwrdrvr/ghcrawl

Terminal UI and local CLI for crawling GitHub issues and pull requests,...

42
Emerging
16 auyelbekov/rawq

Context retrieval engine for AI agents — semantic + lexical search over codebases

41
Emerging
17 jwizenfeld04/Echo-Guard

Semantic linting CLI that detects codebase redundancy created by AI coding agents.

40
Emerging
18 ShunsukeHayashi/context-and-impact

Unified Context-to-Execution pipeline: Obsidian semantic search + GitNexus...

40
Emerging
19 chrisfentiman/claude-context-cli

Auto-indexing CLI for claude-context-mcp (npm: claude-context-cli)

38
Emerging
20 SylphxAI/coderag

Lightning-fast semantic code search with AST chunking (15+ languages) -...

38
Emerging
21 REASY/k8s-ariadne-rs

Query Kubernetes with natural language by compiling English to Cypher. No...

37
Emerging
22 definitive-io/code-indexer-loop

Code Indexer Loop is a Python library for indexing and retrieving source...

36
Emerging
23 XiaoConstantine/sgrep

CLI for semantic grep

36
Emerging
24 mvp-scale/aOa

Semantic compression Claude and Gemini. 5 angles of O(1) indexed search —...

35
Emerging
25 wjddusrb03/diffmind

AI Code Review Memory - learns from your team's bug history and warns when...

35
Emerging
26 rekal-dev/rekal-cli

Git-anchored decentralised intent(conversation) ledger for teams who build with AI

34
Emerging
27 sagarmk/beacon-plugin

Semantic code search plugin for Claude Code using hybrid vector search +...

34
Emerging
28 SecrinLabs/secrin

The living wiki that writes itself.

34
Emerging
29 billlzzz10/bl1nk-mcp-server

Modular, audit-ready memory system combining knowledge graph, semantic...

32
Emerging
30 chiraag-kakar/context-nexus

Context Nexus is an AI-native backend platform and SDK for managing,...

32
Emerging
31 iagocavalcante/claude-turbo-search

Optimized file search and semantic indexing for large codebases in Claude Code

32
Emerging
32 jamie8johnson/cqs

Code intelligence and RAG for AI agents. Semantic search, call graphs,...

31
Emerging
33 zircote/rlm-rs-plugin

Claude Code plugin for processing documents 100x larger than context limits...

29
Experimental
34 BjornMelin/codex-prompt-refinery

Local Codex CLI prompt refinery: ingest JSON/JSONL histories, dedupe, embed...

26
Experimental
35 jsbattig/code-indexer

Python application to index code locally and support running server with...

25
Experimental
36 dvcdsys/code-index

Semantic code search powered by embeddings. Search your codebase by meaning,...

25
Experimental
37 del-delprilia/mt5-device-fingerprint-tool

🔒 Generate unique device fingerprints for MetaTrader 5, ensuring secure...

24
Experimental
38 Tomcat132025/odino

🔍 Discover and access your code quickly with Odino, a fast local semantic...

24
Experimental
39 Carlosagamez2021/AI-Indexing

🚀 Simplify code indexing with structured prompts and testing, optimizing...

24
Experimental
40 lemoal-t/oriongraphdb

🚀 Optimize AI context retrieval with OrionGraphDB, a powerful engine that...

23
Experimental
41 karmaniverous/jeeves-watcher

Filesystem watcher that keeps a Qdrant vector store in sync with document...

23
Experimental
42 Rchrdgt/codex-prompt-refinery

🛠️ Refine and organize your OpenAI Codex CLI prompts effortlessly, using...

23
Experimental
43 CoderDayton/semantic-cache-mcp

MCP server that reduces LLM token usage by 80%+ through intelligent file...

23
Experimental
44 inanitionnn/tagall

Pet project: letterbox alternative

23
Experimental
45 evoleinik/claude-grep

Search Claude Code session history. Regex + semantic (vector) search. Single...

23
Experimental
46 sert-xx/unified-blueprint

Documentation-as-Code middleware — structure Markdown docs into a Document...

23
Experimental
47 Shun0212/Owl-CLI

Semantic code search using vector embeddings. Search your codebase with...

23
Experimental
48 Sharper-Flow/lgrep

Dual-engine code intelligence for OpenCode: semantic code search plus symbol...

23
Experimental
49 damiandelmas/flex

Local search and retrieval for AI Agents

23
Experimental
50 Aliipou/codebase_intelligence

AI-powered codebase intelligence: semantic search, dependency analysis, and...

22
Experimental
51 AndySze/session-seek

Search your AI coding agent session history with semantic + keyword search...

22
Experimental
52 dbhavery/mcplex

MCP server for local AI models — expose Ollama, embeddings, and vision...

22
Experimental
53 sarupurisailalith/codebase-cortex

AI-powered documentation autopilot — commit code, docs update themselves....

22
Experimental
54 jsuppe/loom

🧵 Requirements traceability for AI-assisted development. Extract...

22
Experimental
55 The-Cloud-Clock-Work/agentibridge

MCP server that indexes Claude Code CLI transcripts and exposes them via 16...

22
Experimental
56 Eyram233/CodeRAG

Build semantic vector databases from code and docs to enable AI agents to...

22
Experimental
57 damartr23/FischROBLOX

Automate Roblox gameplay and improve scripting with FischROBLOX’s reliable...

22
Experimental
58 beeftapareseller/docs

🛠 Build powerful applications with Puter.js using comprehensive...

22
Experimental
59 Veddd27/MTPulse

🛠️ Manage MTProto proxies easily with MTPulse. Add, remove, and control...

22
Experimental
60 TheMailmans/deepindex

Local-first semantic code search + MCP tools for Claude Code. No cloud, no...

22
Experimental
61 usercise/spelunk

Local context engine for AI coding agents. Semantic code search, code graph,...

22
Experimental
62 tulinever/code-historian

🕵️‍♂️ Track code changes effortlessly with AI-powered history tools and...

22
Experimental
63 andycandy/CausewayAI

Next-gen semantic retrieval system. Combines the power of Qdrant vector...

22
Experimental
64 solomonneas/code-search-api

Local semantic code search with Ollama embeddings, SQLite, and hybrid...

22
Experimental
65 Aarchi-07/CausewayAI

Next-gen semantic retrieval system. Combines the power of Qdrant vector...

22
Experimental
66 siropkin/budi

Context buster for Claude Code. Local retrieval that injects the right code...

22
Experimental
67 ZAKZOUK406/claude-turbo-search

🔍 Streamline codebase navigation with fast file search and semantic indexing...

22
Experimental
68 quasilyte/vscode-gogrep

Structural, syntax-aware search for Go code for VS Code.

21
Experimental
69 Sammed101/FuzzAI

Intelligent fuzzing tool integrating LLM-driven wordlist selection,...

21
Experimental
70 sltnsrh/knowledge-base

Semantic search knowledge base with vector embeddings and Claude MCP integration

21
Experimental
71 maciek-O-digiaidev/CodeRAG

Intelligent codebase context engine for AI coding agents. Semantic RAG from...

21
Experimental
72 Ahmed-aleryani/claude-context-local-plugin

Claude Code plugin for semantic code search using claude-context-local MCP...

21
Experimental
73 usr-wwelsh/botdocs

Turn md into a pretty site with chatbot

20
Experimental
74 Pomilon/Kestr

High-performance daemon for real-time codebase indexing. Generates semantic...

20
Experimental
75 moabualruz/rice-search

A fully local, production-ready code search platform with hybrid BM25 +...

20
Experimental
76 ShadReyes/cortex-recall

Semantic code & git history search CLI — tree-sitter parsing, pluggable...

20
Experimental
77 souldriver007/karp-word-graph

AI-powered KJV Bible study companion for Claude Desktop. Semantic scripture...

19
Experimental
78 nshkrdotcom/portfolio_coder

Code Intelligence Platform: Repository analysis, semantic code search,...

19
Experimental
79 nshkrdotcom/portfolio_manager

AI-native personal project intelligence system - manage, track, and search...

19
Experimental
80 souldriver007/karp-bible-code

AI-assisted ELS (Equidistant Letter Spacing) Bible code research engine for...

19
Experimental
81 mvp-scale/aOa-legacy

5 angles. 1 attack. O(1) indexed search. Up to 95% fewer tokens per...

19
Experimental
82 jonwraymond/tooldiscovery

Tool registry, search, semantic indexing, and documentation

19
Experimental
83 Stahldavid/sensegrep

Semantic + structural code search for AI-native development

19
Experimental
84 AssahBismarkabah/42context

A Local Context Engine

17
Experimental
85 copyleftdev/tala

Intent-native narrative execution layer. Reimagines Linux shell history as a...

16
Experimental
86 PEACEBINFLOW/mindscript-search

Semantic & structural search engine for the MindScript ecosystem. Index...

16
Experimental
87 servesys-labs/oriongraphdb

A context database for AI agents. Multi-channel retrieval (semantic,...

15
Experimental
88 KirtiJha/code-historian

🕰️ AI-powered VS Code extension for code history tracking with RAG-based...

15
Experimental
89 ThinkerYzu/kb-indexer

LLM-powered knowledge base indexer that builds a growing semantic layer of...

15
Experimental
90 Siddhhhh/ai-code-intelligence

AI-powered platform that analyzes GitHub repositories using local LLM...

15
Experimental
91 Suh0161/CodeScope

Search deeper. Know your codebase. Intelligent codebase search and analysis...

15
Experimental
92 oroinc/documentation-markdown

Markdown variant for AI

14
Experimental
93 bhavesh-kalluru/genai-project-2026-03-26

AI-powered Python code review tool that detects anti-patterns using...

14
Experimental
94 bhavesh-kalluru/genai-project-2026-03-27

AI-powered CLI that analyzes git diffs and generates conventional commit...

14
Experimental
95 Sermilion/telegram-private-search

Local-first MCP server and Kotlin/JVM CLI for indexing private Telegram...

14
Experimental
96 moijafcor/glean

Ask plain-English questions about your projects — source code,...

14
Experimental
97 ftrou/Decodifier3.1

Deterministic method-first retrieval for AI coding agents.

14
Experimental
98 VenomEzra91/fisch-roblox-toolkit

Level Up Your Roblox Experience with FischROBLOX: Tips, Scripts, and Game Guides

14
Experimental
99 sathish-mass/codebase-intelligence-platform

AI-powered codebase intelligence platform with semantic search, grounded...

14
Experimental
100 pgib11/roblox-gameflow-toolkit

Ultimate Roblox Scripting Tools 2026 🚀 | Free Roblox Automation & Game Hacks...

14
Experimental
101 BenDavies1218/gitdex-semantic-search

Local code indexer with MCP-based semantic search. Parses Git repos with...

14
Experimental
102 Team-Indexa/indexa

Modern search engine focused on fast, structured, and AI-powered knowledge...

14
Experimental
103 parbhatkapila4/RepoDocs

An AI-Powered Code Documentation Platform Automated documentation engine...

14
Experimental
104 Agents365-ai/semanticscholar-skill

Claude Code skill for academic paper search using the Semantic Scholar API

14
Experimental
105 oceanremodeling/FischROBLOX

Automate Roblox game testing and development processes to improve efficiency...

14
Experimental
106 Dnzinnxz/binelek-vscode-extension

🔍 Explore and manage Binelek knowledge graphs and AI services with this VS...

14
Experimental
107 kuluruvineeth/openbeam

The open source Glean alternative. Enterprise search + AI agents across...

14
Experimental
108 PPierzc/hive

🐝🔍 Hive: A CLI Tool for Semantic Searching of Your Knowledge Base

13
Experimental
109 louisfghbvc/CppSeek

AI-Powered Semantic Search for C/C++

13
Experimental
110 souldriver007/karp-graph-lite

"Personal knowledge graph for Claude Desktop — remember, recall, connect"

13
Experimental
111 tanuj077/codeatlas

CodeAtlas: AI-powered code search and chat system using AST parsing,...

13
Experimental
112 josehu07/codetective

Takes code, gives AI authorship detection in five clicks 🔎

13
Experimental
113 MohammedNasserAhmed/CodeXpert

CodeXpert: A cutting-edge AI-powered code analysis tool leveraging...

12
Experimental
114 Lukasdias/opencontext

Semantic code search with ranked file matches and contextual line snippets...

12
Experimental
115 davidteren/code_grasp

A CLI tool that uses the Qodo-Embed-1-1.5B embedding model to analyze code,...

12
Experimental
116 Clownstein/Insight-Investigator

OSINT Investigator is a Chrome Extension & Discord Bot that captures and...

12
Experimental
117 gantumurbattumur/Github-aware-RAG

Semantic search across your starred and own GitHub repos, right inside VS...

12
Experimental
118 luanvenancio/design-extractor

A self-hosted backend that captures websites, extracts design signals, and...

11
Experimental
119 jasjeev013/Git-Insight-Orchestrator-Agent

It is an AI-powered tool that clones any GitHub repository, chunks and...

11
Experimental
120 NeaByteLab/AI-Indexing

Code indexing examples for converting source code into structured repository...

11
Experimental
121 NeaByteLab/Dev-Knowledge

Build searchable knowledge bases by scraping developer documentation and...

11
Experimental
122 anasM0hammad/formAI-ext

FormAI Extension is a Chrome extension that revolutionizes the job...

11
Experimental
123 salman-aziz-4425/my-courser

An AI powered code assistant with an IDE-like web interface. Index your...

11
Experimental
124 The-Focus-AI/embeddings-search-skill

Claude Code plugin for hybrid document search (grep + semantic embeddings)

11
Experimental
125 Lioness100/decimeta

A website to help you find the correct Dewey Decimal number for any subject using AI.

10
Experimental
126 GAUTAMSINGH102/CodeGen

A Smart VS-Code Extension for Developers from all around the Globe!!

10
Experimental

Comparisons in this category