definitive-io/code-indexer-loop
Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuously and efficiently updated.
ArchivedLeverages tree-sitter for syntax-aware code chunking across 16+ languages, breaking only at valid AST nodes to preserve semantic boundaries. Integrates LlamaIndex, LangChain, and ChromaDB for embedding generation and storage, with watchdog filesystem monitoring and MD5-based caching for efficient incremental re-indexing on file changes. Token-based chunk sizing (configurable per model) ensures retrieval fits LLM context windows while maintaining exact source reconstruction.
176 stars. No commits in the last 6 months.
Stars
176
Forks
9
Language
Python
License
Apache-2.0
Category
Last pushed
Apr 09, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/definitive-io/code-indexer-loop"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
cocoindex-io/cocoindex
Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if...
dtsola/xiaoyaosearch
小遥搜索,听懂你的话、看懂你的图,用AI找到本地任何文件。让搜索像聊天一样简单。XiaoyaoSearch: Understands your words, reads your...
Ryandonofrio3/osgrep
Open Source Semantic Search for your AI Agent
justincasher/lean-explore
A search engine for Lean 4 declarations
yoanbernabeu/grepai
Semantic Search & Call Graphs for AI Agents (100% Local)