shibing624/TreeSearch
TreeSearch: Structure-aware document retrieval without embeddings. 毫秒检索万级文档和大型代码库,并保留文档结构。
Builds hierarchical tree structures from documents using language-specific parsers (Python AST, regex for Java/Go/JS/C++, heading extraction for Markdown) and indexes them with SQLite FTS5, eliminating the need for embeddings or vector databases. Supports mixed input types (directories, files, globs) with auto-discovery and `.gitignore` respect, plus dual search modes—tree traversal for hierarchical documents and flat FTS5 for code. Includes ripgrep-accelerated GrepFilter for precise symbol matching and optional extractors for PDF/DOCX/HTML via optional dependencies.
Stars
75
Forks
8
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 13, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/shibing624/TreeSearch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.