PageIndex and pageindex

These appear to be **competitors** — both implement hierarchical document indexing using LLM reasoning as an alternative to vector embeddings for RAG, with the starred VectifyAI version being the more established implementation.

PageIndex
65
Established
pageindex
22
Experimental
Maintenance 20/25
Adoption 10/25
Maturity 16/25
Community 19/25
Maintenance 13/25
Adoption 0/25
Maturity 9/25
Community 0/25
Stars: 21,374
Forks: 1,665
Downloads:
Commits (30d): 21
Language: Python
License: MIT
Stars:
Forks:
Downloads:
Commits (30d): 0
Language: Python
License: MIT
No Package No Dependents
No Package No Dependents

About PageIndex

VectifyAI/PageIndex

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

Builds a hierarchical tree-index from documents—similar to a machine-generated table of contents—then uses LLM reasoning to traverse the tree for retrieval, eliminating the need for vector databases or artificial chunking. Achieves 98.7% accuracy on FinanceBench by reasoning over document structure rather than semantic similarity. Integrates via self-hosted Python, MCP protocol, or cloud API, with support for vision-based retrieval directly from PDF page images.

About pageindex

Ashish4144/pageindex

Build hierarchical document indexes using LLM reasoning for intuitive navigation and retrieval without chunking, embeddings, or vector databases.

Scores updated daily from GitHub, PyPI, and npm data. How scores work