msoedov/vector_lake
S3 vector database for LLM Agents and RAG.
Leverages Hierarchical Navigable Small World (HNSW) graphs for efficient approximate nearest neighbor search across distributed S3 shards, enabling cost-effective storage of massive vector datasets without database maintenance overhead. Supports custom data partitioning strategies and integrates with LangChain for RAG workflows, while offering flexible backend options including local volumes and network storage alongside S3.
Available on PyPI.
Stars
57
Forks
4
Language
Python
License
MIT
Category
Last pushed
Jan 28, 2026
Monthly downloads
16
Commits (30d)
0
Dependencies
8
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/msoedov/vector_lake"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
pixeltable/pixeltable
Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.
activeloopai/deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store,...
superlinked/VectorHub
VectorHub is a free, open-source learning website for people (software developers to senior ML...
hhblaze/DBreeze
C# .NET NOSQL ( key value, object store embedded TextSearch SemanticSearch Vector layer ) ACID...
nitaiaharoni1/vector-storage
Vector Storage is a vector database that enables semantic similarity searches on text documents...