Vector DB From Scratch Vector Databases
Educational and minimalist vector database implementations built to understand core concepts and internals. Includes toy/learning projects, lightweight engines, and pure-Python implementations prioritizing clarity over production features. Does NOT include enterprise databases, managed services, or specialized implementations (embedded SQLite variants, REST API wrappers, or domain-specific systems like NFT databases).
There are 157 vector db from scratch tools tracked. 1 score above 70 (verified tier). The highest-rated is MariaDB/server at 76/100 with 7,297 stars. 4 of the top 10 are actively maintained.
Get all 157 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=vector-db&subcategory=vector-db-from-scratch&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
MariaDB/server
MariaDB server is a community developed fork of MySQL server. Started by... |
|
Verified |
| 2 |
infiniflow/infinity
The AI-native database built for LLM applications, providing incredibly fast... |
|
Established |
| 3 |
AlayaDB-AI/AlayaLite
AlayaLite – A Fast, Flexible Vector Database for Everyone. |
|
Established |
| 4 |
oceanbase/seekdb
The AI-Native Search Database. Unifies vector, text, structured and... |
|
Established |
| 5 |
schwabauerbriantomas-gif/m2m-vector-search
Edge Vector search engine with Vulkan GPU acceleration, hierarchical... |
|
Established |
| 6 |
gusye1234/nano-vectordb
A simple, easy-to-hack Vector Database |
|
Established |
| 7 |
nnethercott/hannoy
Production-ready KV-backed HNSW implementation in Rust using LMDB |
|
Established |
| 8 |
dingodb/dingo
A multi-modal vector database that supports upserts and vector queries using... |
|
Established |
| 9 |
endee-io/endee
Endee.io – A high-performance vector database, designed to handle up to 1B... |
|
Established |
| 10 |
zilliztech/knowhere
Vector search engine inside Milvus, integrating FAISS, HNSW, DiskANN. |
|
Established |
| 11 |
dingodb/dingo-store
A distributed Key-Value Storage using Raft |
|
Established |
| 12 |
MinishLab/vicinity
Lightweight Nearest Neighbors with Flexible Backends |
|
Established |
| 13 |
muxi-ai/faissx
High-performance remote FAISS server for vector similarity search, with full... |
|
Established |
| 14 |
jina-ai/vectordb
A Python vector database you just need - no more, no less. |
|
Established |
| 15 |
VectorDB-NTU/RaBitQ-Library
A lightweight library for the RaBitQ algorithm and its applications in vector search. |
|
Established |
| 16 |
datawhalechina/easy-vecdb
📚 从零开始的向量数据库原理与实践教程,在线阅读地址:https://easy-vecdb.datawhale.cc/ |
|
Established |
| 17 |
thustorage/PipeANN
A low-latency, billion-scale, and updatable graph-based vector store on SSD. |
|
Emerging |
| 18 |
varshith-Git/valori
A high-performance vector database library for Python that provides... |
|
Emerging |
| 19 |
Veeresh-Hanni/DBDuck
Universal Data Object Model in Pytghon for SQL, Nosql, Graph, Vector DBMS |
|
Emerging |
| 20 |
vortezwohl/Bhakti
An easy-to-use vector database. |
|
Emerging |
| 21 |
rapidsai/cuvs-lucene
A Lucene codec for vector search and clustering on the GPU |
|
Emerging |
| 22 |
pomagrenate/pomaidb
PomaiDB Vector Database for low performance devices |
|
Emerging |
| 23 |
ejaasaari/lorann
Approximate Nearest Neighbor search using reduced-rank regression, with... |
|
Emerging |
| 24 |
AutoCookies/pomaidb
PomaiDB Vector Database for low performance devices |
|
Emerging |
| 25 |
nickna/Neighborly
An open-source vector database |
|
Emerging |
| 26 |
BBC-Esq/VectorDB-Plugin
Program that lets you ask questions about your documents including audio and... |
|
Emerging |
| 27 |
syalia-srl/beaver
All-in-one, pure-python, embedded database for relational data, documents,... |
|
Emerging |
| 28 |
vinerya/faiss_vector_aggregator
This Python library provides a suite of advanced methods for aggregating... |
|
Emerging |
| 29 |
epsilla-cloud/vectordb
Epsilla is a high performance Vector Database Management System |
|
Emerging |
| 30 |
VectorDB-NTU/Extended-RaBitQ
[SIGMOD 2025] Practical and Asymptotically Optimal Quantization of... |
|
Emerging |
| 31 |
ZeusDB/zeusdb
High-performance database management system |
|
Emerging |
| 32 |
makr-code/ThemisDB
Themis Database System - High-performance C++ hybrid-database... |
|
Emerging |
| 33 |
cgtuebingen/ggnn
GGNN: State of the Art Graph-based GPU Nearest Neighbor Search |
|
Emerging |
| 34 |
feather-store/feather
Embedded vector database + living context engine Part of Hawky.ai —... |
|
Emerging |
| 35 |
1yefuwang1/vectorlite
Fast, SQL powered, in-process vector search for any language with an SQLite driver |
|
Emerging |
| 36 |
knowusuboaky/VectrixDB
Where vectors come alive - A lightweight, visual-first vector database with... |
|
Emerging |
| 37 |
vitrivr/cottontaildb
Cottontail DB is a column store vector database aimed at multimedia... |
|
Emerging |
| 38 |
ShravanSunder/hnswlib-wasm
hnswlib-wasm attempts to create a browser friendly version of hnswlib |
|
Emerging |
| 39 |
Dripfarm/SVDB
Swift Vector Database. On-device, local vector database for building the... |
|
Emerging |
| 40 |
sauravniraula/fastembed-vectorstore
In-memory vector store with FastEmbed integration for Python applications. |
|
Emerging |
| 41 |
mmilunovic/m2vdb
vector db built by someone with no idea how to build a vector db |
|
Emerging |
| 42 |
MenxLi/tiny_vectordb
A small and fast Python JIT vector database |
|
Emerging |
| 43 |
BirchKwok/lynsedb
A pure Python-implemented, lightweight, server-optional, multi-end... |
|
Emerging |
| 44 |
krishcdbry/nexadb
NexaDB - A lightweight NoSQL database with vector search, TOON format, and... |
|
Emerging |
| 45 |
0xDebabrata/citrus
(distributed) vector database |
|
Emerging |
| 46 |
MChatzakis/DARTH
[SIGMOD 2026] DARTH: Declarative Recall Through Early Termination for... |
|
Emerging |
| 47 |
wibyuan/easyANN
This project implements 30+ variants of ANN algorithms to find the K nearest... |
|
Emerging |
| 48 |
atasoglu/sqlite-vec-client
A lightweight Python client around sqlite-vec for CRUD and similarity search. |
|
Emerging |
| 49 |
tylerpuig/tinyvec
TinyVecDB is an ultra fast embedded vector database. |
|
Emerging |
| 50 |
replikativ/proximum
Versioned, fast and scalable nearest neighbor search. |
|
Experimental |
| 51 |
thewebscraping/crossvector
Production-ready Python vector database library with unified API for... |
|
Experimental |
| 52 |
lynnlangit/learning-nosql
Companion repository to Linked In Learning course 'Cloud NoSQL for SQL Pros' |
|
Experimental |
| 53 |
firstbatchxyz/hollowdb-vector
A decentralized vector database for building vector search applications |
|
Experimental |
| 54 |
mihirahuja1/vectorwrap
Universal vector search wrapper for Postgres, MySQL, SQLite (pgvector,... |
|
Experimental |
| 55 |
sarabesh/PuppyDB
This is an experimental learning project to explore how vector databases... |
|
Experimental |
| 56 |
prrao87/db-hub-fastapi
Async bulk data ingestion and querying in various document, graph and vector... |
|
Experimental |
| 57 |
ToucanDB/ToucanDB
ToucanDB is a brand-new micro ML-first database engine 🦜 |
|
Experimental |
| 58 |
antarys-ai/python
Python client for Antarys vector database, optimized for large-scale vector... |
|
Experimental |
| 59 |
JadenGeller/similarity-topology
Efficient nearest neighbor search in Swift |
|
Experimental |
| 60 |
EmbedInAI/EmbedInDB
A vector database that empowers AI with persistent memory |
|
Experimental |
| 61 |
ashvardanian/JaccardIndex
Optimizing bit-level Jaccard Index and Population Counts for large-scale... |
|
Experimental |
| 62 |
mantzaris/LMDiskANN.jl
Julia Implementation of Low Memory Disk ANN (LM-DiskANN) |
|
Experimental |
| 63 |
maurocanuto/mempack
MemPack is a blazing-fast, lightweight alternative to heavy vector... |
|
Experimental |
| 64 |
klu-ai/EmbedKit
Swift library extending MLX Embed |
|
Experimental |
| 65 |
vortezwohl/Dipamkara
A light-weight vector database engine. |
|
Experimental |
| 66 |
skyzh/write-you-a-vector-db
A Vector Database Tutorial (over CMU-DB's BusTub system) |
|
Experimental |
| 67 |
rajathshttgr/zoro-db
A Vector Search Engine Built from Scratch in C++ |
|
Experimental |
| 68 |
vital-ai/vital-vitalsigns-python
Knowledge Model Runtime, Ontology management, and interface to Graph and... |
|
Experimental |
| 69 |
nhevers/vecstore
lightweight vector store with HNSW indexing |
|
Experimental |
| 70 |
ericmillsio/whiplash
Serverless, lightweight, and fast vector database on top of DynamoDB |
|
Experimental |
| 71 |
torinriley/VecStream
Efficient, scalable, and lightweight vector database |
|
Experimental |
| 72 |
NachoBrito/vulcano
An in-process, lightweight vector database written in modern Java |
|
Experimental |
| 73 |
UnrealJon/DTDR
Transform-domain representation enabling 3–4× storage reduction with direct... |
|
Experimental |
| 74 |
MukundaKatta/thoth
Thoth — Embedded Vector Database. Embedded vector database (SQLite for vectors) |
|
Experimental |
| 75 |
gsavla6-hue/java-vector-database
High-performance Java vector database implementation with HNSW indexing,... |
|
Experimental |
| 76 |
lexxai/django-mariadb-vector-demo
A minimal demo project showing how to build article recommendations using... |
|
Experimental |
| 77 |
leitoooatr/PythonVectorDB
🗄️ Manage and search large vector datasets efficiently with this pure Python... |
|
Experimental |
| 78 |
britorbs/consciousdb
🗄️ Streamline data analysis with ConsciousDB, a vector database that... |
|
Experimental |
| 79 |
ribagolx10/crossvector
🔗 Simplify vector database operations with CrossVector, a unified Python... |
|
Experimental |
| 80 |
bosekarmegam/vecforge
VecForge is a universal, local-first Python vector database with enterprise... |
|
Experimental |
| 81 |
kroq86/mcp_vector_db
VectorDB MCP server |
|
Experimental |
| 82 |
rizquuula/pyvectordb
Python wrapper for many Vector Databases |
|
Experimental |
| 83 |
JaneaSystems/jecq
Faiss-based library for efficient similarity search |
|
Experimental |
| 84 |
ksm26/vector-databases-embeddings-applications
Unlock the power of vector databases with the "Vector Databases: from... |
|
Experimental |
| 85 |
tsvet01/quiverdb
Embeddable vector database for edge AI. Lightning-fast semantic search that... |
|
Experimental |
| 86 |
NDXDeveloper/formation-mariadb
🐬 Formation complète MariaDB 11.8 LTS en français . SQL, HA, DevOps,... |
|
Experimental |
| 87 |
QDL123/Periplus
A remote cache for vector databases which allows for a dynamically updated... |
|
Experimental |
| 88 |
starkdg/hftrie
index binary vectors for efficient nearest neighbor search |
|
Experimental |
| 89 |
ehsanghaffar/vector-store-api
This project aims to provide an efficient and scalable API for embedding and... |
|
Experimental |
| 90 |
krejciad/kramdb
Simple in-RAM database system |
|
Experimental |
| 91 |
oneKn8/VectorVault
HNSW approximate nearest neighbor engine from scratch in C++20. AVX2... |
|
Experimental |
| 92 |
jmelovich/VectorDatabasePluginUE
A vector 'database' plugin for Unreal Engine 5. Built for leveraging the... |
|
Experimental |
| 93 |
gifton/VectorCore
CPU-bound vector math library with SIMD optimization, distance metrics, and... |
|
Experimental |
| 94 |
PranavBhatP/velox-db
An hobby project to construct a fully functioning vector database from... |
|
Experimental |
| 95 |
doganarif/vectordb
In-memory vector database with pluggable indexing algorithms, metadata... |
|
Experimental |
| 96 |
vectordbpipe/vectorDBpipe
A modular text embedding and vector database pipeline for local and cloud... |
|
Experimental |
| 97 |
maticly/LabHub
OLTP to OLAP ETL + Semantic Search Engine |
|
Experimental |
| 98 |
jwill9999/Vector-DB-Service
A microservice that allows upload of documents from google services, and... |
|
Experimental |
| 99 |
jerryli99/jerry_vectorDB
A lightweight vector database |
|
Experimental |
| 100 |
amhoba/vector-search-db
A high-performance, persistent vector search engine written in C++17 with... |
|
Experimental |
| 101 |
Icingworld/dreamdb
轻量级向量数据库 |
|
Experimental |
| 102 |
shlokkvaishnav/nano-db
Persistent Vector Search Engine built from scratch featuring disk-based HNSW... |
|
Experimental |
| 103 |
mingyu-hkustgz/Res-Infer
Distance Computaion for Vector Databases |
|
Experimental |
| 104 |
VQLite/VQLite
VQLite - Simple and Lightweight Vector Search Engine based on Google ScaNN |
|
Experimental |
| 105 |
AlexHaborets/vectordb
A minimalistic, pure-Python vector database for semantic search and RAG... |
|
Experimental |
| 106 |
maylad31/vector_sqlite
Faiss with sqlite |
|
Experimental |
| 107 |
capybara-brain346/capybaradb
capybaradb - a toy Vector DB implementation from scratch in Python. Explore... |
|
Experimental |
| 108 |
oscarcitoz/vector-db
A FastAPI-based API for managing vector database operations like creating... |
|
Experimental |
| 109 |
atisharma/fvdb
Thin porcelain around the FAISS vector database. |
|
Experimental |
| 110 |
N2FlowJS/nbase
NBase is a high-performance vector database for efficient similarity search,... |
|
Experimental |
| 111 |
haja-k/mysql-to-pgvector-embeddings
vectorizing data from mysql database to vector so it can be used by LLM in... |
|
Experimental |
| 112 |
cmessin02-cmyk/Sentry-Vector-The-AI-Powered-Immutable-Ledger
A high-performance, C++ based Vector Database with HMAC-SHA256 blockchain... |
|
Experimental |
| 113 |
mingyu-hkustgz/RESQ
High-Ratio Vector Quantization |
|
Experimental |
| 114 |
JGalego/VektorDB
A minimal vector database for educational purposes. |
|
Experimental |
| 115 |
SherifSystems/PythonVectorDB
Pure Python vector database • int8 quantized • ~1100 QPS @ 50k vectors •... |
|
Experimental |
| 116 |
matthewwangg/vector-database
A performant in-memory vector database with an HNSW index, data persistence,... |
|
Experimental |
| 117 |
kanitakadusic/bsc-thesis
Vector Databases: Use Cases, Algorithms and Key Features |
|
Experimental |
| 118 |
hritik2002/local-vectordb
Local vector database with embeddings & semantic search. Uses HNSW for fast... |
|
Experimental |
| 119 |
TekilaSS/Educational-Vector-Database
📚 Learn to build and understand Vector Databases step-by-step in Arabic,... |
|
Experimental |
| 120 |
waynewbishop/quiver
Quiver is a Swift package that provides vector mathematics, numerical... |
|
Experimental |
| 121 |
Linco2749/duckdb-s22
🦆 Explore DuckDB's powerful features for efficient data analysis and easy... |
|
Experimental |
| 122 |
deathbeam/vectorspace
Directory file watcher for automatically creating and querying vector embeddings. |
|
Experimental |
| 123 |
LongmaoTeamTf/ant
Open-source vector database built to embedding similarity search |
|
Experimental |
| 124 |
yichunzhao/python-learning
Taking it slow and easy—Python, here I come. 🐍✨ |
|
Experimental |
| 125 |
AWeirdDev/vdb37
A simple vector database. |
|
Experimental |
| 126 |
Ronakagrwal000/vector-cache-optimizer
⚡ Optimize vector searches with a hyper-efficient cache that uses machine... |
|
Experimental |
| 127 |
yusupwinata/Basic-VectorDB
Build vector database using LangChain, Hugging Face, Chroma and FAISS. |
|
Experimental |
| 128 |
JohnnyHyytiainen/glossary_db
Personal Glossary Database to help keep track on terms and theory for school... |
|
Experimental |
| 129 |
takurot/Pyrope
Pyrope is a high-performance, adaptive Vector Database built as an extension... |
|
Experimental |
| 130 |
gifton/VectorAccelerate
Swift6 GPU-accelerated vector operations using Metal4 shaders for Apple... |
|
Experimental |
| 131 |
thkbit-labs/vecmodel
A model-based, ORM-inspired abstraction for vector databases. |
|
Experimental |
| 132 |
lcj2021/mini-ivf
A cute toy of IVF (PQ). |
|
Experimental |
| 133 |
gtfintechlab/Universal-NFT-Vector-Database
The Universal NFT Vector Database: A Scalable Vector Database for NFT... |
|
Experimental |
| 134 |
starkdg/mvptree
multiple vantage point distance-based tree data structure |
|
Experimental |
| 135 |
jballo/vector-db-engine
A FastAPI service that lets users create, read, update, and delete document... |
|
Experimental |
| 136 |
RKirlew/SoraDB-A-Lightweight-Vector-Database
SoraDB is a custom-built vector storage engine designed to manage and query... |
|
Experimental |
| 137 |
colbertdb/colbertdb
Open source ColBERT based document database |
|
Experimental |
| 138 |
rosaia/vecworks
Seamlessly manage vectorized data in Python |
|
Experimental |
| 139 |
ocramz/vectordb
Simple vector database based on annoy and sqlite3 |
|
Experimental |
| 140 |
B-R-P/VStore
Embedded key-value store with vector similarity search |
|
Experimental |
| 141 |
Maverick0351a/consciousdb
ConsciousDB – Your Vector Database Is the Model |
|
Experimental |
| 142 |
danilop/knn-search-algorithm-comparison
KNN Search Algorithm Comparison – This project compares the performance of... |
|
Experimental |
| 143 |
nathangtg/dbms-research
This is the repository for ZGQ (Zone Graph Quantization)m Which is now... |
|
Experimental |
| 144 |
Scintirete/Scintirete
Scintirete 是一款基于 HNSW 算法实现的、嵌入式友好的、面向生产的向量数据库。Scintirete is a lightweight,... |
|
Experimental |
| 145 |
patw/InstructorVec
Create dense vectors using the instructor-large model, running on CPU in... |
|
Experimental |
| 146 |
natenberenstein/deep-dive-databases
Knowledge base covering database internals -- storage engines, data models,... |
|
Experimental |
| 147 |
tweedge/vectordb-docker-base
Python 3.10-slim with VectorDB (vectordb2==0.1.9) and certain models... |
|
Experimental |
| 148 |
RasaiStewart/Vector-database-using-vectordb
My attempt to create a vector database to store the names of books I have... |
|
Experimental |
| 149 |
tanushachoudhary/VectorDB
A production-ready vector database system that stores document embeddings... |
|
Experimental |
| 150 |
Flagro/VecMetaQ
Server over Python Faiss serverless implementation to match interfaces used... |
|
Experimental |
| 151 |
NautilusDB-cloud/nautilusdb-cli
The simple client of NautilusDB, a Clound-Native Vector Search Service |
|
Experimental |
| 152 |
1226085293/MiniVectorDB
Lightweight, self-hosted Node.js vector database using WASM-based HNSW with... |
|
Experimental |
| 153 |
timothyckl/iota
a minimal local embedding database. |
|
Experimental |
| 154 |
FoxRav/RL-astradb-
Astra Vector DB on Python-paketti, joka tallentaa dokumentteja DataStax... |
|
Experimental |
| 155 |
yezz123/vectorai
A Vector Database REST API with custom indexing algorithms |
|
Experimental |
| 156 |
mingyu-hkustgz/LabelANN
Label Filtering Vector Similarity Search |
|
Experimental |
| 157 |
Md-Emon-Hasan/Vector-Database
Designed to store and retrieve high-dimensional data, such as embeddings,... |
|
Experimental |