patricktrainer/duckdb-embedding-search
Fast similarity search using DuckDB
Leverages OpenAI embeddings stored in DuckDB's columnar format for fast vector similarity comparisons, enabling semantic search across large datasets like Hacker News comments. The modular architecture separates concerns across connection management, embedding generation, and vector operations, with lazy embedding creation that calls the OpenAI API on-demand for new queries. Targets Python developers building semantic search applications who need lightweight, embedded similarity lookup without managing separate vector databases.
146 stars. No commits in the last 6 months.
Stars
146
Forks
7
Language
Python
License
MIT
Category
Last pushed
Oct 30, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/patricktrainer/duckdb-embedding-search"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
meilisearch/meilisearch
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
nuclia/nucliadb
NucliaDB, The AI Search database for RAG
vespa-engine/vespa
AI + Data, online. https://vespa.ai
PrithivirajDamodaran/FlashRank
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and...
ICIJ/datashare
A self‑hosted search engine for documents