patricktrainer/duckdb-embedding-search

Fast similarity search using DuckDB

34
/ 100
Emerging

Leverages OpenAI embeddings stored in DuckDB's columnar format for fast vector similarity comparisons, enabling semantic search across large datasets like Hacker News comments. The modular architecture separates concerns across connection management, embedding generation, and vector operations, with lazy embedding creation that calls the OpenAI API on-demand for new queries. Targets Python developers building semantic search applications who need lightweight, embedded similarity lookup without managing separate vector databases.

146 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 8 / 25

How are scores calculated?

Stars

146

Forks

7

Language

Python

License

MIT

Last pushed

Oct 30, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/patricktrainer/duckdb-embedding-search"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.