upstash/wikipedia-semantic-search
Semantic Search on Wikipedia with Upstash Vector
Embeds 144M Wikipedia articles across 11 languages using BGE-M3 for multilingual semantic search, with namespace-based language separation in Upstash Vector. Combines vector similarity retrieval with Upstash RAG Chat SDK and Llama-3 LLM APIs to enable cross-lingual search and RAG chatbot functionality, using Redis for session persistence.
471 stars.
Stars
471
Forks
35
Language
TypeScript
License
MIT
Category
Last pushed
Dec 12, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/upstash/wikipedia-semantic-search"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
meilisearch/meilisearch
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
nuclia/nucliadb
NucliaDB, The AI Search database for RAG
vespa-engine/vespa
AI + Data, online. https://vespa.ai
PrithivirajDamodaran/FlashRank
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and...
ICIJ/datashare
A self‑hosted search engine for documents