Hironsan/bertsearch
Elasticsearch with BERT for advanced document search.
Converts document text into 768-dimensional BERT embeddings and stores them as dense vectors in Elasticsearch, enabling semantic similarity search beyond keyword matching. A Docker Compose setup orchestrates the BERT inference and Elasticsearch services, while Python scripts handle document vectorization and index creation. Supports multiple pretrained BERT variants (Base/Large, cased/uncased, multilingual) and includes a Flask web interface for querying indexed documents.
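The dense-vector approach described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the repository's actual code: the field names `text` and `text_vector`, the helper names, and the `+ 1.0` score offset are hypothetical choices, and the exact `cosineSimilarity` script syntax depends on the Elasticsearch version. The sketch only builds the mapping and query bodies as plain dictionaries and mimics the ranking locally, so it runs without a cluster or a BERT server. Note that 768 is the hidden size of BERT-Base; a Large model would need 1024 dimensions in the mapping.

```python
import math

DIMS = 768  # hidden size of BERT-Base (assumption: a Base model is used)


def build_mapping(dims=DIMS):
    """Index body with a text field and a dense_vector field (hypothetical names)."""
    return {
        "mappings": {
            "properties": {
                "text": {"type": "text"},
                "text_vector": {"type": "dense_vector", "dims": dims},
            }
        }
    }


def build_query(query_vector, size=10):
    """script_score query body that ranks all documents by cosine similarity."""
    return {
        "size": size,
        "query": {
            "script_score": {
                "query": {"match_all": {}},
                "script": {
                    # Adding 1.0 keeps scores non-negative, as Elasticsearch requires.
                    "source": "cosineSimilarity(params.query_vector, 'text_vector') + 1.0",
                    "params": {"query_vector": list(query_vector)},
                },
            }
        },
    }


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_locally(query_vector, docs):
    """Local stand-in for the search: docs is a list of (text, vector) pairs."""
    scored = [(cosine(query_vector, vec) + 1.0, text) for text, vec in docs]
    return sorted(scored, reverse=True)
```

In a real deployment the query vector would come from the BERT inference service and the two dictionaries would be sent to Elasticsearch through a client; here `rank_locally` only shows what the scoring script computes.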
898 stars. No commits in the last 6 months.
Stars: 898
Forks: 201
Language: Python
License: MIT
Category:
Last pushed: May 01, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/Hironsan/bertsearch"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.