naver/splade
SPLADE: sparse neural search (SIGIR21, SIGIR22)
Based on the README, here's a technical summary: Learns sparse query/document representations via BERT's MLM head with regularization, enabling efficient inverted-index retrieval while maintaining explicit lexical matching and interpretability advantages over dense methods. Implements multiple training approaches including hard-negative mining, distillation (MarginMSE, self-distil, ensemble-distil), and query-specific regularization; supports disjoint query/document encoders for efficiency parity with BM25. Built on Hydra for experiment management with end-to-end training, indexing, and retrieval pipelines integrated with MS MARCO and BEIR benchmarks, plus pre-trained models available on Hugging Face.
984 stars. No commits in the last 6 months.
Stars
984
Forks
94
Language
Python
License
—
Category
Last pushed
May 03, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/naver/splade"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
smart-on-fhir/cumulus-etl
Extract FHIR data, Transform with NLP and DEID tools, and then Load FHIR data into a SQL...
mirkosertic/FXDesktopSearch
A JavaFX based desktop search application.
opensemanticsearch/open-semantic-search
Open Source research tool to search, browse, analyze and explore large document collections by...
opensemanticsearch/open-semantic-etl
Python based Open Source ETL tools for file crawling, document processing (text extraction,...
bent10/boox
Search anything, instantly