princeton-nlp/SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
Provides both unsupervised and supervised training approaches—unsupervised leverages dropout-based noise on unlabeled data, while supervised incorporates NLI entailment pairs as positives and contradictions as hard negatives. Integrates seamlessly with HuggingFace Transformers and offers efficient similarity search via optional Faiss support, with pre-trained checkpoints available across BERT and RoBERTa architectures.
3,644 stars and 162 monthly downloads. Used by 1 other package. No commits in the last 6 months. Available on PyPI.
Stars
3,644
Forks
534
Language
Python
License
MIT
Category
Last pushed
Oct 16, 2024
Monthly downloads
162
Commits (30d)
0
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/princeton-nlp/SimCSE"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
n-waves/multifit
The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model...
yxuansu/SimCTG
[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation
Shark-NLP/OpenICL
OpenICL is an open-source framework to facilitate research, development, and prototyping of...
alibaba-edu/simple-effective-text-matching
Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".
alibaba-edu/simple-effective-text-matching-pytorch
A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer...