princeton-nlp/SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

63
/ 100
Established

Provides both unsupervised and supervised training approaches—unsupervised leverages dropout-based noise on unlabeled data, while supervised incorporates NLI entailment pairs as positives and contradictions as hard negatives. Integrates seamlessly with HuggingFace Transformers and offers efficient similarity search via optional Faiss support, with pre-trained checkpoints available across BERT and RoBERTa architectures.

3,644 stars and 162 monthly downloads. Used by 1 other package. No commits in the last 6 months. Available on PyPI.

Stale 6m
Maintenance 0 / 25
Adoption 16 / 25
Maturity 25 / 25
Community 22 / 25

How are scores calculated?

Stars

3,644

Forks

534

Language

Python

License

MIT

Last pushed

Oct 16, 2024

Monthly downloads

162

Commits (30d)

0

Reverse dependents

1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/princeton-nlp/SimCSE"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.