Compositional Reasoning Embeddings NLP Tools

Research implementations focusing on compositional reasoning, modular structures in language models, and contrastive learning methods for semantic representations. This category does not include general pre-training, task-specific applications, or single-language tools without a compositional focus.

This category tracks 73 compositional reasoning embeddings tools. Two score above 50 (Established tier). The highest-rated is princeton-nlp/SimCSE at 63/100, with 3,644 stars and 162 monthly downloads.

Get all 73 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=compositional-reasoning-embeddings&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
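The same request can be made from Python. The sketch below uses only the standard library and the URL and query parameters shown in the curl command above. The helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the returned JSON is not documented here, so the code returns the parsed payload without assuming any field names. Whether a larger `limit` returns all 73 projects in one call is also an assumption to verify against the API.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Endpoint taken from the curl command in this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="nlp",
              subcategory="compositional-reasoning-embeddings",
              limit=20):
    """Build the query URL (hypothetical helper; parameters mirror the curl example)."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Fetch the URL and return the decoded JSON payload as-is.

    No response fields are assumed; inspect the result before relying on it.
    """
    request = Request(url, headers={"Accept": "application/json"})
    with urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects(build_url())` performs one request, which counts against the daily quota mentioned above.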

#	Tool	Score	Tier	Stars	Language
1	princeton-nlp/SimCSE [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings...	63	Established	3,644	Python
2	n-waves/multifit The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual...	51	Established	284	Jupyter Notebook
3	yxuansu/SimCTG [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation	47	Emerging	475	Python
4	Shark-NLP/OpenICL OpenICL is an open-source framework to facilitate research, development, and...	46	Emerging	584	Python
5	alibaba-edu/simple-effective-text-matching Source code of the ACL2019 paper "Simple and Effective Text Matching with...	42	Emerging	340	Python
6	alibaba-edu/simple-effective-text-matching-pytorch A pytorch implementation of the ACL2019 paper "Simple and Effective Text...	40	Emerging	305	Python
7	KwangKa/SIMCSE_unsup 中文无监督SimCSE Pytorch实现	39	Emerging	135	Python
8	Alibaba-NLP/ACE [ACL-IJCNLP 2021] Automated Concatenation of Embeddings for Structured Prediction	39	Emerging	312	Python
9	xlang-ai/UnifiedSKG [EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with...	36	Emerging	569	Python
10	yueyu1030/COSINE [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained...	35	Emerging	206	Python
11	yumeng5/SuperGen [NeurIPS 2022] Generating Training Data with Language Models: Towards...	35	Emerging	69	Python
12	SAP-samples/acl2022-self-contrastive-decorrelation Source code for ACL 2022 paper "Self-contrastive Decorrelation for Sentence...	32	Emerging	26	Python
13	aangelopoulos/conformal-risk Conformal prediction for controlling monotonic risk functions. Simple...	32	Emerging	79	Python
14	mesnico/ALADIN Official implementation of the paper "ALADIN: Distilling Fine-grained...	32	Emerging	28	Python
15	j6mes/acl2021-factual-error-correction ACL 2021	32	Emerging	27	Python
16	allenai/dont-stop-pretraining Code associated with the Don't Stop Pretraining ACL 2020 paper	31	Emerging	540	Python
17	GaryYufei/ACL2021MF Source Code For ACL 2021 Paper "Mention Flags (MF): Constraining...	31	Emerging	20	Python
18	perceptiveshawty/RankCSE Implementation of "RankCSE: Unsupervised Sentence Representation Learning...	30	Emerging	48	Python
19	zhuang-li/FactualSceneGraph [ACL 2023 Findings] FACTUAL dataset, the textual scene graph parser trained...	30	Emerging	127	Python
20	L-Zhe/BTmPG Code for paper Pushing Paraphrase Away from Original Sentence: A Multi-Round...	29	Experimental	14	Python
21	songyang-dev/uml-translation-3step Official repository for the paper Yang et al. 2022	28	Experimental	9	Python
22	hexuandeng/Mono4SiMT The implementation for our paper, "Improving Simultaneous Machine...	28	Experimental	12	Python
23	SAP-samples/acl2023-micse Source code for ACL 2023 paper "miCSE: Mutual Information Contrastive...	27	Experimental	9	Python
24	OpenMatch/COCO-DR [EMNLP 2022] This is the code repo for our EMNLP’22 paper "COCO-DR:...	26	Experimental	50	Python
25	Smu-Tan/Remedy [EMNLP2025] Remedy: Learning Machine Translation Evaluation from Human...	26	Experimental	14	Python
26	cisnlp/Glot500 Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages...	25	Experimental	106	Python
27	yueyu1030/ReGen [ACL'23 Findings] This is the code repo for our ACL'23 Findings paper...	25	Experimental	24	Python
28	xlang-ai/icl-selective-annotation [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models...	25	Experimental	109	Python
29	LCS2-IIITD/ACL-FFLM Code for our ACL (Findings) Paper - Fingerprinting Fine-tuned Language...	25	Experimental	5	Python
30	jiacheng-ye/ZeroGen [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via...	25	Experimental	48	Python
31	rosewang2008/calibrate_your_listeners Calibrate your listeners! Robust communication-based training for pragmatic...	24	Experimental	4	Python
32	BM-K/KoSimCSE-SKT Simple Contrastive Learning of Korean Sentence Embeddings	24	Experimental	53	Python
33	Lingkai-Kong/Calibrated-BERT-Fine-Tuning Code for Paper: Calibrated Language Model Fine-Tuning for In- and...	24	Experimental	36	Python
34	limteng-rpi/mlmt Code for the paper "A Multi-lingual Multi-task Architecture for Low-resource...	24	Experimental	29	Python
35	YJiangcm/WebR [ACL 2025] Instruction-Tuning Data Synthesis from Scratch via Web Reconstruction	22	Experimental	11	Python
36	princeton-nlp/TRIME [EMNLP 2022] Training Language Models with Memory Augmentation...	22	Experimental	195	Python
37	TianduoWang/DiffAug [EMNLP 2022] Differentiable Data Augmentation for Contrastive Sentence...	22	Experimental	40	Python
38	chenllliang/MLS Source code of our paper "Focus on the Target’s Vocabulary: Masked Label...	22	Experimental	18	Python
39	SLAB-NLP/Linguistic-Blood-Bank Balanced Data Approach for Evaluating Cross-Lingual Transfer: Mapping the...	22	Experimental	9	Jupyter Notebook
40	xuanyuan14/ARES SIGIR'22 paper: Axiomatically Regularized Pre-training for Ad hoc Search	22	Experimental	23	Python
41	zjunlp/Revisit-KNN [CCL 2023] Revisiting k-NN for Fine-tuning Pre-trained Language Models	21	Experimental	10	Python
42	lifan-yuan/PLMCalibration Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained...	21	Experimental	11	Python
43	joisino/zeh Code for "Even GPT-5.2 Can’t Count to Five: The Case for Zero-Error Horizons...	21	Experimental	2	Python
44	YecanLee/Adaptive-Contrastive-Search [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive...	21	Experimental	41	Python
45	UKPLab/acl2021-metaphor-generation-conceptual This repository is for the paper Metaphor Generation with Conceptual...	20	Experimental	12	Python
46	alibaba/SimCSE-with-CARDS Source code for SIGIR 2022 paper.	20	Experimental	16	Python
47	yaushian/mSimCSE mSimCSE: Multilingual SimCSE	20	Experimental	33	Python
48	machelreid/afromt Code for the EMNLP 2021 Paper "AfroMT: Pretraining Strategies and...	19	Experimental	9	Python
49	yxuansu/TaCL [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning	19	Experimental	94	Python
50	zxlzr/DCL Code for the CICAI 2021 paper "Disentangled Contrastive Learning for...	18	Experimental	5	Python
51	LittletreeZou/Cross-Modal-Cloze-Task This repository is for ACL 2022 findings paper: Cross-Modal Cloze Task: A...	16	Experimental	4	Python
52	tejasvaidhyadev/Efficient-Co-RLSR Codebase accompanying the paper "Efficient Co-Regularised Least Squares Regression".	16	Experimental	3	Python
53	Sreyan88/CompA Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional...	15	Experimental	22	Python
54	megagonlabs/zett 🙈 Code for Zero-shot Triplet Extraction by Template Infilling...	15	Experimental	21	Python
55	jlin816/rewards-from-language Code and data for "Inferring Rewards from Language in Context" [ACL 2022].	15	Experimental	16	Python
56	orionw/MTLvsIFT Code for the paper "When to Use Multi-Task Learning vs Intermediate...	15	Experimental	6	Python
57	IndexFziQ/COMMA The code of COMMA: Modeling Relationship among Motivations, Emotions and...	14	Experimental	12	Python
58	tic-top/LoraCSE 😜Contrastive Learning of Sentence Embedding using LoRA (EECS487 final project)	14	Experimental	13	Jupyter Notebook
59	griff4692/calibrating-summaries This is the official PyTorch codebase for the ACL 2023 paper: "What are the...	14	Experimental	9	Python
60	THUNLP-MT/TRICE Code for our paper "Transfer Learning for Sequence Generation: from...	14	Experimental	11	Python
61	Yangyi-Chen/LM-TOAST Source code for ACL 2023 Findings paper "Making Pre-trained Language Models...	13	Experimental	7	Python
62	MattYoon/reasoning-models-confidence [NeurIPS 2025] Reasoning Models Better Express Their Confidence	13	Experimental	22	Python
63	CAS-SIAT-XinHai/NUMCoT [ACL 2024] NUMCoT: Numerals and Units of Measurement in Chain-of-Thought...	13	Experimental	5	Python
64	zwhe99/SelfTraining4UNMT [ACL 2022] Bridging the Data Gap between Training and Inference for...	12	Experimental	31	Python
65	itayle/diverse-demonstrations Diverse Demonstrations Improve In-context Compositional Generalization	12	Experimental	12	Python
66	tran-khoa/joint-training-cascaded-st Code for the paper "Does Joint Training Really Help Cascaded Speech...	12	Experimental	4	Python
67	yass-ML/slm-few-shot-optimization An empirical investigation into optimizing few-shot prompting strategies for...	12	Experimental	1	Python
68	LarsHill/pointer-guided-pre-training Code for the ECML 2024 paper "Pointer-Guided Pre-Training: Infusing Large...	12	Experimental	3	Python
69	ashutoshml/alleviating-inconsistency ACL 2022 (Findings): Striking a Balance: Alleviating Inconsistency in...	12	Experimental	3	Python
70	phanxuanphucnd/knowledge_tracing Model AKT for Knowledge Tracing	12	Experimental	3	Python
71	ltorroba/lms-from-mlms Repository for the ACL 2023 paper "Deriving Language Models from Masked...	11	Experimental	2	Python
72	SondreWold/comp_rep_study Official implementation for the ACL 2025 Main paper "Circuit Compositions:...	10	Experimental	1	Python
73	tyjiangU/physical_artifacts_function Code for the paper "Learning Prototypical Functions for Physical Artifacts"	10	Experimental	1	Python
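If the table above is copied as text, its rows can be turned into records for filtering or counting. The sketch below assumes the six columns are tab-separated and that the repository name is the first space-delimited token of the Tool column, with the rest being the description. `Row`, `parse_row`, and `tier_counts` are hypothetical names, and the sample holds four rows copied from the table.

```python
from collections import Counter
from typing import NamedTuple


class Row(NamedTuple):
    rank: int
    repo: str
    description: str
    score: int
    tier: str
    stars: int
    language: str


def parse_row(line: str) -> Row:
    """Parse one table row, assuming tab-separated columns."""
    rank, tool, score, tier, stars, language = line.rstrip("\n").split("\t")
    # Assumption: the repo slug comes first, followed by a space and the description.
    repo, _, description = tool.partition(" ")
    return Row(int(rank), repo, description, int(score), tier,
               int(stars.replace(",", "")), language)


def tier_counts(rows):
    """Count how many rows fall into each tier."""
    return Counter(row.tier for row in rows)


# Four rows copied from the table above.
SAMPLE_ROWS = [
    "1\tprinceton-nlp/SimCSE [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings...\t63\tEstablished\t3,644\tPython",
    "2\tn-waves/multifit The code to reproduce results from paper \"MultiFiT: Efficient Multi-lingual...\t51\tEstablished\t284\tJupyter Notebook",
    "18\tperceptiveshawty/RankCSE Implementation of \"RankCSE: Unsupervised Sentence Representation Learning...\t30\tEmerging\t48\tPython",
    "20\tL-Zhe/BTmPG Code for paper Pushing Paraphrase Away from Original Sentence: A Multi-Round...\t29\tExperimental\t14\tPython",
]

rows = [parse_row(line) for line in SAMPLE_ROWS]
print(tier_counts(rows))
```

In the table as listed, a score of 30 is labeled Emerging and 29 is labeled Experimental, so the tier column is read directly from each row rather than recomputed from the score.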

Comparisons in this category

SimCSE and RankCSE (63 vs 30)
simple-effective-text-matching and simple-effective-text-matching-pytorch (42 vs 40)