Compositional Reasoning Embeddings NLP Tools

Research implementations focusing on compositional reasoning, modular structures in language models, and contrastive learning methods for semantic representations. Does NOT include general pre-training, task-specific applications, or single-language tools without compositional focus.

There are 73 compositional reasoning embeddings tools tracked. 2 score above 50 (established tier). The highest-rated is princeton-nlp/SimCSE at 63/100 with 3,644 stars and 162 monthly downloads.

Get all 73 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=compositional-reasoning-embeddings&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 princeton-nlp/SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings...

63
Established
2 n-waves/multifit

The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual...

51
Established
3 yxuansu/SimCTG

[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation

47
Emerging
4 Shark-NLP/OpenICL

OpenICL is an open-source framework to facilitate research, development, and...

46
Emerging
5 alibaba-edu/simple-effective-text-matching

Source code of the ACL2019 paper "Simple and Effective Text Matching with...

42
Emerging
6 alibaba-edu/simple-effective-text-matching-pytorch

A pytorch implementation of the ACL2019 paper "Simple and Effective Text...

40
Emerging
7 KwangKa/SIMCSE_unsup

中文无监督SimCSE Pytorch实现

39
Emerging
8 Alibaba-NLP/ACE

[ACL-IJCNLP 2021] Automated Concatenation of Embeddings for Structured Prediction

39
Emerging
9 xlang-ai/UnifiedSKG

[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with...

36
Emerging
10 yueyu1030/COSINE

[NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained...

35
Emerging
11 yumeng5/SuperGen

[NeurIPS 2022] Generating Training Data with Language Models: Towards...

35
Emerging
12 SAP-samples/acl2022-self-contrastive-decorrelation

Source code for ACL 2022 paper "Self-contrastive Decorrelation for Sentence...

32
Emerging
13 aangelopoulos/conformal-risk

Conformal prediction for controlling monotonic risk functions. Simple...

32
Emerging
14 mesnico/ALADIN

Official implementation of the paper "ALADIN: Distilling Fine-grained...

32
Emerging
15 j6mes/acl2021-factual-error-correction

ACL 2021

32
Emerging
16 allenai/dont-stop-pretraining

Code associated with the Don't Stop Pretraining ACL 2020 paper

31
Emerging
17 GaryYufei/ACL2021MF

Source Code For ACL 2021 Paper "Mention Flags (MF): Constraining...

31
Emerging
18 perceptiveshawty/RankCSE

Implementation of "RankCSE: Unsupervised Sentence Representation Learning...

30
Emerging
19 zhuang-li/FactualSceneGraph

[ACL 2023 Findings] FACTUAL dataset, the textual scene graph parser trained...

30
Emerging
20 L-Zhe/BTmPG

Code for paper Pushing Paraphrase Away from Original Sentence: A Multi-Round...

29
Experimental
21 songyang-dev/uml-translation-3step

Official repository for the paper Yang et al. 2022

28
Experimental
22 hexuandeng/Mono4SiMT

The implementation for our paper, "Improving Simultaneous Machine...

28
Experimental
23 SAP-samples/acl2023-micse

Source code for ACL 2023 paper "miCSE: Mutual Information Contrastive...

27
Experimental
24 OpenMatch/COCO-DR

[EMNLP 2022] This is the code repo for our EMNLP‘22 paper "COCO-DR:...

26
Experimental
25 Smu-Tan/Remedy

[EMNLP2025] Remedy: Learning Machine Translation Evaluation from Human...

26
Experimental
26 cisnlp/Glot500

Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages...

25
Experimental
27 yueyu1030/ReGen

[ACL'23 Findings] This is the code repo for our ACL'23 Findings paper...

25
Experimental
28 xlang-ai/icl-selective-annotation

[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models...

25
Experimental
29 LCS2-IIITD/ACL-FFLM

Code for our ACL (Findings) Paper - Fingerprinting Fine-tuned Language...

25
Experimental
30 jiacheng-ye/ZeroGen

[EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via...

25
Experimental
31 rosewang2008/calibrate_your_listeners

Calibrate your listeners! Robust communication-based training for pragmatic...

24
Experimental
32 BM-K/KoSimCSE-SKT

Simple Contrastive Learning of Korean Sentence Embeddings

24
Experimental
33 Lingkai-Kong/Calibrated-BERT-Fine-Tuning

Code for Paper: Calibrated Language Model Fine-Tuning for In- and...

24
Experimental
34 limteng-rpi/mlmt

Code for the paper "A Multi-lingual Multi-task Architecture for Low-resource...

24
Experimental
35 YJiangcm/WebR

[ACL 2025] Instruction-Tuning Data Synthesis from Scratch via Web Reconstruction

22
Experimental
36 princeton-nlp/TRIME

[EMNLP 2022] Training Language Models with Memory Augmentation...

22
Experimental
37 TianduoWang/DiffAug

[EMNLP 2022] Differentiable Data Augmentation for Contrastive Sentence...

22
Experimental
38 chenllliang/MLS

Source code of our paper "Focus on the Target’s Vocabulary: Masked Label...

22
Experimental
39 SLAB-NLP/Linguistic-Blood-Bank

Balanced Data Approach for Evaluating Cross-Lingual Transfer: Mapping the...

22
Experimental
40 xuanyuan14/ARES

SIGIR'22 paper: Axiomatically Regularized Pre-training for Ad hoc Search

22
Experimental
41 zjunlp/Revisit-KNN

[CCL 2023] Revisiting k-NN for Fine-tuning Pre-trained Language Models

21
Experimental
42 lifan-yuan/PLMCalibration

Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained...

21
Experimental
43 joisino/zeh

Code for "Even GPT-5.2 Can’t Count to Five: The Case for Zero-Error Horizons...

21
Experimental
44 YecanLee/Adaptive-Contrastive-Search

[EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive...

21
Experimental
45 UKPLab/acl2021-metaphor-generation-conceptual

This repository is for the paper Metaphor Generation with Conceptual...

20
Experimental
46 alibaba/SimCSE-with-CARDS

Source code for SIGIR 2022 paper.

20
Experimental
47 yaushian/mSimCSE

mSimCSE: Multilingual SimCSE

20
Experimental
48 machelreid/afromt

Code for the EMNLP 2021 Paper "AfroMT: Pretraining Strategies and...

19
Experimental
49 yxuansu/TaCL

[NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning

19
Experimental
50 zxlzr/DCL

Code for the CICAI 2021 paper "Disentangled Contrastive Learning for...

18
Experimental
51 LittletreeZou/Cross-Modal-Cloze-Task

This repository is for ACL 2022 findings paper: Cross-Modal Cloze Task: A...

16
Experimental
52 tejasvaidhyadev/Efficient-Co-RLSR

Codebase accompanying the paper "Efficient Co-Regularised Least Squares Regression".

16
Experimental
53 Sreyan88/CompA

Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional...

15
Experimental
54 megagonlabs/zett

:see_no_evil: Code for Zero-shot Triplet Extraction by Template Infilling...

15
Experimental
55 jlin816/rewards-from-language

Code and data for "Inferring Rewards from Language in Context" [ACL 2022].

15
Experimental
56 orionw/MTLvsIFT

Code for the paper "When to Use Multi-Task Learning vs Intermediate...

15
Experimental
57 IndexFziQ/COMMA

The code of COMMA: Modeling Relationship among Motivations, Emotions and...

14
Experimental
58 tic-top/LoraCSE

😜Constrative Learning of Sentence Embedding using LoRA (EECS487 final project)

14
Experimental
59 griff4692/calibrating-summaries

This is the official PyTorch codebase for the ACL 2023 paper: "What are the...

14
Experimental
60 THUNLP-MT/TRICE

Code for our paper "Transfer Learning for Sequence Generation: from...

14
Experimental
61 Yangyi-Chen/LM-TOAST

Source code for ACL 2023 Findings paper "Making Pre-trained Language Models...

13
Experimental
62 MattYoon/reasoning-models-confidence

[NeurIPS 2025] Reasoning Models Better Express Their Confidence"

13
Experimental
63 CAS-SIAT-XinHai/NUMCoT

[ACL 2024] NUMCoT: Numerals and Units of Measurement in Chain-of-Thought...

13
Experimental
64 zwhe99/SelfTraining4UNMT

[ACL 2022] Bridging the Data Gap between Training and Inference for...

12
Experimental
65 itayle/diverse-demonstrations

Diverse Demonstrations Improve In-context Compositional Generalization

12
Experimental
66 tran-khoa/joint-training-cascaded-st

Code for the paper "Does Joint Training Really Help Cascaded Speech...

12
Experimental
67 yass-ML/slm-few-shot-optimization

An empirical investigation into optimizing few-shot prompting strategies for...

12
Experimental
68 LarsHill/pointer-guided-pre-training

Code for the ECML 2024 paper "Pointer-Guided Pre-Training: Infusing Large...

12
Experimental
69 ashutoshml/alleviating-inconsistency

ACL 2022 (Findings): Striking a Balance: Alleviating Inconsistency in...

12
Experimental
70 phanxuanphucnd/knowledge_tracing

Model AKT for Knowledge Tracing

12
Experimental
71 ltorroba/lms-from-mlms

Repository for the ACL 2023 paper "Deriving Language Models from Masked...

11
Experimental
72 SondreWold/comp_rep_study

Official implementation for the ACL 2025 Main paper "Circuit Compositions:...

10
Experimental
73 tyjiangU/physical_artifacts_function

Code for the paper "Learning Prototypical Functions for Physical Artifacts"

10
Experimental