Model Evaluation and Diagnostics for Transformer Models
Tools for systematically evaluating, diagnosing, and benchmarking transformer models across NLI, WSD, and other NLP tasks using standard test sets and evaluation frameworks. Does NOT include general model training, fine-tuning without evaluation focus, or language-specific model overviews.
48 model evaluation and diagnostics projects are tracked. One scores 50 or higher (Established tier). The highest-rated is minggnim/nlp-models at 50/100, with 2 stars and 201 monthly downloads.
Get all 48 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=model-evaluation-diagnostics&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
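The same request can be made from Python. The sketch below is a minimal example using only the standard library. The helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the JSON response is not documented here, so the code parses the body and returns it without assuming any field names. The default `limit=20` mirrors the curl example; whether the endpoint accepts larger values is an assumption to verify.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="transformers",
              subcategory="model-evaluation-diagnostics",
              limit=20):
    """Build the request URL with the same query parameters as the curl call."""
    query = urlencode({
        "domain": domain,
        "subcategory": subcategory,
        "limit": limit,
    })
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Fetch the URL and decode the body as JSON.

    The response schema is not assumed: the parsed object is returned
    unchanged for the caller to inspect.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example usage (performs a network request, counted against the daily limit):
#   data = fetch_projects(build_url())
#   print(type(data).__name__)
```

Keeping URL construction separate from the network call makes the query easy to check offline and helps avoid spending requests from the daily quota while experimenting.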
| # | Model | Score | Tier |
|---|---|---|---|
| 1 | minggnim/nlp-models: A repository for training transformer based models | | Established |
| 2 | IntelLabs/nlp-architect: A model library for exploring state-of-the-art deep learning topologies and... | | Emerging |
| 3 | yuanzhoulvpi2017/zero_nlp: Chinese NLP solutions (large models, data, models, training, inference) | | Emerging |
| 4 | LoicGrobol/zeldarose: Train transformer-based models. | | Emerging |
| 5 | CPJKU/wechsel: Code for WECHSEL: Effective initialization of subword embeddings for... | | Emerging |
| 6 | soldni/pyterrier_sentence_transformers: Create PyTerrier compatible dense indices using any sentence_transformers model | | Emerging |
| 7 | MahmoudWahdan/dialog-nlu: Tensorflow and Keras implementation of the state of the art researches in... | | Emerging |
| 8 | yuanzhoulvpi2017/quick_sentence_transformers: sentence-transformers to ONNX, making SBERT model inference faster | | Emerging |
| 9 | ukairia777/tensorflow-nlp-tutorial: Using TensorFlow, from text preprocessing to the latest models (Topic Models, BERT, GPT, LLMs) and their downstream... | | Emerging |
| 10 | HarderThenHarder/transformers_tasks: ⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification,... | | Emerging |
| 11 | g8a9/ferret: A python package for benchmarking interpretability techniques on Transformers. | | Emerging |
| 12 | sinanuozdemir/oreilly-bert-nlp: This repository contains code for the O'Reilly Live Online Training for BERT | | Experimental |
| 13 | Azure/nlp-samples: Japanese NLP sample codes | | Experimental |
| 14 | ManashJKonwar/NLP-Transformers: Transformer (BERT, GPT2, etc.) based Training Module for popular NLP tasks | | Experimental |
| 15 | polakowo/textai: Applications using state-of-the-art in NLP | | Experimental |
| 16 | shunk031/allennlp-shiba-model: AllenNLP integration for Shiba: Japanese CANINE model | | Experimental |
| 17 | rajaswa/indic-syntax-evaluation: Vyākarana: A Colorless Green Benchmark for Syntactic Evaluation in Indic Languages | | Experimental |
| 18 | ropensci/pangoling: An R package for estimating the log-probabilities of words in a given... | | Experimental |
| 19 | VirtualRoyalty/gan-plus-nlp: Generative adversarial approach to most popular NLP tasks | | Experimental |
| 20 | prajjwal1/generalize_lm_nli: Code for the EMNLP 2021 workshop paper "Generalization in NLI: Ways... | | Experimental |
| 21 | stevezheng23/fewshot_nlp_pt: Few-shot NLP in PyTorch | | Experimental |
| 22 | Nickil21/weakly-supervised-parsing: Official Code for our Findings of ACL 2022 paper: Co-training an... | | Experimental |
| 23 | matteomedioli/BERT-KG: Enriching Language Models Representations via Knowledge Graphs Regularisation | | Experimental |
| 24 | th789/mbr-for-nmt: Characterizing the performance of minimum Bayes risk (MBR) decoding for... | | Experimental |
| 25 | CyberAgentAILab/japanese-nli-model: This repository provides the code for Japanese NLI model, a fine-tuned... | | Experimental |
| 26 | proycon/deepfrog: An NLP-suite powered by deep learning | | Experimental |
| 27 | ai-forever/model-zoo: NLP model zoo for Russian | | Experimental |
| 28 | Beomi/transformers-language-modeling: Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3 | | Experimental |
| 29 | yucc2018/share: Sharing some coding practice. | | Experimental |
| 30 | TRISTAN-ORF/RiboTIE: Scripts and instructions to apply RiboTIE on Ribo-seq data | | Experimental |
| 31 | ishan00/meta-learning-for-multi-task-multilingual: Official Repository for the paper titled "Meta-Learning for Effective... | | Experimental |
| 32 | DFKI-NLP/gevalm: Code and data for the paper "Evaluating German Transformer Language Models... | | Experimental |
| 33 | hppRC/simple-simcse-ja: Exploring Japanese SimCSE | | Experimental |
| 34 | zhestyatsky/MCL-WiC: Research on Multilingual and Cross-lingual Word-in-Context Disambiguation | | Experimental |
| 35 | SapienzaNLP/xl-wsd-code: Code to train and test Word Sense Disambiguation models based on different... | | Experimental |
| 36 | princeton-nlp/MultilingualAnalysis: Repository for the paper titled: "When is BERT Multilingual? Isolating... | | Experimental |
| 37 | aarnetalman/nli-with-transformers: Fine-tune transformers with NLI data | | Experimental |
| 38 | brihijoshi/granular-similarity-COLING-2020: Code for the paper "The Devil is in the Details: Evaluating Limitations of... | | Experimental |
| 39 | RobinSmits/Dutch-NLP-Experiments: This repository contains a number of experiments with Multi Lingual... | | Experimental |
| 40 | iamlxb3/UMAMGT: Code for the publication of LREC'22 | | Experimental |
| 41 | HannaAbiAkl/PSYCHIC: The official repository for the PSYCHIC model | | Experimental |
| 42 | TRISTAN-ORF/RiboTIE_article: Scripts run to produce the RiboTIE paper | | Experimental |
| 43 | skomban/seq-unscrambler: Unscrambles shuffled letters in a word sequence. | | Experimental |
| 44 | DudalaShrujana/nlp-transformers-toolkit: Modular NLP pipeline utilizing Hugging Face Transformers for Sentiment... | | Experimental |
| 45 | mhdr3a/transformers-diagnostics: Model Evaluation using SuperGLUE Diagnostic Dataset | | Experimental |
| 46 | bglid/haitian-creole-nlu: Project designed to reimplement and build upon CreoleVal's Reading... | | Experimental |
| 47 | loubnabnl/canine-mednli: CANINE for Medical Natural Language Inference on MedNLI data, as part of the... | | Experimental |
| 48 | SambhawDrag/XLNet.jl: A Julia-based implementation of XLNet: A Generalized Autoregressive... | | Experimental |