All Transformer Models
6,968 models ranked by quality score · Page 23 of 70
| # | Model | Score | Tier |
|---|---|---|---|
| 2201 |
ManashJKonwar/NLP-Transformers
Transformer (BERT, GPT2, etc.) based Training Module for popular NLP tasks |
|
Emerging |
| 2202 |
Gen-Verse/ReasonFlux
[NeurIPS 2025 Spotlight] LLM post-training suite — featuring ReasonFlux,... |
|
Emerging |
| 2203 |
leliuga/cohere-configurations
Co:Here Inference configurations |
|
Emerging |
| 2204 |
Hon-Wong/VoRA
[Fully open] [Encoder-free MLLM] Vision as LoRA |
|
Emerging |
| 2205 |
nanowell/Differential-Transformer-PyTorch
PyTorch implementation of the Differential-Transformer architecture for... |
|
Emerging |
| 2206 |
X-iZhang/CCD
📷 CCD: Mitigating Hallucinations in Radiology MLLMs via Clinical Contrastive... |
|
Emerging |
| 2207 |
CLAIRE-Labo/quantile-reward-policy-optimization
Official codebase for "Quantile Reward Policy Optimization: Alignment with... |
|
Emerging |
| 2208 |
cifkao/context-probing
Black-box language model explanation by context length probing |
|
Emerging |
| 2209 |
nareshis21/Truelarge-RT
Android inference engine running 20B+ parameter LLMs on 4GB-8GB RAM devices.... |
|
Emerging |
| 2210 |
Hamtech-ai/Persian-Image-Captioning
A Persian Image Captioning model based on Vision Encoder Decoder Models of... |
|
Emerging |
| 2211 |
dougeeai/llama-cpp-python-wheels
Pre-built wheels for llama-cpp-python across platforms and CUDA versions |
|
Emerging |
| 2212 |
forgi86/sysid-transformers
Code to reproduce the results of the paper In-context learning for... |
|
Emerging |
| 2213 |
starmpcc/CAMEL
Clinically Adapted Model Enhanced from LLaMA |
|
Emerging |
| 2214 |
davide-coccomini/MINTIME-Multi-Identity-size-iNvariant-TIMEsformer-for-Video-Deepfake-Detection
Code for Video Deepfake Detector from "MINTIME: Multi-Identity... |
|
Emerging |
| 2215 |
suyash/mlt
Multilingual Neural Machine Translation using Transformers with Conditional... |
|
Emerging |
| 2216 |
PKU-Alignment/beavertails
BeaverTails is a collection of datasets designed to facilitate research on... |
|
Emerging |
| 2217 |
AntonioGr7/pratical-llms
A collection of hand on notebook for LLMs practitioner |
|
Emerging |
| 2218 |
CEC-Agent/CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for... |
|
Emerging |
| 2219 |
fboulnois/llm-leaderboard-csv
CSVs of the Huggingface and LMArena LLM leaderboards, along with the code to... |
|
Emerging |
| 2220 |
jorgemunozl/Finetunning-Llama-Vision-11b
Inference and finnetunning of a VLM (LLama Vision 11b) using the Unsloth,... |
|
Emerging |
| 2221 |
jakobtroidl/neuron-shape-reasoning
PyTorch Implementation of Global Neuron Shape Reasoning with Point Affinity... |
|
Emerging |
| 2222 |
ASSERT-KTH/repairllama
RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program... |
|
Emerging |
| 2223 |
ModelTC/QLLM
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate... |
|
Emerging |
| 2224 |
nestordemeure/stop_word
Huggingface transformers stopping criteria that halts the generation when a... |
|
Emerging |
| 2225 |
SqueezeAILab/KVQuant
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with... |
|
Emerging |
| 2226 |
henrikalbihn/gliner-as-a-service
GLiNER model in a FastAPI microservice. |
|
Emerging |
| 2227 |
Infini-AI-Lab/Sequoia
scalable and robust tree-based speculative decoding algorithm |
|
Emerging |
| 2228 |
sdpkjc/SATQuest
🏞 A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs |
|
Emerging |
| 2229 |
wang2226/Awesome-LLM-Decoding
📜 Paper list on decoding methods for LLMs and LVLMs |
|
Emerging |
| 2230 |
itsqyh/Awesome-LMMs-Mechanistic-Interpretability
A curated collection of resources focused on the Mechanistic... |
|
Emerging |
| 2231 |
NiuTrans/LaMaTE
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine... |
|
Emerging |
| 2232 |
moritztng/fltr
Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B. |
|
Emerging |
| 2233 |
DCQN-axiomatics/DCQN-Matrix-Axiomatik-LLM-Protocol
A strict, deterministic LLM protocol for loading, reading and activating the... |
|
Emerging |
| 2234 |
PathologyFoundation/plip
Pathology Language and Image Pre-Training (PLIP) is the first vision and... |
|
Emerging |
| 2235 |
ksm26/Open-Source-Models-with-Hugging-Face
"Open Source Models with Hugging Face" course empowers you with the skills... |
|
Emerging |
| 2236 |
MNoorFawi/curlora
The code repository for the CURLoRA research paper. Stable LLM continual... |
|
Emerging |
| 2237 |
CASE-Lab-UMD/Router-Tuning-Mixture-of-Depths
The open-source Mixture of Depths code and the official implementation of... |
|
Emerging |
| 2238 |
DestroyerDarkNess/fastvlm-webgpu
Real-time video captioning powered by FastVLM |
|
Emerging |
| 2239 |
zerovl/ZeroVL
[ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources |
|
Emerging |
| 2240 |
AkiRusProd/numpy-transformer
A numpy implementation of the Transformer model in "Attention is All You Need" |
|
Emerging |
| 2241 |
WayneMao/RoboMatrix
The Official Implementation of RoboMatrix |
|
Emerging |
| 2242 |
deep-div/PlotLLM
Data Visualization with LLM automatically analyzes data and generates... |
|
Emerging |
| 2243 |
antoninodimaggio/Hugging-Captions
Generate realistic Instagram captions using transformers 🤗 |
|
Emerging |
| 2244 |
HaoAreYuDong/MachineLearningLM
Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML |
|
Emerging |
| 2245 |
google/curie
Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long... |
|
Emerging |
| 2246 |
michaelnny/QLoRA-LLM
A simple custom QLoRA implementation for fine-tuning a language model (LLM)... |
|
Emerging |
| 2247 |
Tebmer/Awesome-Knowledge-Distillation-of-LLMs
This repository collects papers for "A Survey on Knowledge Distillation of... |
|
Emerging |
| 2248 |
Nikityyy/lille
A powerful 130-million-parameter model trained from scratch as part of a... |
|
Emerging |
| 2249 |
hesamsheikh/llm-mechanics
Coding an LLM and its building blocks from scratch. |
|
Emerging |
| 2250 |
OneInterface/realtime-bakllava
llama.cpp with BakLLaVA model describes what does it see |
|
Emerging |
| 2251 |
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO. |
|
Emerging |
| 2252 |
iKernels/transformers-lightning
A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses... |
|
Emerging |
| 2253 |
holarissun/RewardModelingBeyondBradleyTerry
official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models... |
|
Emerging |
| 2254 |
hpdps-group/ElasticMM
ElasticMM: Elastic and Efficient MLLM Serving System |
|
Emerging |
| 2255 |
rezazad68/transdeeplab
TransDeepLab: Convolution-Free Transformer-based DeepLab v3+ for Medical... |
|
Emerging |
| 2256 |
RAHB-REALTORS-Association/email-autodrafts
Email Auto-ReplAI is a Python tool that uses AI to automate drafting... |
|
Emerging |
| 2257 |
Pengxin-Guo/FedSA-LoRA
Selective Aggregation for Low-Rank Adaptation in Federated Learning [ICLR 2025] |
|
Emerging |
| 2258 |
jonrbates/turing
A PyTorch library for simulating Turing machines with neural networks, based... |
|
Emerging |
| 2259 |
Uralstech/vid-orca
Deploy LLaMA-2 Chat on Google Cloud. |
|
Emerging |
| 2260 |
srsawant34/efficient_instruction_learning
Code base for the paper "Instruction Tuned Models are Quick Learners". |
|
Emerging |
| 2261 |
Riccorl/llama-trainer
Llama Trainer Utility |
|
Emerging |
| 2262 |
hollobit/GenAI_LLM_timeline
ChatGPT, GenerativeAI and LLMs Timeline |
|
Emerging |
| 2263 |
Anjum48/commonlitreadabilityprize
4th Place solution for the Kaggle CommonLit Readability Prize |
|
Emerging |
| 2264 |
declare-lab/TEAM
Our EMNLP 2022 paper on MCQA |
|
Emerging |
| 2265 |
MLD3/steerability
An open-source evaluation framework for measuring LLM steerability. |
|
Emerging |
| 2266 |
Srijan-D/LangChain-v0.2-HuggingFace-Llama3
This project integrates LangChain v0.2.6, HuggingFace Serverless Inference... |
|
Emerging |
| 2267 |
elephantmipt/compressors
A small library with distillation, quantization and pruning pipelines |
|
Emerging |
| 2268 |
graphcore-research/jax-scalify
JAX Scalify: end-to-end scaled arithmetics |
|
Emerging |
| 2269 |
chrisjob1021/transformer-examples
A collection of educational toy implementations and examples of key... |
|
Emerging |
| 2270 |
UBC-MDS/fixml
LLM Tool for effective test evaluation of ML projects with curated... |
|
Emerging |
| 2271 |
smitkiri/news-qa
Reading comprehension based question-answering model for news articles. |
|
Emerging |
| 2272 |
IIT-DM/BattleofLLMs
Benchmarks of LLMs with Conversational QA datasets. |
|
Emerging |
| 2273 |
HariomJangra/project-lumen
A 128M parameter language model built from scratch for learning how large... |
|
Emerging |
| 2274 |
loretoparisi/bert_text_classifier
Text Classification with BERT |
|
Emerging |
| 2275 |
akanyaani/miniLLAMA
A simplified LLAMA implementation for training and inference tasks. |
|
Emerging |
| 2276 |
jseeio/gpt2-tfjs
GPT2 with Tensorflow.js |
|
Emerging |
| 2277 |
YuanGongND/ltu
Code, Dataset, and Pretrained Models for Audio and Speech Large Language... |
|
Emerging |
| 2278 |
haesleinhuepf/vlm-pictionary
Play pictionary with Vision Language Models! |
|
Emerging |
| 2279 |
Esmail-ibraheem/Tinyllamas-pytorch
Tinyllamas🦙 is an Extensible advanced language model framework, inspired by... |
|
Emerging |
| 2280 |
Nondzu/LlamaTor
LlamaTor: Decentralized AI model sharing via BitTorrent for efficient,... |
|
Emerging |
| 2281 |
telekom/transformer-tools
Transformers Training Tools |
|
Emerging |
| 2282 |
Ajax0564/VyomAI
VyomAI: state-of-the-art NLP LLM Vision MultiModel transformers ... |
|
Emerging |
| 2283 |
songxiaoshuai/progco
Official Implementation of "ProgCo: Program Helps Self-Correction of Large... |
|
Emerging |
| 2284 |
DoubleVII/lithft
Pretrain, finetune any LLMs from huggingface on your own data. |
|
Emerging |
| 2285 |
wangcongcong123/transection
Transection: Transformers for English to Chinese Translation |
|
Emerging |
| 2286 |
monk1337/NanoPeft
The simplest repository & Neat implementation of different Lora methods for... |
|
Emerging |
| 2287 |
pat-jj/KG-FIT
[NeurIPS'24] Knowledge Graph Fine-Tuning using LLMs |
|
Emerging |
| 2288 |
microsoft/MMLU-CF
A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025] |
|
Emerging |
| 2289 |
jianzhnie/LLMToolkit
LLMToolkit is a toolkit for NLP(Natural Language Processing) and LLM(Large... |
|
Emerging |
| 2290 |
daskol/llama.py
Python bindings to llama.cpp |
|
Emerging |
| 2291 |
sail-sg/dice
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards |
|
Emerging |
| 2292 |
detsutut/ama-bot
A modern and lightweight NLP interface for Question-Answering systems and... |
|
Emerging |
| 2293 |
yaojin17/Unlearning_LLM
[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large... |
|
Emerging |
| 2294 |
notAI-tech/Anuvaad
State of the art open-source translation for Indic languages. |
|
Emerging |
| 2295 |
rkinas/reasoning_models_how_to
This repository serves as a collection of research notes and resources on... |
|
Emerging |
| 2296 |
krishnapriya-18/COVID-19-Tweet-Classification-using-Roberta-and-Bert-Simple-Transformers
Rank 1 / 216 |
|
Emerging |
| 2297 |
duyhominhnguyen/Exgra-Med
[NeurIPS 2025] ExGra-Med: Medical Multi-Modal LLM with Extended Context Alignment |
|
Emerging |
| 2298 |
hasanisaeed/C-Transformer
Implementation of the core Transformer architecture in pure C |
|
Emerging |
| 2299 |
SORRY-Bench/sorry-bench
Benchmark evaluation code for "SORRY-Bench: Systematically Evaluating Large... |
|
Emerging |
| 2300 |
WooooDyy/BAPO
Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for... |
|
Emerging |