All Transformer Models
6,427 models ranked by quality score · Page 5 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 401 |
microsoft/vidur
A large-scale simulation framework for LLM inference |
|
Emerging |
| 402 |
facebookresearch/LayerSkip
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative... |
|
Emerging |
| 403 |
yuriwa/crewai-sheets-ui
Use google sheets as a gui for crewAI |
|
Emerging |
| 404 |
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads |
|
Emerging |
| 405 |
yoshoku/llama_cpp.rb
llama_cpp.rb provides Ruby bindings for llama.cpp |
|
Emerging |
| 406 |
riyanshibohra/TuneKit
Upload your data → Get a fine-tuned SLM. Free. |
|
Emerging |
| 407 |
alephpi/Texo-web
The web application for Texo, a minimalist SOTA LaTeX OCR model which... |
|
Emerging |
| 408 |
rxn4chemistry/rxn-onmt-models
Training of OpenNMT-based RXN models |
|
Emerging |
| 409 |
young-geng/scalax
A simple library for scaling up JAX programs |
|
Emerging |
| 410 |
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data |
|
Emerging |
| 411 |
VectorInstitute/vector-inference
Efficient LLM inference on Slurm clusters. |
|
Emerging |
| 412 |
IbrahimSobh/llms
Large Language Models: In this repository Language models are introduced... |
|
Emerging |
| 413 |
bobazooba/xllm
🦖 X—LLM: Cutting Edge & Easy LLM Finetuning |
|
Emerging |
| 414 |
ashishpatel26/LLM-Finetuning
LLM Finetuning with peft |
|
Emerging |
| 415 |
PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data),... |
|
Emerging |
| 416 |
Leeroo-AI/mergoo
A library for easily merging multiple LLM experts, and efficiently train the... |
|
Emerging |
| 417 |
fla-org/flame
🔥 A minimal training framework for scaling FLA models |
|
Emerging |
| 418 |
tensorops/TransformerX
Flexible Python library providing building blocks (layers) for reproducible... |
|
Emerging |
| 419 |
Denis2054/Transformers-for-NLP-2nd-Edition
Transformer models from BERT to GPT-4, environments from Hugging Face to... |
|
Emerging |
| 420 |
pytorch/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile |
|
Emerging |
| 421 |
inclusionAI/asystem-awex
A high-performance RL training-inference weight synchronization framework,... |
|
Emerging |
| 422 |
kennethleungty/Llama-2-Open-Source-LLM-CPU-Inference
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A |
|
Emerging |
| 423 |
kyegomez/PALI3
Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS:... |
|
Emerging |
| 424 |
kyegomez/GPT4o
Community Open Source Implementation of GPT4o in PyTorch |
|
Emerging |
| 425 |
kyegomez/LIMoE
Implementation of the "the first large-scale multimodal mixture of experts... |
|
Emerging |
| 426 |
ai-forever/ru-gpts
Russian GPT3 models. |
|
Emerging |
| 427 |
gluonfield/enchanted
Enchanted is iOS and macOS app for chatting with private self hosted... |
|
Emerging |
| 428 |
0hq/WebGPT
Run GPT model on the browser with WebGPU. An implementation of GPT inference... |
|
Emerging |
| 429 |
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from... |
|
Emerging |
| 430 |
FMInference/FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios. |
|
Emerging |
| 431 |
grammarly/gector
Official implementation of the papers "GECToR – Grammatical Error... |
|
Emerging |
| 432 |
NLPOptimize/flash-tokenizer
EFFICIENT AND OPTIMIZED TOKENIZER ENGINE FOR LLM INFERENCE SERVING |
|
Emerging |
| 433 |
cdqa-suite/cdQA
⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System. |
|
Emerging |
| 434 |
bshao001/ChatLearner
A chatbot implemented in TensorFlow based on the seq2seq model, with certain... |
|
Emerging |
| 435 |
OscarKjell/text
Using Transformers from HuggingFace in R |
|
Emerging |
| 436 |
Nicolepcx/transformers-the-definitive-guide
This is the official repository for the book Transformers - The Definitive Guide |
|
Emerging |
| 437 |
pszemraj/textsum
CLI & Python API to easily summarize text-based files with transformers |
|
Emerging |
| 438 |
jeya-maria-jose/Medical-Transformer
Official Pytorch Code for "Medical Transformer: Gated Axial-Attention for... |
|
Emerging |
| 439 |
showlab/Show-o
[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer... |
|
Emerging |
| 440 |
monologg/KoELECTRA
Pretrained ELECTRA Model for Korean |
|
Emerging |
| 441 |
Mann1988/awesome-claude-skills
📊 Explore high-quality Claude skills focused on business analysis and... |
|
Emerging |
| 442 |
cure-lab/LTSF-Linear
[AAAI-23 Oral] Official implementation of the paper "Are Transformers... |
|
Emerging |
| 443 |
monologg/JointBERT
Pytorch implementation of JointBERT: "BERT for Joint Intent Classification... |
|
Emerging |
| 444 |
vitoplantamura/OnnxStream
Lightweight inference library for ONNX files, written in C++. It can run... |
|
Emerging |
| 445 |
chanind/frame-semantic-transformer
Frame Semantic Parser based on T5 and FrameNet |
|
Emerging |
| 446 |
tanyuqian/redco
NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A... |
|
Emerging |
| 447 |
lonePatient/Bert-Multi-Label-Text-Classification
This repo contains a PyTorch implementation of a pretrained BERT model for... |
|
Emerging |
| 448 |
pengzhangzhi/Open-dLLM
Open diffusion language model for code generation — releasing pretraining,... |
|
Emerging |
| 449 |
shreyansh26/Annotated-ML-Papers
Annotations of the interesting ML papers I read |
|
Emerging |
| 450 |
ikergarcia1996/Easy-Translate
Easy-Translate is a script for translating large text files with a SINGLE... |
|
Emerging |
| 451 |
daviddaytw/react-native-transformers
Run local LLM from Huggingface in React-Native or Expo using onnxruntime. |
|
Emerging |
| 452 |
bytedance/video-SALMONN-2
video-SALMONN 2 is a powerful audio-visual large language model (LLM) that... |
|
Emerging |
| 453 |
Rishit-dagli/Fast-Transformer
An implementation of Additive Attention |
|
Emerging |
| 454 |
olivkoch/nano-trm
An implementation of Tiny Recursive Models (TRM) |
|
Emerging |
| 455 |
kmeng01/rome
Locating and editing factual associations in GPT (NeurIPS 2022) |
|
Emerging |
| 456 |
GURPREETKAURJETHRA/END-TO-END-GENERATIVE-AI-PROJECTS
End to End Generative AI Industry Projects on LLM Models with... |
|
Emerging |
| 457 |
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models.... |
|
Emerging |
| 458 |
abhimishra91/transformers-tutorials
Github repo with tutorials to fine tune transformers for diff NLP tasks |
|
Emerging |
| 459 |
tensorgi/TPA
[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6)... |
|
Emerging |
| 460 |
Gen-Verse/dLLM-RL
[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for... |
|
Emerging |
| 461 |
tylerelyt/LLM-Workshop
🌟 Learn Large Language Model development through hands-on projects and... |
|
Emerging |
| 462 |
kyegomez/LongNet
Implementation of plug in and play Attention from "LongNet: Scaling... |
|
Emerging |
| 463 |
rasbt/LLM-workshop-2024
A 4-hour coding workshop to understand how LLMs are implemented and used |
|
Emerging |
| 464 |
Rishit-dagli/Perceiver
Implementation of Perceiver, General Perception with Iterative Attention |
|
Emerging |
| 465 |
polakowo/gpt2bot
Your new Telegram buddy powered by transformers |
|
Emerging |
| 466 |
willyfh/graph-transformer
An unofficial implementation of Graph Transformer (Masked Label Prediction:... |
|
Emerging |
| 467 |
jina-ai/rungpt
An open-source cloud-native of large multi-modal models (LMMs) serving framework. |
|
Emerging |
| 468 |
analyticalrohit/llms-from-scratch
Build a ChatGPT like LLM from scratch in PyTorch, explained step by step. |
|
Emerging |
| 469 |
cruiseresearchgroup/SensorLLM
[EMNLP 2025] Official implementation of "SensorLLM: Aligning Large Language... |
|
Emerging |
| 470 |
camenduru/text-generation-webui-colab
A colab gradio web UI for running Large Language Models |
|
Emerging |
| 471 |
salesforce/TransmogrifAI
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building... |
|
Emerging |
| 472 |
Tzohar/PassLLM
World's most accurate password guessing AI tool. A PyTorch implementation of... |
|
Emerging |
| 473 |
bbruceyuan/LLMs-Zero-to-Hero
从无名小卒到大模型(LLM)大英雄~ 欢迎关注后续!!! |
|
Emerging |
| 474 |
sinanuozdemir/oreilly-llm-rl-alignment
This training offers an intensive exploration into the frontier of... |
|
Emerging |
| 475 |
Tencent-Hunyuan/GradLoc
Implementation of GradLoc from the Tencent Hunyuan blog "Stabilizing RLVR... |
|
Emerging |
| 476 |
uber-research/PPLM
Plug and Play Language Model implementation. Allows to steer topic and... |
|
Emerging |
| 477 |
SamsungSAILMontreal/nino
Code for "Accelerating Training with Neuron Interaction and Nowcasting... |
|
Emerging |
| 478 |
ml4fp/2025-lbnl
ML4FP 2025: notebooks used for the Machine Learning for Fundamental Physics... |
|
Emerging |
| 479 |
MagedSaeed/generate-sequences
A python package made to generate sequences (greedy and beam-search) from... |
|
Emerging |
| 480 |
AliHaiderAhmad001/GPT-from-Scratch-with-Tensorflow
Implementation for "Improving Language Understanding by Generative... |
|
Emerging |
| 481 |
EleutherAI/knowledge-neurons
A library for finding knowledge neurons in pretrained transformer models. |
|
Emerging |
| 482 |
microsoft/rat-sql
A relation-aware semantic parsing model from English to SQL |
|
Emerging |
| 483 |
adrienpetralia/NILMFormer
[KDD 2025] NILMFormer: A Sequence-To-Sequence Non-Stationarity Aware... |
|
Emerging |
| 484 |
kenhktsui/anyclassifier
One Line To Build Zero-Data Classifiers in Minutes |
|
Emerging |
| 485 |
huggingface/transformers-bloom-inference
Fast Inference Solutions for BLOOM |
|
Emerging |
| 486 |
sammcj/ingest
Parse files (e.g. code repos) and websites to clipboard or a file for... |
|
Emerging |
| 487 |
JIA-Lab-research/MGM-Omni
MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech |
|
Emerging |
| 488 |
backprop-ai/backprop
Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models. |
|
Emerging |
| 489 |
Gleghorn-Lab/Protify
Low code molecular property prediction |
|
Emerging |
| 490 |
alephpi/Texo
A minimalist SOTA LaTeX OCR model with only 20M parameters, running in... |
|
Emerging |
| 491 |
gordicaleksa/pytorch-original-transformer
My implementation of the original transformer model (Vaswani et al.). I've... |
|
Emerging |
| 492 |
EfficientMoE/MoE-Infinity
PyTorch library for cost-effective, fast and easy serving of MoE models. |
|
Emerging |
| 493 |
r2d4/rellm
Exact structure out of any language model completion. |
|
Emerging |
| 494 |
hoangsonww/Spot-the-Scam-AI-Job-Fraud
🎒 An AI/ML-powered, full-stack job-posting fraud copilot delivering... |
|
Emerging |
| 495 |
appvision-ai/fast-bert
Super easy library for BERT based NLP models |
|
Emerging |
| 496 |
MDGrey33/pyvisionai
The PyVisionAI Official Repo |
|
Emerging |
| 497 |
LM-Kit/lm-kit-net-samples
.NET samples for LM-Kit.NET |
|
Emerging |
| 498 |
laelhalawani/gguf_modeldb
A quick and optimized solution to manage llama based gguf quantized models,... |
|
Emerging |
| 499 |
KRR-Oxford/HierarchyTransformers
Language Models as Hierarchy Encoders |
|
Emerging |
| 500 |
gitkaz/mlx_gguf_server
This is a FastAPI based LLM server. Load multiple LLM models (MLX or... |
|
Emerging |