All Transformer Models

6,429 models ranked by quality score · Page 4 of 65

Showing 301–400 of 6,429
# Model Score Tier
301 modelscope/easydistill

a toolkit on knowledge distillation for large language models

53
Established
302 potamides/DeTikZify

Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ.

53
Established
303 sovit-123/vision_transformers

Vision Transformers for image classification, image segmentation, and object...

53
Established
304 keith2018/TinyGPT

Tiny C++ LLM inference implementation from scratch

53
Established
305 Nicolepcx/Transformers-in-Action

This is the corresponding code for the book Transformers in Action

53
Established
306 guanwei49/LogLLM

LogLLM: Log-based Anomaly Detection Using Large Language Models (system log...

53
Established
307 tjake/Jlama

Jlama is a modern LLM inference engine for Java

53
Established
308 BioinfoMachineLearning/DeepInteract

A geometric deep learning framework (Geometric Transformers) for predicting...

53
Established
309 GeeeekExplorer/nano-vllm

Nano vLLM

53
Established
310 jax-ml/jax-llm-examples

Minimal yet performant LLM examples in pure JAX

53
Established
311 hscspring/hcgf

Humanable Chat Generative-model Fine-tuning | LLM fine-tuning

53
Established
312 MattyB95/Jabberjay

🦜 Synthetic Voice Detection

53
Established
313 tue-mps/eomt

[CVPR 2025 Highlight] Official code and models for Encoder-only Mask...

53
Established
314 kyegomez/zeta

Build high-performance AI models with modular building blocks

53
Established
315 SKTBrain/KoBERT

Korean BERT pre-trained cased (KoBERT)

53
Established
316 IbrahimSobh/llms

Large Language Models: In this repository Language models are introduced...

52
Established
317 CASE-Lab-UMD/LLM-Drop

The official implementation of the paper "Uncovering the Redundancy in...

52
Established
318 microsoft/vidur

A large-scale simulation framework for LLM inference

52
Established
319 fla-org/flame

🔥 A minimal training framework for scaling FLA models

52
Established
320 sb-ai-lab/RePlay

A Comprehensive Framework for Building End-to-End Recommendation Systems...

52
Established
321 zhenye234/LLaSA_training

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

52
Established
322 xrsrke/toolformer

Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools

52
Established
323 IBM/TabFormer

Code & Data for "Tabular Transformers for Modeling Multivariate Time Series"...

52
Established
324 facebookresearch/LayerSkip

Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative...

52
Established
325 sinanuozdemir/oreilly-hands-on-gpt-llm

Mastering the Art of Scalable and Efficient AI Model Deployment

52
Established
326 bobazooba/xllm

🦖 X—LLM: Cutting Edge & Easy LLM Finetuning

52
Established
327 VectorInstitute/vector-inference

Efficient LLM inference on Slurm clusters.

52
Established
328 oripress/AlgoTune

AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and...

52
Established
329 kyegomez/SingLoRA

This repository provides a minimal, single-file implementation of SingLoRA...

52
Established
330 alibaba/InferSim

A Lightweight LLM Inference Performance Simulator

52
Established
331 FareedKhan-dev/train-llm-from-scratch

A straightforward method for training your LLM, from downloading data to...

52
Established
332 foundation-model-stack/fms-fsdp

🚀 Efficiently (pre)training foundation models with native PyTorch features,...

52
Established
333 fluxions-ai/vui

100M parameter lightweight conversational text-to-speech model with breaths,...

52
Established
334 yoshoku/llama_cpp.rb

llama_cpp.rb provides Ruby bindings for llama.cpp

52
Established
335 TsinghuaC3I/MARTI

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

52
Established
336 Leeroo-AI/mergoo

A library for easily merging multiple LLM experts, and efficiently train the...

52
Established
337 thu-nics/C2C

[ICLR'26] The official code implementation for "Cache-to-Cache: Direct...

52
Established
338 thammegowda/nllb-serve

Meta's "No Language Left Behind" models served as web app and REST API

52
Established
339 yuriwa/crewai-sheets-ui

Use Google Sheets as a GUI for crewAI

52
Established
340 OpenVoiceOS/ovos-audio-transformer-plugin-ggwave

Data-over-sound plugin

52
Established
341 young-geng/scalax

A simple library for scaling up JAX programs

52
Established
342 Denis2054/Transformers-for-NLP-2nd-Edition

Transformer models from BERT to GPT-4, environments from Hugging Face to...

51
Established
343 kyegomez/LongNet

Implementation of plug-and-play attention from "LongNet: Scaling...

51
Established
344 NLPOptimize/flash-tokenizer

Efficient and optimized tokenizer engine for LLM inference serving

51
Established
345 kossisoroyce/timber

Ollama for classical ML models. AOT compiler that turns XGBoost, LightGBM,...

51
Established
346 grammarly/gector

Official implementation of the papers "GECToR – Grammatical Error...

51
Established
347 tanyuqian/redco

NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A...

51
Established
348 Nicolepcx/transformers-the-definitive-guide

This is the official repository for the book Transformers - The Definitive Guide

51
Established
349 monologg/KoELECTRA

Pretrained ELECTRA Model for Korean

51
Established
350 ikergarcia1996/Easy-Translate

Easy-Translate is a script for translating large text files with a SINGLE...

51
Established
351 kyegomez/LIMoE

Implementation of "the first large-scale multimodal mixture of experts...

51
Established
352 cdqa-suite/cdQA

⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.

51
Established
353 rasbt/LLM-workshop-2024

A 4-hour coding workshop to understand how LLMs are implemented and used

51
Established
354 AI-Hypercomputer/JetStream

JetStream is a throughput and memory optimized engine for LLM inference on...

51
Established
355 ai-forever/ru-gpts

Russian GPT3 models.

51
Established
356 fixie-ai/ultravox

A fast multimodal LLM for real-time voice

51
Established
357 tylerelyt/LLM-Workshop

🌟 Learn Large Language Model development through hands-on projects and...

51
Established
358 jadore801120/attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

51
Established
359 abhimishra91/transformers-tutorials

GitHub repo with tutorials to fine-tune transformers for different NLP tasks

51
Established
360 bshao001/ChatLearner

A chatbot implemented in TensorFlow based on the seq2seq model, with certain...

51
Established
361 alephpi/Texo-web

The web application for Texo, a minimalist SOTA LaTeX OCR model which...

51
Established
362 tensorgi/TPA

[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6)...

51
Established
363 monologg/JointBERT

PyTorch implementation of JointBERT: "BERT for Joint Intent Classification...

51
Established
364 kennethleungty/Llama-2-Open-Source-LLM-CPU-Inference

Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A

51
Established
365 cure-lab/LTSF-Linear

[AAAI-23 Oral] Official implementation of the paper "Are Transformers...

51
Established
366 vitoplantamura/OnnxStream

Lightweight inference library for ONNX files, written in C++. It can run...

51
Established
367 jeya-maria-jose/Medical-Transformer

Official PyTorch Code for "Medical Transformer: Gated Axial-Attention for...

51
Established
368 OscarKjell/text

Using Transformers from Hugging Face in R

51
Established
369 Rishit-dagli/Fast-Transformer

An implementation of Additive Attention

51
Established
370 kyegomez/PALI3

Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS:...

51
Established
371 GURPREETKAURJETHRA/END-TO-END-GENERATIVE-AI-PROJECTS

End to End Generative AI Industry Projects on LLM Models with...

51
Established
372 daviddaytw/react-native-transformers

Run local LLMs from Hugging Face in React Native or Expo using onnxruntime.

51
Established
373 symfony/ai-platform

PHP library for interacting with AI platform providers.

51
Established
374 helpmefindaname/transformer-smaller-training-vocab

Temporarily remove unused tokens during training to save RAM and improve speed.

51
Established
375 pszemraj/textsum

CLI & Python API to easily summarize text-based files with transformers

51
Established
376 shreyansh26/Annotated-ML-Papers

Annotations of the interesting ML papers I read

51
Established
377 lonePatient/Bert-Multi-Label-Text-Classification

This repo contains a PyTorch implementation of a pretrained BERT model for...

51
Established
378 NVlabs/OmniVinci

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and...

51
Established
379 kmeng01/rome

Locating and editing factual associations in GPT (NeurIPS 2022)

51
Established
380 showlab/Show-o

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer...

51
Established
381 CVHub520/X-AnyLabeling-Server

A Simple, Lightweight, and Extensible Serving Framework for X-AnyLabeling

51
Established
382 kyegomez/RT-X

PyTorch implementation of the models RT-1-X and RT-2-X from the paper: "Open...

51
Established
383 tensorops/TransformerX

Flexible Python library providing building blocks (layers) for reproducible...

51
Established
384 opendilab/LightRFT

LightRFT: Light, Efficient, Omni-modal & Reward-model Driven Reinforcement...

51
Established
385 PKU-Alignment/safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from...

51
Established
386 tatsu-lab/alpaca_eval

An automatic evaluator for instruction-following language models....

51
Established
387 chanind/frame-semantic-transformer

Frame Semantic Parser based on T5 and FrameNet

51
Established
388 EleutherAI/knowledge-neurons

A library for finding knowledge neurons in pretrained transformer models.

50
Established
389 pengzhangzhi/Open-dLLM

Open diffusion language model for code generation — releasing pretraining,...

50
Established
390 cruiseresearchgroup/SensorLLM

[EMNLP 2025] Official implementation of "SensorLLM: Aligning Large Language...

50
Established
391 alesanfra/toons

A high-performance TOON (Token Oriented Object Notation) parser and...

50
Established
392 kyegomez/SwitchTransformers

Implementation of Switch Transformers from the paper: "Switch Transformers:...

50
Established
393 MDGrey33/pyvisionai

The PyVisionAI Official Repo

50
Established
394 qcri/LLMeBench

Benchmarking Large Language Models

50
Established
395 BeRo1985/pasllm

PasLLM - LLM inference engine in Object Pascal (synced from my private work...

50
Established
396 ridgerchu/matmulfreellm

Implementation for MatMul-free LM.

50
Established
397 EfficientMoE/MoE-Infinity

PyTorch library for cost-effective, fast and easy serving of MoE models.

50
Established
398 HPAI-BSC/TuRTLe

TuRTLe: A Unified Evaluation of LLMs for RTL Generation 🐢 (MLCAD 2025)

50
Established
399 jina-ai/rungpt

An open-source, cloud-native serving framework for large multi-modal models (LMMs).

50
Established
400 Rishit-dagli/Perceiver

Implementation of Perceiver, General Perception with Iterative Attention

50
Established