All Transformer Models

6,427 models ranked by quality score · Page 3 of 65

Showing 201–300 of 6,427
# Model Score Tier
201 OpenVoiceOS/ovos-audio-transformer-plugin-ggwave

data over sound plugin

52
Established
202 ScrapeGraphAI/toonify

Toonify: Compact data format reducing LLM token usage by 30-60%

52
Established
203 PRIME-RL/TTRL

[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

52
Established
204 nerdai/llms-from-scratch-rs

A comprehensive Rust translation of the code from Sebastian Raschka's Build...

52
Established
205 avikumart/LLM-GenAI-Transformers-Notebooks

An repository containing all the LLM notebooks with tutorial and projects

52
Established
206 mgonzs13/llama_ros

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

52
Established
207 TharinduDR/TransQuest

Transformer based translation quality estimation

51
Established
208 jadore801120/attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

51
Established
209 PacktPublishing/Mastering-NLP-from-Foundations-to-LLMs

Mastering NLP from Foundations to LLMs, Published by Packt

51
Established
210 explosion/curated-transformers

🤖 A PyTorch library of curated Transformer models and their composable components

51
Established
211 ai-decentralized/BloomBee

Decentralized LLMs fine-tuning and inference with offloading

51
Established
212 SalesforceAIResearch/uni2ts

Unified Training of Universal Time Series Forecasting Transformers

51
Established
213 ServiceNow/TACTiS

TACTiS-2: Better, Faster, Simpler Attentional Copulas for Multivariate Time...

51
Established
214 fixie-ai/ultravox

A fast multimodal LLM for real-time voice

51
Established
215 helpmefindaname/transformer-smaller-training-vocab

Temporary remove unused tokens during training to save ram and speed.

51
Established
216 google/deepconsensus

DeepConsensus uses gap-aware sequence transformers to correct errors in...

50
Established
217 stanfordnlp/axbench

Stanford NLP Python library for benchmarking the utility of LLM...

50
Established
218 UKPLab/gpl

Powerful unsupervised domain adaptation method for dense retrieval. Requires...

50
Established
219 mindspore-lab/step_into_llm

MindSpore online courses: Step into LLM

50
Established
220 alesanfra/toons

A high-performance TOON (Token Oriented Object Notation) parser and...

50
Established
221 adithya-s-k/AI-Engineering.academy

Mastering Applied AI, One Concept at a Time

50
Established
222 jsksxs360/How-to-use-Transformers

Transformers 库快速入门教程

50
Established
223 huggingface/transformers.js-examples

A collection of 🤗 Transformers.js demos and example applications

50
Established
224 dvgodoy/FineTuningLLMs

Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with...

50
Established
225 moment-timeseries-foundation-model/moment

MOMENT: A Family of Open Time-series Foundation Models, ICML'24

50
Established
226 ridgerchu/matmulfreellm

Implementation for MatMul-free LM.

50
Established
227 Omid-Nejati/MedViTV2

MedViTV2: Medical Image Classification with KAN-Integrated Transformers and...

50
Established
228 minggnim/nlp-models

A repository for training transformer based models

50
Established
229 yjg30737/pyqt-openai

VividNode: Multi-purpose Text & Image Generation Desktop Chatbot (supporting...

50
Established
230 ruanchaves/hashformers

Accurate word segmentation for hashtags and text, powered by Transformers...

50
Established
231 serge-chat/serge

A web interface for chatting with Alpaca through llama.cpp. Fully...

50
Established
232 ggml-org/llama.vscode

VS Code extension for LLM-assisted code/text completion

50
Established
233 kyegomez/MambaTransformer

Integrating Mamba/SSMs with Transformer for Enhanced Long Context and...

50
Established
234 hyunwoongko/nanoRLHF

nanoRLHF: from-scratch journey into how LLMs and RLHF really work.

50
Established
235 SafeAILab/EAGLE

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and...

50
Established
236 tattn/LocalLLMClient

Swift package to run local LLMs on iOS, macOS, Linux

50
Established
237 Strvm/meta-ai-api

Llama 3 API 70B & 405B (MetaAI Reverse Engineered)

50
Established
238 higgsfield-ai/higgsfield

Fault-tolerant, highly scalable GPU orchestration, and a machine learning...

49
Emerging
239 iusztinpaul/hands-on-llms

🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training,...

49
Emerging
240 lucidrains/alphagenome

Implementation of AlphaGenome, Deepmind's updated genomic attention model

49
Emerging
241 IntelLabs/nlp-architect

A model library for exploring state-of-the-art deep learning topologies and...

49
Emerging
242 mukel/llama3.java

Practical Llama 3 inference in Java

49
Emerging
243 bodaay/HuggingFaceModelDownloader

Simple go utility to download HuggingFace Models and Datasets

49
Emerging
244 abelriboulot/onnxt5

Summarization, translation, sentiment-analysis, text-generation and more at...

49
Emerging
245 yuanzhoulvpi2017/zero_nlp

中文nlp解决方案(大模型、数据、模型、训练、推理)

49
Emerging
246 louisfb01/start-llms

A complete guide to start and improve your LLM skills in 2026 with little...

49
Emerging
247 intel/ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM,...

49
Emerging
248 KimMeen/Time-LLM

[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting...

49
Emerging
249 sapientinc/HRM

Hierarchical Reasoning Model Official Release

49
Emerging
250 CLUEbenchmark/CLUE

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets,...

49
Emerging
251 galilai-group/stable-pretraining

Reliable, minimal and scalable library for pretraining foundation and world models

49
Emerging
252 kossisoroyce/timber

Ollama for classical ML models. AOT compiler that turns XGBoost, LightGBM,...

49
Emerging
253 kyegomez/Jamba

PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"

49
Emerging
254 kyegomez/MambaByte

Implementation of MambaByte in "MambaByte: Token-free Selective State Space...

49
Emerging
255 maziyarpanahi/openmed

open-source healthcare ai

49
Emerging
256 DashyDashOrg/pandas-llm

Pandas-LLM

49
Emerging
257 AXERA-TECH/ax-llm

Explore LLM model deployment based on AXera's AI chips

49
Emerging
258 jhkchan/translategemma-cli

Local CLI for Google's TranslateGemma translation models with multi-platform...

49
Emerging
259 TIGER-AI-Lab/MMLU-Pro

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task...

49
Emerging
260 ZHZisZZ/dllm

dLLM: Simple Diffusion Language Modeling

49
Emerging
261 multimodal-art-projection/YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to...

49
Emerging
262 telekom/mltb2

Machine Learning Toolbox 2

49
Emerging
263 dbiir/UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

49
Emerging
264 kyegomez/LFM

An open source implementation of LFMs from Liquid AI: Liquid Foundation Models

49
Emerging
265 eth-sri/matharena

Evaluation of LLMs on latest math competitions

48
Emerging
266 ddh0/easy-llama

Python package wrapping llama.cpp for on-device LLM inference

48
Emerging
267 TIGER-AI-Lab/VLM2Vec

This repo contains the code for "VLM2Vec: Training Vision-Language Models...

48
Emerging
268 edwko/OuteTTS

Interface for OuteTTS models.

48
Emerging
269 DadaNanjesha/AI-Text-Humanizer-App

Transform AI-generated text into formal, human-like, and academic writing...

48
Emerging
270 UdbhavPrasad072300/Transformer-Implementations

Library - Vanilla, ViT, DeiT, BERT, GPT

48
Emerging
271 ymcui/Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

48
Emerging
272 Facico/Chinese-Vicuna

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model ——...

48
Emerging
273 ggml-org/llama.vim

Vim plugin for LLM-assisted code/text completion

48
Emerging
274 guinmoon/LLMFarm

llama and other large language models on iOS and MacOS offline using GGML library.

48
Emerging
275 lone-cloud/gerbil

A desktop app for running Large Language Models locally.

48
Emerging
276 tensorchord/modelz-llm

OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and...

48
Emerging
277 socialfoundations/folktexts

Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on...

48
Emerging
278 google-deepmind/long-form-factuality

Benchmarking long-form factuality in large language models. Original code...

48
Emerging
279 MadryLab/context-cite

Attribute (or cite) statements generated by LLMs back to in-context information.

48
Emerging
280 OFA-Sys/Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and...

48
Emerging
281 megagonlabs/ginza-transformers

Use custom tokenizers in spacy-transformers

48
Emerging
282 NVIDIA/FasterTransformer

Transformer related optimization, including BERT, GPT

48
Emerging
283 AdityaNG/kan-gpt

The PyTorch implementation of Generative Pre-trained Transformers (GPTs)...

48
Emerging
284 datawhalechina/llms-from-scratch-cn

仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理

48
Emerging
285 CASE-Lab-UMD/LLM-Drop

The official implementation of the paper "Uncovering the Redundancy in...

48
Emerging
286 autonomousvision/transfuser

[PAMI'23] TransFuser: Imitation with Transformer-Based Sensor Fusion for...

48
Emerging
287 yotambraun/APDTFlow

APDTFlow is a modern and extensible forecasting framework for time series...

48
Emerging
288 AI-Hypercomputer/JetStream

JetStream is a throughput and memory optimized engine for LLM inference on...

48
Emerging
289 kyegomez/attn_res

A clean, single-file PyTorch implementation of Attention Residuals (Kimi...

48
Emerging
290 BiomedSciAI/biomed-multi-omic

Build foundation model for RNA or DNA data

48
Emerging
291 mirpo/fastapi-gen

Build LLM-enabled FastAPI applications without build configuration.

48
Emerging
292 beehive-lab/GPULlama3.java

GPU-accelerated Llama3.java inference in pure Java using TornadoVM.

48
Emerging
293 MiniMax-AI/MiniMax-01

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model...

48
Emerging
294 belladoreai/llama3-tokenizer-js

JS tokenizer for LLaMA 3 and LLaMA 3.1

48
Emerging
295 NiuTrans/LaTeXTrans

A tool for translating the content of LaTeX documents into various other...

48
Emerging
296 LoicGrobol/zeldarose

Train transformer-based models.

48
Emerging
297 Kohulan/DECIMER-Image_Transformer

DECIMER Image Transformer is a deep-learning-based tool designed for...

48
Emerging
298 YerbaPage/LongCodeZip

LongCodeZip: Compress Long Context for Code Language Models [ASE2025]

48
Emerging
299 haizelabs/verdict

Inference-time scaling for LLMs-as-a-judge.

48
Emerging
300 zjunlp/EasyInstruct

[ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.

48
Emerging
« Prev 1 2 3 4 5 63 64 65 Next »