All Transformer Models

6,427 models ranked by quality score · Page 5 of 65

Showing 401–500 of 6,427
# Model Score Tier
401 microsoft/vidur

A large-scale simulation framework for LLM inference

45
Emerging
402 facebookresearch/LayerSkip

Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative...

45
Emerging
403 yuriwa/crewai-sheets-ui

Use google sheets as a gui for crewAI

45
Emerging
404 FasterDecoding/Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

45
Emerging
405 yoshoku/llama_cpp.rb

llama_cpp.rb provides Ruby bindings for llama.cpp

45
Emerging
406 riyanshibohra/TuneKit

Upload your data → Get a fine-tuned SLM. Free.

45
Emerging
407 alephpi/Texo-web

The web application for Texo, a minimalist SOTA LaTeX OCR model which...

45
Emerging
408 rxn4chemistry/rxn-onmt-models

Training of OpenNMT-based RXN models

45
Emerging
409 young-geng/scalax

A simple library for scaling up JAX programs

45
Emerging
410 imoneoi/openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

45
Emerging
411 VectorInstitute/vector-inference

Efficient LLM inference on Slurm clusters.

45
Emerging
412 IbrahimSobh/llms

Large Language Models: In this repository Language models are introduced...

45
Emerging
413 bobazooba/xllm

🦖 X—LLM: Cutting Edge & Easy LLM Finetuning

45
Emerging
414 ashishpatel26/LLM-Finetuning

LLM Finetuning with peft

45
Emerging
415 PhoebusSi/Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data),...

45
Emerging
416 Leeroo-AI/mergoo

A library for easily merging multiple LLM experts, and efficiently train the...

45
Emerging
417 fla-org/flame

🔥 A minimal training framework for scaling FLA models

45
Emerging
418 tensorops/TransformerX

Flexible Python library providing building blocks (layers) for reproducible...

44
Emerging
419 Denis2054/Transformers-for-NLP-2nd-Edition

Transformer models from BERT to GPT-4, environments from Hugging Face to...

44
Emerging
420 pytorch/torchchat

Run PyTorch LLMs locally on servers, desktop and mobile

44
Emerging
421 inclusionAI/asystem-awex

A high-performance RL training-inference weight synchronization framework,...

44
Emerging
422 kennethleungty/Llama-2-Open-Source-LLM-CPU-Inference

Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A

44
Emerging
423 kyegomez/PALI3

Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS:...

44
Emerging
424 kyegomez/GPT4o

Community Open Source Implementation of GPT4o in PyTorch

44
Emerging
425 kyegomez/LIMoE

Implementation of the "the first large-scale multimodal mixture of experts...

44
Emerging
426 ai-forever/ru-gpts

Russian GPT3 models.

44
Emerging
427 gluonfield/enchanted

Enchanted is iOS and macOS app for chatting with private self hosted...

44
Emerging
428 0hq/WebGPT

Run GPT model on the browser with WebGPU. An implementation of GPT inference...

44
Emerging
429 PKU-Alignment/safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from...

44
Emerging
430 FMInference/FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.

44
Emerging
431 grammarly/gector

Official implementation of the papers "GECToR – Grammatical Error...

44
Emerging
432 NLPOptimize/flash-tokenizer

EFFICIENT AND OPTIMIZED TOKENIZER ENGINE FOR LLM INFERENCE SERVING

44
Emerging
433 cdqa-suite/cdQA

⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.

44
Emerging
434 bshao001/ChatLearner

A chatbot implemented in TensorFlow based on the seq2seq model, with certain...

44
Emerging
435 OscarKjell/text

Using Transformers from HuggingFace in R

44
Emerging
436 Nicolepcx/transformers-the-definitive-guide

This is the official repository for the book Transformers - The Definitive Guide

44
Emerging
437 pszemraj/textsum

CLI & Python API to easily summarize text-based files with transformers

44
Emerging
438 jeya-maria-jose/Medical-Transformer

Official Pytorch Code for "Medical Transformer: Gated Axial-Attention for...

44
Emerging
439 showlab/Show-o

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer...

44
Emerging
440 monologg/KoELECTRA

Pretrained ELECTRA Model for Korean

44
Emerging
441 Mann1988/awesome-claude-skills

📊 Explore high-quality Claude skills focused on business analysis and...

44
Emerging
442 cure-lab/LTSF-Linear

[AAAI-23 Oral] Official implementation of the paper "Are Transformers...

44
Emerging
443 monologg/JointBERT

Pytorch implementation of JointBERT: "BERT for Joint Intent Classification...

44
Emerging
444 vitoplantamura/OnnxStream

Lightweight inference library for ONNX files, written in C++. It can run...

44
Emerging
445 chanind/frame-semantic-transformer

Frame Semantic Parser based on T5 and FrameNet

44
Emerging
446 tanyuqian/redco

NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A...

44
Emerging
447 lonePatient/Bert-Multi-Label-Text-Classification

This repo contains a PyTorch implementation of a pretrained BERT model for...

44
Emerging
448 pengzhangzhi/Open-dLLM

Open diffusion language model for code generation — releasing pretraining,...

44
Emerging
449 shreyansh26/Annotated-ML-Papers

Annotations of the interesting ML papers I read

44
Emerging
450 ikergarcia1996/Easy-Translate

Easy-Translate is a script for translating large text files with a SINGLE...

44
Emerging
451 daviddaytw/react-native-transformers

Run local LLM from Huggingface in React-Native or Expo using onnxruntime.

44
Emerging
452 bytedance/video-SALMONN-2

video-SALMONN 2 is a powerful audio-visual large language model (LLM) that...

44
Emerging
453 Rishit-dagli/Fast-Transformer

An implementation of Additive Attention

44
Emerging
454 olivkoch/nano-trm

An implementation of Tiny Recursive Models (TRM)

44
Emerging
455 kmeng01/rome

Locating and editing factual associations in GPT (NeurIPS 2022)

44
Emerging
456 GURPREETKAURJETHRA/END-TO-END-GENERATIVE-AI-PROJECTS

End to End Generative AI Industry Projects on LLM Models with...

44
Emerging
457 tatsu-lab/alpaca_eval

An automatic evaluator for instruction-following language models....

44
Emerging
458 abhimishra91/transformers-tutorials

Github repo with tutorials to fine tune transformers for diff NLP tasks

44
Emerging
459 tensorgi/TPA

[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6)...

44
Emerging
460 Gen-Verse/dLLM-RL

[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for...

44
Emerging
461 tylerelyt/LLM-Workshop

🌟 Learn Large Language Model development through hands-on projects and...

44
Emerging
462 kyegomez/LongNet

Implementation of plug in and play Attention from "LongNet: Scaling...

44
Emerging
463 rasbt/LLM-workshop-2024

A 4-hour coding workshop to understand how LLMs are implemented and used

44
Emerging
464 Rishit-dagli/Perceiver

Implementation of Perceiver, General Perception with Iterative Attention

43
Emerging
465 polakowo/gpt2bot

Your new Telegram buddy powered by transformers

43
Emerging
466 willyfh/graph-transformer

An unofficial implementation of Graph Transformer (Masked Label Prediction:...

43
Emerging
467 jina-ai/rungpt

An open-source cloud-native of large multi-modal models (LMMs) serving framework.

43
Emerging
468 analyticalrohit/llms-from-scratch

Build a ChatGPT like LLM from scratch in PyTorch, explained step by step.

43
Emerging
469 cruiseresearchgroup/SensorLLM

[EMNLP 2025] Official implementation of "SensorLLM: Aligning Large Language...

43
Emerging
470 camenduru/text-generation-webui-colab

A colab gradio web UI for running Large Language Models

43
Emerging
471 salesforce/TransmogrifAI

TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building...

43
Emerging
472 Tzohar/PassLLM

World's most accurate password guessing AI tool. A PyTorch implementation of...

43
Emerging
473 bbruceyuan/LLMs-Zero-to-Hero

从无名小卒到大模型(LLM)大英雄~ 欢迎关注后续!!!

43
Emerging
474 sinanuozdemir/oreilly-llm-rl-alignment

This training offers an intensive exploration into the frontier of...

43
Emerging
475 Tencent-Hunyuan/GradLoc

Implementation of GradLoc from the Tencent Hunyuan blog "Stabilizing RLVR...

43
Emerging
476 uber-research/PPLM

Plug and Play Language Model implementation. Allows to steer topic and...

43
Emerging
477 SamsungSAILMontreal/nino

Code for "Accelerating Training with Neuron Interaction and Nowcasting...

43
Emerging
478 ml4fp/2025-lbnl

ML4FP 2025: notebooks used for the Machine Learning for Fundamental Physics...

43
Emerging
479 MagedSaeed/generate-sequences

A python package made to generate sequences (greedy and beam-search) from...

43
Emerging
480 AliHaiderAhmad001/GPT-from-Scratch-with-Tensorflow

Implementation for "Improving Language Understanding by Generative...

43
Emerging
481 EleutherAI/knowledge-neurons

A library for finding knowledge neurons in pretrained transformer models.

43
Emerging
482 microsoft/rat-sql

A relation-aware semantic parsing model from English to SQL

43
Emerging
483 adrienpetralia/NILMFormer

[KDD 2025] NILMFormer: A Sequence-To-Sequence Non-Stationarity Aware...

43
Emerging
484 kenhktsui/anyclassifier

One Line To Build Zero-Data Classifiers in Minutes

43
Emerging
485 huggingface/transformers-bloom-inference

Fast Inference Solutions for BLOOM

43
Emerging
486 sammcj/ingest

Parse files (e.g. code repos) and websites to clipboard or a file for...

43
Emerging
487 JIA-Lab-research/MGM-Omni

MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech

43
Emerging
488 backprop-ai/backprop

Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.

43
Emerging
489 Gleghorn-Lab/Protify

Low code molecular property prediction

43
Emerging
490 alephpi/Texo

A minimalist SOTA LaTeX OCR model with only 20M parameters, running in...

43
Emerging
491 gordicaleksa/pytorch-original-transformer

My implementation of the original transformer model (Vaswani et al.). I've...

43
Emerging
492 EfficientMoE/MoE-Infinity

PyTorch library for cost-effective, fast and easy serving of MoE models.

43
Emerging
493 r2d4/rellm

Exact structure out of any language model completion.

43
Emerging
494 hoangsonww/Spot-the-Scam-AI-Job-Fraud

🎒 An AI/ML-powered, full-stack job-posting fraud copilot delivering...

43
Emerging
495 appvision-ai/fast-bert

Super easy library for BERT based NLP models

43
Emerging
496 MDGrey33/pyvisionai

The PyVisionAI Official Repo

43
Emerging
497 LM-Kit/lm-kit-net-samples

.NET samples for LM-Kit.NET

43
Emerging
498 laelhalawani/gguf_modeldb

A quick and optimized solution to manage llama based gguf quantized models,...

43
Emerging
499 KRR-Oxford/HierarchyTransformers

Language Models as Hierarchy Encoders

43
Emerging
500 gitkaz/mlx_gguf_server

This is a FastAPI based LLM server. Load multiple LLM models (MLX or...

43
Emerging
« Prev 1 2 3 4 5 6 7 63 64 65 Next »