All Transformer Models
6,429 models ranked by quality score · Page 7 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 601 |
young-geng/EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for... |
|
Emerging |
| 602 |
ItsPi3141/alpaca-electron
The simplest way to run Alpaca (and other LLaMA-based local LLMs) on your... |
|
Emerging |
| 603 |
MahmoudWahdan/dialog-nlu
Tensorflow and Keras implementation of the state of the art researches in... |
|
Emerging |
| 604 |
WangRongsheng/CareGPT
🌞 CareGPT... |
|
Emerging |
| 605 |
FoundationVision/Liquid
(Accepted by IJCV) Liquid: Language Models are Scalable and Unified... |
|
Emerging |
| 606 |
Chongjie-Si/Subspace-Tuning
A generalized framework for subspace tuning methods in parameter efficient... |
|
Emerging |
| 607 |
xNul/chat-llama-discord-bot
A Discord Bot for chatting with LLaMA, Vicuna, Alpaca, MPT, or any other... |
|
Emerging |
| 608 |
replit/ReplitLM
Inference code and configs for the ReplitLM model family |
|
Emerging |
| 609 |
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型) |
|
Emerging |
| 610 |
SCIR-HI/Huatuo-Llama-Med-Chinese
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large... |
|
Emerging |
| 611 |
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language... |
|
Emerging |
| 612 |
THU-SI/Spatial-MLLM
[NeurIPS 2025] Official implementation of Spatial-MLLM: Boosting MLLM... |
|
Emerging |
| 613 |
Paranioar/Awesome_Matching_Pretraining_Transfering
The Paper List of Large Multi-Modality Model (Perception, Generation,... |
|
Emerging |
| 614 |
AutoGPTQ/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on... |
|
Emerging |
| 615 |
zai-org/CogView
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView:... |
|
Emerging |
| 616 |
deepreinforce-ai/CUDA-L2
CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through... |
|
Emerging |
| 617 |
bytedance/byteir
A model compilation solution for various hardware |
|
Emerging |
| 618 |
KB-AI-Research/KB-ALBERT
KB국민은행에서 제공하는 경제/금융 도메인에 특화된 한국어 ALBERT 모델 |
|
Emerging |
| 619 |
skylight-org/sparse-attention-hub
Advancing the frontier of efficient AI |
|
Emerging |
| 620 |
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA... |
|
Emerging |
| 621 |
kmeng01/memit
Mass-editing thousands of facts into a transformer memory (ICLR 2023) |
|
Emerging |
| 622 |
voidful/TFkit
🤖📇 handling multiple nlp task in one pipeline |
|
Emerging |
| 623 |
dvmazur/mixtral-offloading
Run Mixtral-8x7B models in Colab or consumer desktops |
|
Emerging |
| 624 |
Cognitive-AI-Systems/MAPF-GPT-DDG
[IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding... |
|
Emerging |
| 625 |
JIA-Lab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral) |
|
Emerging |
| 626 |
j-min/VL-T5
PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021) |
|
Emerging |
| 627 |
HumanSignal/label-studio-transformers
Label data using HuggingFace's transformers and automatically get a... |
|
Emerging |
| 628 |
bradyz/cross_view_transformers
Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral) |
|
Emerging |
| 629 |
OctoberChang/X-Transformer
X-Transformer: Taming Pretrained Transformers for eXtreme Multi-label Text... |
|
Emerging |
| 630 |
synacktraa/tool-parse
Making LLM Tool-Calling Simpler. |
|
Emerging |
| 631 |
huggingface/optimum-graphcore
Blazing fast training of 🤗 Transformers on Graphcore IPUs |
|
Emerging |
| 632 |
Czi24/Awesome-MLLM-LLM-Colab
Happy experimenting with MLLM and LLM models! |
|
Emerging |
| 633 |
yuanzhoulvpi2017/quick_sentence_transformers
sentence-transformers to onnx 让sbert模型推理效率更快 |
|
Emerging |
| 634 |
naru-project/naru
Neural Relation Understanding: neural cardinality estimators for tabular data |
|
Emerging |
| 635 |
quantium-ai/research
Research experiments exploring uncommon quant techniques. |
|
Emerging |
| 636 |
patil-suraj/onnx_transformers
Accelerated NLP pipelines for fast inference on CPU. Built with Transformers... |
|
Emerging |
| 637 |
LowinLi/fastgpt
⚡ boost inference speed of GPT models in transformers by onnxruntime |
|
Emerging |
| 638 |
AviSoori1x/makeMoE
From scratch implementation of a sparse mixture of experts language model... |
|
Emerging |
| 639 |
chaitjo/learning-tsp
Code for the paper 'Learning TSP Requires Rethinking Generalization' (CP 2021) |
|
Emerging |
| 640 |
tintn/vision-transformer-from-scratch
A Simplified PyTorch Implementation of Vision Transformer (ViT) |
|
Emerging |
| 641 |
icon-lab/ResViT
Official Implementation of ResViT: Residual Vision Transformers for... |
|
Emerging |
| 642 |
qubvel/transformers-notebooks
Inference and fine-tuning examples for vision models from 🤗 Transformers |
|
Emerging |
| 643 |
davidiommi/Pytorch--3D-Medical-Images-Segmentation--SALMON
Segmentation deep learning ALgorithm based on MONai toolbox: single and... |
|
Emerging |
| 644 |
dddzg/up-detr
[TPAMI 2022 & CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object... |
|
Emerging |
| 645 |
ai4co/routefinder
[TMLR 2025 + ICML 2024 FM-Wild Oral] RouteFinder: Towards Foundation Models... |
|
Emerging |
| 646 |
jmisilo/clip-gpt-captioning
CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2. |
|
Emerging |
| 647 |
THUDM/ProteinLM
Protein Language Model |
|
Emerging |
| 648 |
USC-FORTIS/AD-LLM
[ACL Findings 2025] A benchmark for anomaly detection using large language... |
|
Emerging |
| 649 |
deveix/react-native-apple-llm
React Native Apple LLM plugin using Foundation Models |
|
Emerging |
| 650 |
Emmi-AI/noether
Deep-learning framework for Engineering AI. Built on transformer building... |
|
Emerging |
| 651 |
KristiyanVachev/Leaf-Question-Generation
Easy to use and understand multiple-choice question generation algorithm... |
|
Emerging |
| 652 |
thu-nics/MoA
[CoLM'25] The official implementation of the paper |
|
Emerging |
| 653 |
Graphlet-AI/eridu
Deep fuzzy matching people and company names for multilingual entity... |
|
Emerging |
| 654 |
cli99/llm-analysis
Latency and Memory Analysis of Transformer Models for Training and Inference |
|
Emerging |
| 655 |
mbzuai-oryx/LLMVoX
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM |
|
Emerging |
| 656 |
inboxpraveen/LLM-Minutes-of-Meeting
🎤📄 An innovative tool that transforms audio or video files into text... |
|
Emerging |
| 657 |
qingsongedu/time-series-transformers-review
A professionally curated list of awesome resources (paper, code, data, etc.)... |
|
Emerging |
| 658 |
AIoT-MLSys-Lab/SVD-LLM
[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2 |
|
Emerging |
| 659 |
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF. |
|
Emerging |
| 660 |
sinanuozdemir/oreilly-optimizing-llms
Optimizing LLMs with Fine-Tuning and Prompt Engineering |
|
Emerging |
| 661 |
verifai/multiLLM
🚀 Invoke multiple large language models concurrently and the rank results.... |
|
Emerging |
| 662 |
FudanDISC/DISC-LawLLM
[中文法律大模型] DISC-LawLLM: an intelligent legal system powered by large language... |
|
Emerging |
| 663 |
mit-han-lab/lite-transformer
[ICLR 2020] Lite Transformer with Long-Short Range Attention |
|
Emerging |
| 664 |
TigerResearch/TigerBot
TigerBot: A multi-language multi-task LLM |
|
Emerging |
| 665 |
zhvng/open-musiclm
Implementation of MusicLM, a text to music model published by Google... |
|
Emerging |
| 666 |
FareedKhan-dev/train-llama4
Building LLaMA 4 MoE from Scratch |
|
Emerging |
| 667 |
Deep-Spark/DeepSparkInference
DeepSparkInference has selected 216 inference models of both small and large... |
|
Emerging |
| 668 |
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads |
|
Emerging |
| 669 |
kyegomez/PALM-E
Implementation of "PaLM-E: An Embodied Multimodal Language Model" |
|
Emerging |
| 670 |
hiyouga/Dual-Contrastive-Learning
Code for our paper "Dual Contrastive Learning: Text Classification via... |
|
Emerging |
| 671 |
Wang-ML-Lab/bayesian-peft
Bayesian Low-Rank Adaptation of LLMs: BLoB [NeurIPS 2024] and TFB [NeurIPS 2025] |
|
Emerging |
| 672 |
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data |
|
Emerging |
| 673 |
lyuchenyang/Macaw-LLM
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text... |
|
Emerging |
| 674 |
InternLM/SIM-CoT
[ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit... |
|
Emerging |
| 675 |
X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family |
|
Emerging |
| 676 |
FairyFali/SLMs-Survey
Survey of Small Language Models from Penn State, ... |
|
Emerging |
| 677 |
gabeur/mmt
Multi-Modal Transformer for Video Retrieval |
|
Emerging |
| 678 |
domschl/HuggingFaceGuidedTourForMac
A guided tour on how to use HuggingFace large language models on Macs with... |
|
Emerging |
| 679 |
danielzuegner/code-transformer
Implementation of the paper "Language-agnostic representation learning of... |
|
Emerging |
| 680 |
jxiw/MambaInLlama
[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and... |
|
Emerging |
| 681 |
snwfdhmp/llm
Use any LLM from the command line. |
|
Emerging |
| 682 |
IBM/regression-transformer
Regression Transformer (2023; Nature Machine Intelligence) |
|
Emerging |
| 683 |
ashishpatel26/LLM-Finetuning
LLM Finetuning with peft |
|
Emerging |
| 684 |
JIA-Lab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model" |
|
Emerging |
| 685 |
deepglint/unicom
Large-Scale Visual Representation Model |
|
Emerging |
| 686 |
QData/C-Tran
General Multi-label Image Classification with Transformers |
|
Emerging |
| 687 |
VarunGumma/IndicTransToolkit
A simple, consistent and extendable toolkit for IndicTrans2. (Pypi:... |
|
Emerging |
| 688 |
DarshanDeshpande/jax-models
Unofficial JAX implementations of deep learning research papers |
|
Emerging |
| 689 |
THUDM/LongBench
LongBench v2 and LongBench (ACL 25'&24') |
|
Emerging |
| 690 |
marella/ctransformers
Python bindings for the Transformer models implemented in C/C++ using GGML library. |
|
Emerging |
| 691 |
microsoft/LLF-Bench
A benchmark for evaluating learning agents based on just language feedback |
|
Emerging |
| 692 |
PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data),... |
|
Emerging |
| 693 |
sobelio/llm-chain
`llm-chain` is a powerful rust crate for building chains in large language... |
|
Emerging |
| 694 |
open-mmlab/Multimodal-GPT
Multimodal-GPT |
|
Emerging |
| 695 |
rxn4chemistry/rxn-onmt-models
Training of OpenNMT-based RXN models |
|
Emerging |
| 696 |
Yangyi-Chen/Multimodal-AND-Large-Language-Models
Paper list about multimodal and large language models, only used to record... |
|
Emerging |
| 697 |
donderom/llm4s
Scala 3 bindings for llama.cpp 🦙 |
|
Emerging |
| 698 |
YJiangcm/FollowBench
[ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following... |
|
Emerging |
| 699 |
rishikksh20/convolution-vision-transformers
PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers |
|
Emerging |
| 700 |
RWKV/rwkv.cpp
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model |
|
Emerging |