All Transformer Models

6,429 models ranked by quality score · Page 7 of 65

Showing 601–700 of 6,429
# Model Score Tier
601 young-geng/EasyLM

Large language models (LLMs) made easy. EasyLM is a one-stop solution for...

46
Emerging
602 ItsPi3141/alpaca-electron

The simplest way to run Alpaca (and other LLaMA-based local LLMs) on your...

46
Emerging
603 MahmoudWahdan/dialog-nlu

TensorFlow and Keras implementation of state-of-the-art research in...

46
Emerging
604 WangRongsheng/CareGPT

🌞 CareGPT...

46
Emerging
605 FoundationVision/Liquid

(Accepted by IJCV) Liquid: Language Models are Scalable and Unified...

46
Emerging
606 Chongjie-Si/Subspace-Tuning

A generalized framework for subspace tuning methods in parameter efficient...

46
Emerging
607 xNul/chat-llama-discord-bot

A Discord Bot for chatting with LLaMA, Vicuna, Alpaca, MPT, or any other...

46
Emerging
608 replit/ReplitLM

Inference code and configs for the ReplitLM model family

46
Emerging
609 LianjiaTech/BELLE

BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational LLM)

46
Emerging
610 SCIR-HI/Huatuo-Llama-Med-Chinese

Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large...

46
Emerging
611 DAMO-NLP-SG/Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language...

46
Emerging
612 THU-SI/Spatial-MLLM

[NeurIPS 2025] Official implementation of Spatial-MLLM: Boosting MLLM...

46
Emerging
613 Paranioar/Awesome_Matching_Pretraining_Transfering

The Paper List of Large Multi-Modality Model (Perception, Generation,...

46
Emerging
614 AutoGPTQ/AutoGPTQ

An easy-to-use LLM quantization package with user-friendly APIs, based on...

46
Emerging
615 zai-org/CogView

Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView:...

46
Emerging
616 deepreinforce-ai/CUDA-L2

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through...

46
Emerging
617 bytedance/byteir

A model compilation solution for various hardware

46
Emerging
618 KB-AI-Research/KB-ALBERT

A Korean ALBERT model specialized for the economics/finance domain, provided by KB Kookmin Bank

46
Emerging
619 skylight-org/sparse-attention-hub

Advancing the frontier of efficient AI

46
Emerging
620 intel/intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA...

46
Emerging
621 kmeng01/memit

Mass-editing thousands of facts into a transformer memory (ICLR 2023)

46
Emerging
622 voidful/TFkit

🤖📇 Handling multiple NLP tasks in one pipeline

46
Emerging
623 dvmazur/mixtral-offloading

Run Mixtral-8x7B models in Colab or consumer desktops

46
Emerging
624 Cognitive-AI-Systems/MAPF-GPT-DDG

[IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding...

46
Emerging
625 JIA-Lab-research/LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

46
Emerging
626 j-min/VL-T5

PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)

46
Emerging
627 HumanSignal/label-studio-transformers

Label data using HuggingFace's transformers and automatically get a...

46
Emerging
628 bradyz/cross_view_transformers

Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral)

46
Emerging
629 OctoberChang/X-Transformer

X-Transformer: Taming Pretrained Transformers for eXtreme Multi-label Text...

46
Emerging
630 synacktraa/tool-parse

Making LLM Tool-Calling Simpler.

46
Emerging
631 huggingface/optimum-graphcore

Blazing fast training of 🤗 Transformers on Graphcore IPUs

46
Emerging
632 Czi24/Awesome-MLLM-LLM-Colab

Happy experimenting with MLLM and LLM models!

46
Emerging
633 yuanzhoulvpi2017/quick_sentence_transformers

sentence-transformers to ONNX: faster inference for SBERT models

46
Emerging
634 naru-project/naru

Neural Relation Understanding: neural cardinality estimators for tabular data

46
Emerging
635 quantium-ai/research

Research experiments exploring uncommon quant techniques.

46
Emerging
636 patil-suraj/onnx_transformers

Accelerated NLP pipelines for fast inference on CPU. Built with Transformers...

46
Emerging
637 LowinLi/fastgpt

⚡ boost inference speed of GPT models in transformers by onnxruntime

46
Emerging
638 AviSoori1x/makeMoE

From scratch implementation of a sparse mixture of experts language model...

46
Emerging
639 chaitjo/learning-tsp

Code for the paper 'Learning TSP Requires Rethinking Generalization' (CP 2021)

46
Emerging
640 tintn/vision-transformer-from-scratch

A Simplified PyTorch Implementation of Vision Transformer (ViT)

46
Emerging
641 icon-lab/ResViT

Official Implementation of ResViT: Residual Vision Transformers for...

46
Emerging
642 qubvel/transformers-notebooks

Inference and fine-tuning examples for vision models from 🤗 Transformers

46
Emerging
643 davidiommi/Pytorch--3D-Medical-Images-Segmentation--SALMON

Segmentation deep learning ALgorithm based on MONai toolbox: single and...

46
Emerging
644 dddzg/up-detr

[TPAMI 2022 & CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object...

46
Emerging
645 ai4co/routefinder

[TMLR 2025 + ICML 2024 FM-Wild Oral] RouteFinder: Towards Foundation Models...

46
Emerging
646 jmisilo/clip-gpt-captioning

CLIPxGPT Captioner is an image captioning model based on OpenAI's CLIP and GPT-2.

46
Emerging
647 THUDM/ProteinLM

Protein Language Model

46
Emerging
648 USC-FORTIS/AD-LLM

[ACL Findings 2025] A benchmark for anomaly detection using large language...

46
Emerging
649 deveix/react-native-apple-llm

React Native Apple LLM plugin using Foundation Models

46
Emerging
650 Emmi-AI/noether

Deep-learning framework for Engineering AI. Built on transformer building...

46
Emerging
651 KristiyanVachev/Leaf-Question-Generation

Easy to use and understand multiple-choice question generation algorithm...

46
Emerging
652 thu-nics/MoA

[CoLM'25] The official implementation of the paper

46
Emerging
653 Graphlet-AI/eridu

Deep fuzzy matching people and company names for multilingual entity...

46
Emerging
654 cli99/llm-analysis

Latency and Memory Analysis of Transformer Models for Training and Inference

46
Emerging
655 mbzuai-oryx/LLMVoX

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

46
Emerging
656 inboxpraveen/LLM-Minutes-of-Meeting

🎤📄 An innovative tool that transforms audio or video files into text...

46
Emerging
657 qingsongedu/time-series-transformers-review

A professionally curated list of awesome resources (paper, code, data, etc.)...

46
Emerging
658 AIoT-MLSys-Lab/SVD-LLM

[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2

46
Emerging
659 RLHFlow/RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

46
Emerging
660 sinanuozdemir/oreilly-optimizing-llms

Optimizing LLMs with Fine-Tuning and Prompt Engineering

46
Emerging
661 verifai/multiLLM

🚀 Invoke multiple large language models concurrently and rank the results....

46
Emerging
662 FudanDISC/DISC-LawLLM

[Chinese legal LLM] DISC-LawLLM: an intelligent legal system powered by large language...

46
Emerging
663 mit-han-lab/lite-transformer

[ICLR 2020] Lite Transformer with Long-Short Range Attention

46
Emerging
664 TigerResearch/TigerBot

TigerBot: A multi-language multi-task LLM

45
Emerging
665 zhvng/open-musiclm

Implementation of MusicLM, a text to music model published by Google...

45
Emerging
666 FareedKhan-dev/train-llama4

Building LLaMA 4 MoE from Scratch

45
Emerging
667 Deep-Spark/DeepSparkInference

DeepSparkInference has selected 216 inference models of both small and large...

45
Emerging
668 FasterDecoding/Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

45
Emerging
669 kyegomez/PALM-E

Implementation of "PaLM-E: An Embodied Multimodal Language Model"

45
Emerging
670 hiyouga/Dual-Contrastive-Learning

Code for our paper "Dual Contrastive Learning: Text Classification via...

45
Emerging
671 Wang-ML-Lab/bayesian-peft

Bayesian Low-Rank Adaptation of LLMs: BLoB [NeurIPS 2024] and TFB [NeurIPS 2025]

45
Emerging
672 imoneoi/openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

45
Emerging
673 lyuchenyang/Macaw-LLM

Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text...

45
Emerging
674 InternLM/SIM-CoT

[ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit...

45
Emerging
675 X-PLUG/mPLUG-Owl

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

45
Emerging
676 FairyFali/SLMs-Survey

Survey of Small Language Models from Penn State, ...

45
Emerging
677 gabeur/mmt

Multi-Modal Transformer for Video Retrieval

45
Emerging
678 domschl/HuggingFaceGuidedTourForMac

A guided tour on how to use HuggingFace large language models on Macs with...

45
Emerging
679 danielzuegner/code-transformer

Implementation of the paper "Language-agnostic representation learning of...

45
Emerging
680 jxiw/MambaInLlama

[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and...

45
Emerging
681 snwfdhmp/llm

Use any LLM from the command line.

45
Emerging
682 IBM/regression-transformer

Regression Transformer (2023; Nature Machine Intelligence)

45
Emerging
683 ashishpatel26/LLM-Finetuning

LLM Finetuning with peft

45
Emerging
684 JIA-Lab-research/LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

45
Emerging
685 deepglint/unicom

Large-Scale Visual Representation Model

45
Emerging
686 QData/C-Tran

General Multi-label Image Classification with Transformers

45
Emerging
687 VarunGumma/IndicTransToolkit

A simple, consistent and extendable toolkit for IndicTrans2. (Pypi:...

45
Emerging
688 DarshanDeshpande/jax-models

Unofficial JAX implementations of deep learning research papers

45
Emerging
689 THUDM/LongBench

LongBench v2 and LongBench (ACL '25 & '24)

45
Emerging
690 marella/ctransformers

Python bindings for the Transformer models implemented in C/C++ using GGML library.

45
Emerging
691 microsoft/LLF-Bench

A benchmark for evaluating learning agents based on just language feedback

45
Emerging
692 PhoebusSi/Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data),...

45
Emerging
693 sobelio/llm-chain

`llm-chain` is a powerful Rust crate for building chains in large language...

45
Emerging
694 open-mmlab/Multimodal-GPT

Multimodal-GPT

45
Emerging
695 rxn4chemistry/rxn-onmt-models

Training of OpenNMT-based RXN models

45
Emerging
696 Yangyi-Chen/Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record...

45
Emerging
697 donderom/llm4s

Scala 3 bindings for llama.cpp 🦙

45
Emerging
698 YJiangcm/FollowBench

[ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following...

45
Emerging
699 rishikksh20/convolution-vision-transformers

PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers

45
Emerging
700 RWKV/rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

45
Emerging