All Transformer Models
6,429 models ranked by quality score · Page 17 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 1601 |
cgtuebingen/ua3dscancomp
Latent Uncertainty-Aware Multi-View SDF Scan Completion |
|
Emerging |
| 1602 |
jackaduma/ChatGLM-LoRA-RLHF-PyTorch
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer... |
|
Emerging |
| 1603 |
HenryHZY/Awesome-Multimodal-LLM
Research Trends in LLM-guided Multimodal Learning. |
|
Emerging |
| 1604 |
SeekingDream/DyCodeEval
Official repository of the ICML2025 paper “Dynamic Benchmarking of Reasoning... |
|
Emerging |
| 1605 |
chenmozhijin/BSRoformer.cpp
GGML-based C++ inference for BS Roformer/Mel-Band-Roformer vocal separation... |
|
Emerging |
| 1606 |
Relaxed-System-Lab/Flash-Sparse-Attention
🚀🚀 Efficient implementations of Native Sparse Attention |
|
Emerging |
| 1607 |
gbaptista/ollama-ai
A Ruby gem for interacting with Ollama's API that allows you to run open... |
|
Emerging |
| 1608 |
TIGER-AI-Lab/LongICLBench
Code and Data for "Long-context LLMs Struggle with Long In-context Learning"... |
|
Emerging |
| 1609 |
surrey-nlp/NLP-2026
Labs for COM3029/COMM061 at University of Surrey |
|
Emerging |
| 1610 |
guxm2021/ALT_SpeechBrain
[ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription |
|
Emerging |
| 1611 |
c0sogi/llama-api
An OpenAI-like LLaMA inference API |
|
Emerging |
| 1612 |
bahree/helloLondon
Historical Language Model for London - A specialized LLM trained on... |
|
Emerging |
| 1613 |
gusye1234/llm-as-function
Embed your LLM into a python function |
|
Emerging |
| 1614 |
rese1f/aurora
[ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a... |
|
Emerging |
| 1615 |
jonrbates/turing
A PyTorch library for simulating Turing machines with neural networks, based... |
|
Emerging |
| 1616 |
jqtangust/Robust-R1
🔥🔥🔥[AAAI 2026 Oral] Official Implementation of Robust-R1: Degradation-Aware... |
|
Emerging |
| 1617 |
avocardio/Zicklein
Finetuning instruct-LLaMA on german datasets. |
|
Emerging |
| 1618 |
gotzmann/booster
Booster - open accelerator for LLM models. Better inference and debugging... |
|
Emerging |
| 1619 |
HOLYKEYZ/model-unfetter
The production engine for directional ablation. Unalign / remove models... |
|
Emerging |
| 1620 |
PaddlePaddle/PALM
a Fast, Flexible, Extensible and Easy-to-use NLP Large-scale Pretraining and... |
|
Emerging |
| 1621 |
m0dulo/InferSpore
🌱 A fully independent Large Language Model (LLM) inference engine, built... |
|
Emerging |
| 1622 |
Ratnesh-181998/python-ai-ml-libraries
A comprehensive Python AI/ML repository covering end-to-end workflows using... |
|
Emerging |
| 1623 |
AutonomicPerfectionist/PipeInfer
PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation |
|
Emerging |
| 1624 |
TIGER-AI-Lab/MAmmoTH
Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid... |
|
Emerging |
| 1625 |
declare-lab/LLM-PuzzleTest
This repository is maintained to release dataset and models for multimodal... |
|
Emerging |
| 1626 |
haiodo/oaitt
An OpenAI compatible transcriber using transformers and whisperx. |
|
Emerging |
| 1627 |
jeffreysijuntan/lloco
The official repo for "LLoCo: Learning Long Contexts Offline" |
|
Emerging |
| 1628 |
HiThink-Research/BizFinBench
A Business-Driven Real-World Financial Benchmark for Evaluating LLMs |
|
Emerging |
| 1629 |
MME-Benchmarks/Video-MME
✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark... |
|
Emerging |
| 1630 |
robinhad/kruk
Ukrainian instruction-tuned language models and datasets |
|
Emerging |
| 1631 |
BUAADreamer/Chinese-LLaVA-Med
中文医学多模态大模型 Large Chinese Language-and-Vision Assistant for BioMedicine |
|
Emerging |
| 1632 |
SingleZombie/LLSA
Official implementation of Log-linear Sparse Attention (LLSA). |
|
Emerging |
| 1633 |
JinjieNi/MegaDLMs
GPU-optimized framework for training diffusion language models at any scale.... |
|
Emerging |
| 1634 |
0x7o/text2keywords
Trained T5 and T5-large model for creating keywords from text |
|
Emerging |
| 1635 |
yihongXU/TransCenter
This is the official implementation of TransCenter (TPAMI). The code and... |
|
Emerging |
| 1636 |
SkyworkAI/MoE-plus-plus
[ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with... |
|
Emerging |
| 1637 |
UCSC-REAL/DS2
[ICLR 2025] Official implementation of paper "Improving Data Efficiency via... |
|
Emerging |
| 1638 |
cankocagil/SwinDetr
Integration of Swin Transformer to DETR for Robust Object Detection (DEMO) |
|
Emerging |
| 1639 |
yongchao98/R1-Code-Interpreter
R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and... |
|
Emerging |
| 1640 |
ray-project/ray-llm
RayLLM - LLMs on Ray (Archived). Read README for more info. |
|
Emerging |
| 1641 |
ethicalabs-ai/kurtis
Kurtis is a fine-tuning, inference and evaluation tool built for SLMs (Small... |
|
Emerging |
| 1642 |
kyegomez/PALI
Democratization of "PaLI: A Jointly-Scaled Multilingual Language-Image Model" |
|
Emerging |
| 1643 |
etaoxing/multigame-dt
Implementation of Multi-Game Decision Transformers in PyTorch |
|
Emerging |
| 1644 |
trzy/llava-cpp-server
LLaVA server (llama.cpp). |
|
Emerging |
| 1645 |
xuyang-liu16/GlobalCom2
[AAAI 2026] Global Compression Commander: Plug-and-Play Inference... |
|
Emerging |
| 1646 |
grigio/llm-eval-simple
llm-eval-simple is a simple LLM evaluation framework with intermediate... |
|
Emerging |
| 1647 |
nanxiang11/CodeLab_LLM
🌟 从LLaMA2开启大语言模型原理与实践教程 |
|
Emerging |
| 1648 |
cleopatra-itn/fair_multimodal_sentiment
Code and Splits for the paper "A Fair and Comprehensive Comparison of... |
|
Emerging |
| 1649 |
awneesht/KVShuttle
Benchmark & decision framework for KV cache transfer compression in... |
|
Emerging |
| 1650 |
retarfi/language-pretraining
Pre-training Language Models for Japanese |
|
Emerging |
| 1651 |
ChuloAI/BrainChulo
Harnessing the Memory Power of the Camelids |
|
Emerging |
| 1652 |
Ethyros-AI/ModelCypher
ModelCypher - Decipher the high dimensional geometry of LLMs. An open source... |
|
Emerging |
| 1653 |
abhilash1910/LongPegasus
LongPegasus package is used for inducing longformer self attention over base... |
|
Emerging |
| 1654 |
jagilley/fact-checker
Fact-checking LLM outputs with self-ask |
|
Emerging |
| 1655 |
abhisheknair10/llama3.cu
Lightweight Llama 3 8B Inference Engine in CUDA C |
|
Emerging |
| 1656 |
BatsResearch/trove
A Flexible Toolkit for Dense Retrieval |
|
Emerging |
| 1657 |
julienkay/com.doji.transformers
A Unity package to run pretrained transformer models with Unity Sentis |
|
Emerging |
| 1658 |
deep-symbolic-mathematics/Multimodal-Math-Pretraining
[ICLR 2024 Spotlight] This is the official code for the paper "SNIP:... |
|
Emerging |
| 1659 |
TIGER-AI-Lab/VisualWebInstruct
The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction... |
|
Emerging |
| 1660 |
HenryNdubuaku/nanodl
Build GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more in JAX. |
|
Emerging |
| 1661 |
dobriban/Principles-of-AI-LLMs
Materials for the course Principles of AI: LLMs at UPenn (Stat 9911, Spring... |
|
Emerging |
| 1662 |
purvanshjoshi/IndiVoice-DeepASR
Deep Learning framework for Indian-accented Speech-to-Text using Whisper and... |
|
Emerging |
| 1663 |
bnosac/golgotha
Contextualised Embeddings and Language Modelling using BERT and Friends using R |
|
Emerging |
| 1664 |
sh0416/llama-classification
Text classification with Foundation Language Model LLaMA |
|
Emerging |
| 1665 |
monologg/HanBert-Transformers
HanBert on 🤗 Huggingface Transformers 🤗 |
|
Emerging |
| 1666 |
shahrukhx01/siamese-nn-semantic-text-similarity
A repository containing comprehensive Neural Networks based PyTorch... |
|
Emerging |
| 1667 |
jdaln/dgx-spark-inference-stack
Serve the home! Inference stack for your Nvidia DGX Spark aka the Grace... |
|
Emerging |
| 1668 |
voidism/Lookback-Lens
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual... |
|
Emerging |
| 1669 |
mayank164/loveFreeTools
🛠️ Provide free tools like temporary emails, link shortening, and more, all... |
|
Emerging |
| 1670 |
ai-action/cypress-ai-demo
Cypress AI Demo |
|
Emerging |
| 1671 |
IrohXu/Awesome-Multimodal-LLM-Autonomous-Driving
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving |
|
Emerging |
| 1672 |
real-stanford/reflect
[CoRL 2023] REFLECT: Summarizing Robot Experiences for Failure Explanation... |
|
Emerging |
| 1673 |
HKUDS/SepLLM
[ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One... |
|
Emerging |
| 1674 |
FengheTan9/LLM4Seg
[MICCAI 2025] Official code for "Pre-Trained LLM is a Semantic-Aware and... |
|
Emerging |
| 1675 |
wehos/awesome-graph-transformer
Papers about graph transformers. |
|
Emerging |
| 1676 |
nuhmanpk/quick-llama
Run Ollama models on Google Colab |
|
Emerging |
| 1677 |
researchim-ai/models-at-home
training models at home |
|
Emerging |
| 1678 |
FareedKhan-dev/gpt4o-from-scratch
Implementation of a GPT-4o like Multimodal from Scratch using Python |
|
Emerging |
| 1679 |
arrmansa/Basic-UI-for-GPT-Neo-with-low-vram
A basic ui for running gpt neo 2.7B on low vram (3 gb Vram minimum) |
|
Emerging |
| 1680 |
TideDra/VL-RLHF
A RLHF Infrastructure for Vision-Language Models |
|
Emerging |
| 1681 |
a-tokyo/ai-zero-shot-classifier
🧠 leverage advanced AI embeddings to perform multilingual zero-shot text... |
|
Emerging |
| 1682 |
ziqipang/LM4VisualEncoding
[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are... |
|
Emerging |
| 1683 |
rust-dd/iTransformer
An iTransformer implementation in Rust |
|
Emerging |
| 1684 |
Ankur3107/nlp_notebooks
Tensorflow, Pytorch, Huggingface Transformer, Fastai, etc. tutorial Colab Notebooks. |
|
Emerging |
| 1685 |
Airmomo/transformers-docs-zh
【持续更新中】 完全中文版的 Transformers 学习笔记及演示示例,支持 Jupyter Notebook,主要内容来自 🤗 Hugging... |
|
Emerging |
| 1686 |
thushv89/packt_nlp_tensorflow_2
This will contain the code for the 2nd edition of NLP with TensorFlow (Edition 2) |
|
Emerging |
| 1687 |
EvanZhouDev/llm.pdf
Run LLMs inside a PDF file. |
|
Emerging |
| 1688 |
UCSC-REAL/TokenCleaning
[ICML 2025] Official implementation of paper "Token Cleaning: Fine-Grained... |
|
Emerging |
| 1689 |
modal-labs/stopwatch
A tool for benchmarking LLMs on Modal |
|
Emerging |
| 1690 |
Jagatmohan46/tiny-recursive-model
🚀 Implement the Tiny Recursive Model (TRM) for improved performance in... |
|
Emerging |
| 1691 |
wxjiao/ParroT
The ParroT framework to enhance and regulate the Translation Abilities... |
|
Emerging |
| 1692 |
gsarti/t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP |
|
Emerging |
| 1693 |
InternRobotics/PointLLM
[ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large... |
|
Emerging |
| 1694 |
misonsky/HiFT
memory-efficient fine-tuning; support 24G GPU memory fine-tuning 7B |
|
Emerging |
| 1695 |
nickduran/align2-linguistic-alignment
ALIGN 2.0: Modern Python package for multi-level linguistic alignment... |
|
Emerging |
| 1696 |
Tanveer81/ReVisionLLM
This is the official implementation of ReVisionLLM: Recursive... |
|
Emerging |
| 1697 |
amazon-science/recode
Releasing code for "ReCode: Robustness Evaluation of Code Generation Models" |
|
Emerging |
| 1698 |
OnlyTerp/turboquant
First open-source implementation of Google TurboQuant (ICLR 2026) --... |
|
Emerging |
| 1699 |
Gurumurthy30/Stackformer
Modular PyTorch transformer library for building, training, and... |
|
Emerging |
| 1700 |
macabdul9/AnyGen
A Unified and Minimalist Pipeline for Generating Outputs with LLMs... |
|
Emerging |