All Transformer Models
6,429 models ranked by quality score · Page 14 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 1301 |
jackaduma/Vicuna-LoRA-RLHF-PyTorch
A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer... |
|
Emerging |
| 1302 |
harveybc/predictor
Predictor that uses a configurable plugin-based predictive supervised... |
|
Emerging |
| 1303 |
janelu9/EasyLLM
Running Large Language Model easily. |
|
Emerging |
| 1304 |
ruimalheiro/training-custom-llama
Llama-style transformer in PyTorch with multi-node / multi-GPU training.... |
|
Emerging |
| 1305 |
Archimedes1618/Madlab
Madlab is an advanced AI development studio designed to streamline the... |
|
Emerging |
| 1306 |
leaderj1001/CLIP
CLIP: Connecting Text and Image (Learning Transferable Visual Models From... |
|
Emerging |
| 1307 |
slSeanWU/Compose_and_Embellish
Official PyTorch implementation of ICASSP 2023 paper "Compose & Embellish:... |
|
Emerging |
| 1308 |
complex-reasoning/RPG
[ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508) |
|
Emerging |
| 1309 |
padeler/PE-former
2D Human Pose estimation using transformers. Implementation in Pytorch |
|
Emerging |
| 1310 |
Aaronhuang-778/BiLLM
[ICML 2024] BiLLM: Pushing the Limit of Post-Training Quantization for LLMs |
|
Emerging |
| 1311 |
UCSC-VLAA/m1
[ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical... |
|
Emerging |
| 1312 |
lvyufeng/cybertron-ai
mindspore implementation of transformers |
|
Emerging |
| 1313 |
WayneJin0918/SRUM
Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified... |
|
Emerging |
| 1314 |
praeclarum/transformers-js
Browser-compatible JS library for running language models |
|
Emerging |
| 1315 |
AyushExel/trolo
An SDK for Transformers + YOLO and other SSD family models |
|
Emerging |
| 1316 |
zinengtang/TVLT
PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral) |
|
Emerging |
| 1317 |
Michael-A-Kuykendall/shimmytok
Pure Rust tokenizer for GGUF models - llama.cpp compatible |
|
Emerging |
| 1318 |
DeepChainBio/deepchain-apps
A library for deploying App on deepchain.bio |
|
Emerging |
| 1319 |
akx/ollama-dl
Download models from the Ollama library, without Ollama |
|
Emerging |
| 1320 |
YJiangcm/Lion
[EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models |
|
Emerging |
| 1321 |
DAMO-NLP-SG/CLEX
[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models |
|
Emerging |
| 1322 |
young-geng/m3ae_public
Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation |
|
Emerging |
| 1323 |
withcaer/curtana
Simplified zero-cost wrapper over llama.cpp powered by the lama-cpp-2 Crate. |
|
Emerging |
| 1324 |
ariG23498/gemma3-object-detection
Fine tune Gemma 3 on an object detection task |
|
Emerging |
| 1325 |
muhtalhakhan/Hacktoberfest2025
Hacktoberfest 2025 🧑🏻💻 OPEN FIRST Pull Request 🎉 |
|
Emerging |
| 1326 |
anchen1011/FireAct
FireAct: Toward Language Agent Fine-tuning |
|
Emerging |
| 1327 |
amazon-science/crossmodal-contrastive-learning
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video... |
|
Emerging |
| 1328 |
asprenger/ray_vllm_inference
A simple service that integrates vLLM with Ray Serve for fast and scalable... |
|
Emerging |
| 1329 |
ariannamethod/doe
DoE Janus Architecture: Democracy of Experts |
|
Emerging |
| 1330 |
AlekseyKorshuk/huggingartists
Lyrics generation with GPT2-based Transformer |
|
Emerging |
| 1331 |
ChenRocks/UNITER
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt... |
|
Emerging |
| 1332 |
LLMBook-zh/LLMBook-zh.github.io
《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣 |
|
Emerging |
| 1333 |
zjunlp/Deco
[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation |
|
Emerging |
| 1334 |
UKPLab/5pils
Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!"... |
|
Emerging |
| 1335 |
hpretila/llama.net
.NET wrapper for LLaMA.cpp for LLaMA language model inference on CPU. 🦙 |
|
Emerging |
| 1336 |
gopikrsmscs/stock-price-prediction-transformer
Tesal Stock Price Prediction Using Transformer |
|
Emerging |
| 1337 |
riccardomusmeci/mlx-llm
Large Language Models (LLMs) applications and tools running on Apple Silicon... |
|
Emerging |
| 1338 |
amoffat/HeimdaLLM
Constrain LLM output |
|
Emerging |
| 1339 |
THUDM/LongAlign
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs |
|
Emerging |
| 1340 |
golololologol/LLM-Distillery
A pipeline for LLM knowledge distillation |
|
Emerging |
| 1341 |
TatevKaren/BabyGPT-Build_GPT_From_Scratch
BabyGPT: Build Your Own GPT Large Language Model from Scratch Pre-Training... |
|
Emerging |
| 1342 |
slp-rl/slamkit
SlamKit is an open source tool kit for efficient training of SpeechLMs. It... |
|
Emerging |
| 1343 |
liuyukid/transformers-ner
Pytorch-Named-Entity-Recognition-with-transformers |
|
Emerging |
| 1344 |
xenova/sponsorblock-ml
Automatically detect in-video YouTube sponsorships, self/unpaid promotions,... |
|
Emerging |
| 1345 |
jerryshell/resumind
AI 智能简历分析系统,为每个职位定制专属反馈与 ATS 评分 |
|
Emerging |
| 1346 |
AIoT-MLSys-Lab/Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey |
|
Emerging |
| 1347 |
iflytek/VLE
VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型) |
|
Emerging |
| 1348 |
osainz59/Ask2Transformers
A Framework for Textual Entailment based Zero Shot text classification |
|
Emerging |
| 1349 |
xingyizhou/GTR
Global Tracking Transformers, CVPR 2022 |
|
Emerging |
| 1350 |
hasanirtiza/PedesFormer-Transformer-Networks-For-Pedestrian-Detection
Transformer Networks for Pedestrian Detection |
|
Emerging |
| 1351 |
shrut2702/upasak
UI-based Fine-Tuning for Large Language Models (LLMs) |
|
Emerging |
| 1352 |
BarCodeReader/SelfReformer
[TMM-2023] Official implementation of "Towards Complete and Detail-Preserved... |
|
Emerging |
| 1353 |
daniel-furman/sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and... |
|
Emerging |
| 1354 |
DirtyHarryLYL/Transformer-in-Vision
Recent Transformer-based CV and related works. |
|
Emerging |
| 1355 |
baaivision/EVE
EVE Series: Encoder-Free Vision-Language Models from BAAI |
|
Emerging |
| 1356 |
OFA-Sys/ExpertLLaMA
An opensource ChatBot built with ExpertPrompting which achieves 96% of... |
|
Emerging |
| 1357 |
OFA-Sys/OFASys
OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models |
|
Emerging |
| 1358 |
Kaleidophon/nlp-uncertainty-zoo
Model zoo for different kinds of uncertainty quantification methods used in... |
|
Emerging |
| 1359 |
ma2za/telegram-llm-bot
Telegram LLM bot backed by OpenAI, Whisper, Beam, LLaMA, Weaviate, MinIO and MongoDB |
|
Emerging |
| 1360 |
Nkluge-correa/Tucano
Natively pre-trained open-source Portuguese language models. |
|
Emerging |
| 1361 |
Yxxxb/VoCo-LLaMA
[CVPR'2025] VoCo-LLaMA: This repo is the official implementation of... |
|
Emerging |
| 1362 |
itsnamgyu/block-transformer
Block Transformer: Global-to-Local Language Modeling for Fast Inference... |
|
Emerging |
| 1363 |
icon-lab/SLATER
Official implementation of the paper: Unsupervised MRI Reconstruction via... |
|
Emerging |
| 1364 |
lukashermann/hulc
Hierarchical Universal Language Conditioned Policies |
|
Emerging |
| 1365 |
openpsi-project/ReaLHF
Super-Efficient RLHF Training of LLMs with Parameter Reallocation |
|
Emerging |
| 1366 |
WisconsinAIVision/ViP-LLaVA
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary... |
|
Emerging |
| 1367 |
tomekkorbak/pretraining-with-human-feedback
Code accompanying the paper Pretraining Language Models with Human Preferences |
|
Emerging |
| 1368 |
Whiax/BERT-Transformer-Pytorch
Basic implementation of BERT and Transformer in Pytorch in one short python... |
|
Emerging |
| 1369 |
sshh12/multi_token
Embed arbitrary modalities (images, audio, documents, etc) into large... |
|
Emerging |
| 1370 |
AmpereComputingAI/llama.cpp
Ampere optimized llama.cpp |
|
Emerging |
| 1371 |
shufangxun/LLaVA-MoD
[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation |
|
Emerging |
| 1372 |
egaoharu-kensei/flash-attention-triton
Cross-platform FlashAttention-2 Triton implementation for Turing+ GPUs with... |
|
Emerging |
| 1373 |
arrmansa/Basic-UI-for-GPT-J-6B-with-low-vram
A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for... |
|
Emerging |
| 1374 |
sergiomorapardo/AdvancedTopicsAnalytics
Material y notebooks del curso "Tópicos Avanzados en Analítica... |
|
Emerging |
| 1375 |
takara-ai/go-attention
A full attention mechanism and transformer in pure go. |
|
Emerging |
| 1376 |
HillZhang1999/ICD
Code & Data for our Paper "Alleviating Hallucinations of Large Language... |
|
Emerging |
| 1377 |
jakubburkiewicz/node-red-contrib-ollama
A Node-RED module that wraps the ollama.js library, offering its... |
|
Emerging |
| 1378 |
wuwangzhang1216/prometheus
Fully automatic censorship removal for language models. LoRA abliteration +... |
|
Emerging |
| 1379 |
Hugging-Face-Supporter/tftokenizers
Use Huggingface Transformer and Tokenizers as Tensorflow Reusable SavedModels |
|
Emerging |
| 1380 |
epfml/llm-optimizer-benchmark
Benchmarking Optimizers for LLM Pretraining |
|
Emerging |
| 1381 |
Longyichen/Alpaca-family-library
Summarize all open source Large Languages Models and low-cost replication... |
|
Emerging |
| 1382 |
BhabhaAI/dataformer
Solving data for LLMs - Create quality synthetic datasets! |
|
Emerging |
| 1383 |
dbmdz/berts
DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models |
|
Emerging |
| 1384 |
di37/finetuning-quantize-evaluate
Fine-Tune, Quantize, Evaluate: The Complete Guide — LLMs, VLMs, and Embedding Models |
|
Emerging |
| 1385 |
minosvasilias/godot-dodo
Finetuning large language models for GDScript generation. |
|
Emerging |
| 1386 |
moeru-ai/inventory
🧠🃏 Your universal model catalog, everything, everywhere, all at once. |
|
Emerging |
| 1387 |
intersun/LightningDOT
source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT |
|
Emerging |
| 1388 |
zarzouram/image_captioning_with_transformers
Pytorch implementation of image captioning using transformer-based model. |
|
Emerging |
| 1389 |
hao-ai-lab/Consistency_LLM
[ICML 2024] CLLMs: Consistency Large Language Models |
|
Emerging |
| 1390 |
NohTow/PPL-MCTS
Repository for the code of the "PPL-MCTS: Constrained Textual Generation... |
|
Emerging |
| 1391 |
Infini-AI-Lab/vortex_torch
Vortex: A Flexible and Efficient Sparse Attention Framework |
|
Emerging |
| 1392 |
ParCIS/Chimera
Chimera: bidirectional pipeline parallelism for efficiently training... |
|
Emerging |
| 1393 |
kyaiooiayk/Awesome-LLM-Large-Language-Models-Notes
What can I do with a LLM model? |
|
Emerging |
| 1394 |
Curated-Awesome-Lists/awesome-llms-fine-tuning
Explore a comprehensive collection of resources, tutorials, papers, tools,... |
|
Emerging |
| 1395 |
kolinko/effort
An implementation of bucketMul LLM inference |
|
Emerging |
| 1396 |
robert-mcdermott/LLM-Image-Classification
Image Classification Testing with LLMs |
|
Emerging |
| 1397 |
InhwanBae/LMTrajectory
Official Code for "Can Language Beat Numerical Regression? Language-Based... |
|
Emerging |
| 1398 |
matlab-deep-learning/transformer-networks-for-time-series-prediction
Deep Learning in Quantitative Finance: Transformer Networks for Time Series... |
|
Emerging |
| 1399 |
upb-lea/mag-net-hub
MagNet Toolkit - Certified Models of the MagNet Challenge |
|
Emerging |
| 1400 |
chenhan97/TimeLlama
The official repo of TimeLlama, an instruction-finetuned Llama2 series that... |
|
Emerging |