All Transformer Models

6,429 models ranked by quality score · Page 14 of 65

Showing 1301–1400 of 6,429
# Model Score Tier
1301 jackaduma/Vicuna-LoRA-RLHF-PyTorch

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer...

39
Emerging
1302 harveybc/predictor

Predictor that uses a configurable plugin-based predictive supervised...

39
Emerging
1303 janelu9/EasyLLM

Running Large Language Model easily.

39
Emerging
1304 ruimalheiro/training-custom-llama

Llama-style transformer in PyTorch with multi-node / multi-GPU training....

39
Emerging
1305 Archimedes1618/Madlab

Madlab is an advanced AI development studio designed to streamline the...

39
Emerging
1306 leaderj1001/CLIP

CLIP: Connecting Text and Image (Learning Transferable Visual Models From...

39
Emerging
1307 slSeanWU/Compose_and_Embellish

Official PyTorch implementation of ICASSP 2023 paper "Compose & Embellish:...

39
Emerging
1308 complex-reasoning/RPG

[ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)

39
Emerging
1309 padeler/PE-former

2D Human Pose estimation using transformers. Implementation in Pytorch

39
Emerging
1310 Aaronhuang-778/BiLLM

[ICML 2024] BiLLM: Pushing the Limit of Post-Training Quantization for LLMs

39
Emerging
1311 UCSC-VLAA/m1

[ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical...

39
Emerging
1312 lvyufeng/cybertron-ai

mindspore implementation of transformers

39
Emerging
1313 WayneJin0918/SRUM

Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified...

39
Emerging
1314 praeclarum/transformers-js

Browser-compatible JS library for running language models

39
Emerging
1315 AyushExel/trolo

An SDK for Transformers + YOLO and other SSD family models

39
Emerging
1316 zinengtang/TVLT

PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)

39
Emerging
1317 Michael-A-Kuykendall/shimmytok

Pure Rust tokenizer for GGUF models - llama.cpp compatible

39
Emerging
1318 DeepChainBio/deepchain-apps

A library for deploying App on deepchain.bio

39
Emerging
1319 akx/ollama-dl

Download models from the Ollama library, without Ollama

39
Emerging
1320 YJiangcm/Lion

[EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models

39
Emerging
1321 DAMO-NLP-SG/CLEX

[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models

39
Emerging
1322 young-geng/m3ae_public

Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation

39
Emerging
1323 withcaer/curtana

Simplified zero-cost wrapper over llama.cpp powered by the lama-cpp-2 Crate.

39
Emerging
1324 ariG23498/gemma3-object-detection

Fine tune Gemma 3 on an object detection task

39
Emerging
1325 muhtalhakhan/Hacktoberfest2025

Hacktoberfest 2025 🧑🏻‍💻 OPEN FIRST Pull Request 🎉

39
Emerging
1326 anchen1011/FireAct

FireAct: Toward Language Agent Fine-tuning

39
Emerging
1327 amazon-science/crossmodal-contrastive-learning

CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video...

39
Emerging
1328 asprenger/ray_vllm_inference

A simple service that integrates vLLM with Ray Serve for fast and scalable...

39
Emerging
1329 ariannamethod/doe

DoE Janus Architecture: Democracy of Experts

39
Emerging
1330 AlekseyKorshuk/huggingartists

Lyrics generation with GPT2-based Transformer

39
Emerging
1331 ChenRocks/UNITER

Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt...

39
Emerging
1332 LLMBook-zh/LLMBook-zh.github.io

《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣

39
Emerging
1333 zjunlp/Deco

[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

39
Emerging
1334 UKPLab/5pils

Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!"...

39
Emerging
1335 hpretila/llama.net

.NET wrapper for LLaMA.cpp for LLaMA language model inference on CPU. 🦙

39
Emerging
1336 gopikrsmscs/stock-price-prediction-transformer

Tesal Stock Price Prediction Using Transformer

39
Emerging
1337 riccardomusmeci/mlx-llm

Large Language Models (LLMs) applications and tools running on Apple Silicon...

39
Emerging
1338 amoffat/HeimdaLLM

Constrain LLM output

39
Emerging
1339 THUDM/LongAlign

[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs

39
Emerging
1340 golololologol/LLM-Distillery

A pipeline for LLM knowledge distillation

39
Emerging
1341 TatevKaren/BabyGPT-Build_GPT_From_Scratch

BabyGPT: Build Your Own GPT Large Language Model from Scratch Pre-Training...

39
Emerging
1342 slp-rl/slamkit

SlamKit is an open source tool kit for efficient training of SpeechLMs. It...

39
Emerging
1343 liuyukid/transformers-ner

Pytorch-Named-Entity-Recognition-with-transformers

39
Emerging
1344 xenova/sponsorblock-ml

Automatically detect in-video YouTube sponsorships, self/unpaid promotions,...

39
Emerging
1345 jerryshell/resumind

AI 智能简历分析系统,为每个职位定制专属反馈与 ATS 评分

38
Emerging
1346 AIoT-MLSys-Lab/Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

38
Emerging
1347 iflytek/VLE

VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型)

38
Emerging
1348 osainz59/Ask2Transformers

A Framework for Textual Entailment based Zero Shot text classification

38
Emerging
1349 xingyizhou/GTR

Global Tracking Transformers, CVPR 2022

38
Emerging
1350 hasanirtiza/PedesFormer-Transformer-Networks-For-Pedestrian-Detection

Transformer Networks for Pedestrian Detection

38
Emerging
1351 shrut2702/upasak

UI-based Fine-Tuning for Large Language Models (LLMs)

38
Emerging
1352 BarCodeReader/SelfReformer

[TMM-2023] Official implementation of "Towards Complete and Detail-Preserved...

38
Emerging
1353 daniel-furman/sft-demos

Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and...

38
Emerging
1354 DirtyHarryLYL/Transformer-in-Vision

Recent Transformer-based CV and related works.

38
Emerging
1355 baaivision/EVE

EVE Series: Encoder-Free Vision-Language Models from BAAI

38
Emerging
1356 OFA-Sys/ExpertLLaMA

An opensource ChatBot built with ExpertPrompting which achieves 96% of...

38
Emerging
1357 OFA-Sys/OFASys

OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models

38
Emerging
1358 Kaleidophon/nlp-uncertainty-zoo

Model zoo for different kinds of uncertainty quantification methods used in...

38
Emerging
1359 ma2za/telegram-llm-bot

Telegram LLM bot backed by OpenAI, Whisper, Beam, LLaMA, Weaviate, MinIO and MongoDB

38
Emerging
1360 Nkluge-correa/Tucano

Natively pre-trained open-source Portuguese language models.

38
Emerging
1361 Yxxxb/VoCo-LLaMA

[CVPR'2025] VoCo-LLaMA: This repo is the official implementation of...

38
Emerging
1362 itsnamgyu/block-transformer

Block Transformer: Global-to-Local Language Modeling for Fast Inference...

38
Emerging
1363 icon-lab/SLATER

Official implementation of the paper: Unsupervised MRI Reconstruction via...

38
Emerging
1364 lukashermann/hulc

Hierarchical Universal Language Conditioned Policies

38
Emerging
1365 openpsi-project/ReaLHF

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

38
Emerging
1366 WisconsinAIVision/ViP-LLaVA

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary...

38
Emerging
1367 tomekkorbak/pretraining-with-human-feedback

Code accompanying the paper Pretraining Language Models with Human Preferences

38
Emerging
1368 Whiax/BERT-Transformer-Pytorch

Basic implementation of BERT and Transformer in Pytorch in one short python...

38
Emerging
1369 sshh12/multi_token

Embed arbitrary modalities (images, audio, documents, etc) into large...

38
Emerging
1370 AmpereComputingAI/llama.cpp

Ampere optimized llama.cpp

38
Emerging
1371 shufangxun/LLaVA-MoD

[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation

38
Emerging
1372 egaoharu-kensei/flash-attention-triton

Cross-platform FlashAttention-2 Triton implementation for Turing+ GPUs with...

38
Emerging
1373 arrmansa/Basic-UI-for-GPT-J-6B-with-low-vram

A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for...

38
Emerging
1374 sergiomorapardo/AdvancedTopicsAnalytics

Material y notebooks del curso "Tópicos Avanzados en Analítica...

38
Emerging
1375 takara-ai/go-attention

A full attention mechanism and transformer in pure go.

38
Emerging
1376 HillZhang1999/ICD

Code & Data for our Paper "Alleviating Hallucinations of Large Language...

38
Emerging
1377 jakubburkiewicz/node-red-contrib-ollama

A Node-RED module that wraps the ollama.js library, offering its...

38
Emerging
1378 wuwangzhang1216/prometheus

Fully automatic censorship removal for language models. LoRA abliteration +...

38
Emerging
1379 Hugging-Face-Supporter/tftokenizers

Use Huggingface Transformer and Tokenizers as Tensorflow Reusable SavedModels

38
Emerging
1380 epfml/llm-optimizer-benchmark

Benchmarking Optimizers for LLM Pretraining

38
Emerging
1381 Longyichen/Alpaca-family-library

Summarize all open source Large Languages Models and low-cost replication...

38
Emerging
1382 BhabhaAI/dataformer

Solving data for LLMs - Create quality synthetic datasets!

38
Emerging
1383 dbmdz/berts

DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models

38
Emerging
1384 di37/finetuning-quantize-evaluate

Fine-Tune, Quantize, Evaluate: The Complete Guide — LLMs, VLMs, and Embedding Models

38
Emerging
1385 minosvasilias/godot-dodo

Finetuning large language models for GDScript generation.

38
Emerging
1386 moeru-ai/inventory

🧠🃏 Your universal model catalog, everything, everywhere, all at once.

38
Emerging
1387 intersun/LightningDOT

source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT

38
Emerging
1388 zarzouram/image_captioning_with_transformers

Pytorch implementation of image captioning using transformer-based model.

38
Emerging
1389 hao-ai-lab/Consistency_LLM

[ICML 2024] CLLMs: Consistency Large Language Models

38
Emerging
1390 NohTow/PPL-MCTS

Repository for the code of the "PPL-MCTS: Constrained Textual Generation...

38
Emerging
1391 Infini-AI-Lab/vortex_torch

Vortex: A Flexible and Efficient Sparse Attention Framework

38
Emerging
1392 ParCIS/Chimera

Chimera: bidirectional pipeline parallelism for efficiently training...

38
Emerging
1393 kyaiooiayk/Awesome-LLM-Large-Language-Models-Notes

What can I do with a LLM model?

38
Emerging
1394 Curated-Awesome-Lists/awesome-llms-fine-tuning

Explore a comprehensive collection of resources, tutorials, papers, tools,...

38
Emerging
1395 kolinko/effort

An implementation of bucketMul LLM inference

38
Emerging
1396 robert-mcdermott/LLM-Image-Classification

Image Classification Testing with LLMs

38
Emerging
1397 InhwanBae/LMTrajectory

Official Code for "Can Language Beat Numerical Regression? Language-Based...

38
Emerging
1398 matlab-deep-learning/transformer-networks-for-time-series-prediction

Deep Learning in Quantitative Finance: Transformer Networks for Time Series...

38
Emerging
1399 upb-lea/mag-net-hub

MagNet Toolkit - Certified Models of the MagNet Challenge

38
Emerging
1400 chenhan97/TimeLlama

The official repo of TimeLlama, an instruction-finetuned Llama2 series that...

38
Emerging
« Prev 1 2 3 12 13 14 15 16 63 64 65 Next »