All Transformer Models

6,429 models ranked by quality score · Page 17 of 65

Showing 1601–1700 of 6,429
# Model Score Tier
1601 cgtuebingen/ua3dscancomp

Latent Uncertainty-Aware Multi-View SDF Scan Completion

36
Emerging
1602 jackaduma/ChatGLM-LoRA-RLHF-PyTorch

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer...

36
Emerging
1603 HenryHZY/Awesome-Multimodal-LLM

Research Trends in LLM-guided Multimodal Learning.

36
Emerging
1604 SeekingDream/DyCodeEval

Official repository of the ICML2025 paper “Dynamic Benchmarking of Reasoning...

36
Emerging
1605 chenmozhijin/BSRoformer.cpp

GGML-based C++ inference for BS Roformer/Mel-Band-Roformer vocal separation...

36
Emerging
1606 Relaxed-System-Lab/Flash-Sparse-Attention

🚀🚀 Efficient implementations of Native Sparse Attention

36
Emerging
1607 gbaptista/ollama-ai

A Ruby gem for interacting with Ollama's API that allows you to run open...

36
Emerging
1608 TIGER-AI-Lab/LongICLBench

Code and Data for "Long-context LLMs Struggle with Long In-context Learning"...

36
Emerging
1609 surrey-nlp/NLP-2026

Labs for COM3029/COMM061 at University of Surrey

36
Emerging
1610 guxm2021/ALT_SpeechBrain

[ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription

36
Emerging
1611 c0sogi/llama-api

An OpenAI-like LLaMA inference API

36
Emerging
1612 bahree/helloLondon

Historical Language Model for London - A specialized LLM trained on...

36
Emerging
1613 gusye1234/llm-as-function

Embed your LLM into a python function

36
Emerging
1614 rese1f/aurora

[ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a...

36
Emerging
1615 jonrbates/turing

A PyTorch library for simulating Turing machines with neural networks, based...

36
Emerging
1616 jqtangust/Robust-R1

🔥🔥🔥[AAAI 2026 Oral] Official Implementation of Robust-R1: Degradation-Aware...

36
Emerging
1617 avocardio/Zicklein

Finetuning instruct-LLaMA on german datasets.

36
Emerging
1618 gotzmann/booster

Booster - open accelerator for LLM models. Better inference and debugging...

36
Emerging
1619 HOLYKEYZ/model-unfetter

The production engine for directional ablation. Unalign / remove models...

36
Emerging
1620 PaddlePaddle/PALM

a Fast, Flexible, Extensible and Easy-to-use NLP Large-scale Pretraining and...

36
Emerging
1621 m0dulo/InferSpore

🌱 A fully independent Large Language Model (LLM) inference engine, built...

36
Emerging
1622 Ratnesh-181998/python-ai-ml-libraries

A comprehensive Python AI/ML repository covering end-to-end workflows using...

36
Emerging
1623 AutonomicPerfectionist/PipeInfer

PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation

36
Emerging
1624 TIGER-AI-Lab/MAmmoTH

Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid...

36
Emerging
1625 declare-lab/LLM-PuzzleTest

This repository is maintained to release dataset and models for multimodal...

36
Emerging
1626 haiodo/oaitt

An OpenAI compatible transcriber using transformers and whisperx.

36
Emerging
1627 jeffreysijuntan/lloco

The official repo for "LLoCo: Learning Long Contexts Offline"

36
Emerging
1628 HiThink-Research/BizFinBench

A Business-Driven Real-World Financial Benchmark for Evaluating LLMs

36
Emerging
1629 MME-Benchmarks/Video-MME

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark...

36
Emerging
1630 robinhad/kruk

Ukrainian instruction-tuned language models and datasets

36
Emerging
1631 BUAADreamer/Chinese-LLaVA-Med

中文医学多模态大模型 Large Chinese Language-and-Vision Assistant for BioMedicine

36
Emerging
1632 SingleZombie/LLSA

Official implementation of Log-linear Sparse Attention (LLSA).

36
Emerging
1633 JinjieNi/MegaDLMs

GPU-optimized framework for training diffusion language models at any scale....

36
Emerging
1634 0x7o/text2keywords

Trained T5 and T5-large model for creating keywords from text

36
Emerging
1635 yihongXU/TransCenter

This is the official implementation of TransCenter (TPAMI). The code and...

36
Emerging
1636 SkyworkAI/MoE-plus-plus

[ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with...

36
Emerging
1637 UCSC-REAL/DS2

[ICLR 2025] Official implementation of paper "Improving Data Efficiency via...

36
Emerging
1638 cankocagil/SwinDetr

Integration of Swin Transformer to DETR for Robust Object Detection (DEMO)

36
Emerging
1639 yongchao98/R1-Code-Interpreter

R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and...

36
Emerging
1640 ray-project/ray-llm

RayLLM - LLMs on Ray (Archived). Read README for more info.

36
Emerging
1641 ethicalabs-ai/kurtis

Kurtis is a fine-tuning, inference and evaluation tool built for SLMs (Small...

36
Emerging
1642 kyegomez/PALI

Democratization of "PaLI: A Jointly-Scaled Multilingual Language-Image Model"

36
Emerging
1643 etaoxing/multigame-dt

Implementation of Multi-Game Decision Transformers in PyTorch

36
Emerging
1644 trzy/llava-cpp-server

LLaVA server (llama.cpp).

36
Emerging
1645 xuyang-liu16/GlobalCom2

[AAAI 2026] Global Compression Commander: Plug-and-Play Inference...

36
Emerging
1646 grigio/llm-eval-simple

llm-eval-simple is a simple LLM evaluation framework with intermediate...

36
Emerging
1647 nanxiang11/CodeLab_LLM

🌟 从LLaMA2开启大语言模型原理与实践教程

36
Emerging
1648 cleopatra-itn/fair_multimodal_sentiment

Code and Splits for the paper "A Fair and Comprehensive Comparison of...

36
Emerging
1649 awneesht/KVShuttle

Benchmark & decision framework for KV cache transfer compression in...

36
Emerging
1650 retarfi/language-pretraining

Pre-training Language Models for Japanese

36
Emerging
1651 ChuloAI/BrainChulo

Harnessing the Memory Power of the Camelids

36
Emerging
1652 Ethyros-AI/ModelCypher

ModelCypher - Decipher the high dimensional geometry of LLMs. An open source...

36
Emerging
1653 abhilash1910/LongPegasus

LongPegasus package is used for inducing longformer self attention over base...

36
Emerging
1654 jagilley/fact-checker

Fact-checking LLM outputs with self-ask

36
Emerging
1655 abhisheknair10/llama3.cu

Lightweight Llama 3 8B Inference Engine in CUDA C

36
Emerging
1656 BatsResearch/trove

A Flexible Toolkit for Dense Retrieval

36
Emerging
1657 julienkay/com.doji.transformers

A Unity package to run pretrained transformer models with Unity Sentis

36
Emerging
1658 deep-symbolic-mathematics/Multimodal-Math-Pretraining

[ICLR 2024 Spotlight] This is the official code for the paper "SNIP:...

36
Emerging
1659 TIGER-AI-Lab/VisualWebInstruct

The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction...

36
Emerging
1660 HenryNdubuaku/nanodl

Build GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more in JAX.

36
Emerging
1661 dobriban/Principles-of-AI-LLMs

Materials for the course Principles of AI: LLMs at UPenn (Stat 9911, Spring...

36
Emerging
1662 purvanshjoshi/IndiVoice-DeepASR

Deep Learning framework for Indian-accented Speech-to-Text using Whisper and...

36
Emerging
1663 bnosac/golgotha

Contextualised Embeddings and Language Modelling using BERT and Friends using R

36
Emerging
1664 sh0416/llama-classification

Text classification with Foundation Language Model LLaMA

36
Emerging
1665 monologg/HanBert-Transformers

HanBert on 🤗 Huggingface Transformers 🤗

36
Emerging
1666 shahrukhx01/siamese-nn-semantic-text-similarity

A repository containing comprehensive Neural Networks based PyTorch...

36
Emerging
1667 jdaln/dgx-spark-inference-stack

Serve the home! Inference stack for your Nvidia DGX Spark aka the Grace...

36
Emerging
1668 voidism/Lookback-Lens

Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual...

36
Emerging
1669 mayank164/loveFreeTools

🛠️ Provide free tools like temporary emails, link shortening, and more, all...

36
Emerging
1670 ai-action/cypress-ai-demo

Cypress AI Demo

36
Emerging
1671 IrohXu/Awesome-Multimodal-LLM-Autonomous-Driving

[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving

36
Emerging
1672 real-stanford/reflect

[CoRL 2023] REFLECT: Summarizing Robot Experiences for Failure Explanation...

36
Emerging
1673 HKUDS/SepLLM

[ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One...

36
Emerging
1674 FengheTan9/LLM4Seg

[MICCAI 2025] Official code for "Pre-Trained LLM is a Semantic-Aware and...

36
Emerging
1675 wehos/awesome-graph-transformer

Papers about graph transformers.

35
Emerging
1676 nuhmanpk/quick-llama

Run Ollama models on Google Colab

35
Emerging
1677 researchim-ai/models-at-home

training models at home

35
Emerging
1678 FareedKhan-dev/gpt4o-from-scratch

Implementation of a GPT-4o like Multimodal from Scratch using Python

35
Emerging
1679 arrmansa/Basic-UI-for-GPT-Neo-with-low-vram

A basic ui for running gpt neo 2.7B on low vram (3 gb Vram minimum)

35
Emerging
1680 TideDra/VL-RLHF

A RLHF Infrastructure for Vision-Language Models

35
Emerging
1681 a-tokyo/ai-zero-shot-classifier

🧠 leverage advanced AI embeddings to perform multilingual zero-shot text...

35
Emerging
1682 ziqipang/LM4VisualEncoding

[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are...

35
Emerging
1683 rust-dd/iTransformer

An iTransformer implementation in Rust

35
Emerging
1684 Ankur3107/nlp_notebooks

Tensorflow, Pytorch, Huggingface Transformer, Fastai, etc. tutorial Colab Notebooks.

35
Emerging
1685 Airmomo/transformers-docs-zh

【持续更新中】 完全中文版的 Transformers 学习笔记及演示示例,支持 Jupyter Notebook,主要内容来自 🤗 Hugging...

35
Emerging
1686 thushv89/packt_nlp_tensorflow_2

This will contain the code for the 2nd edition of NLP with TensorFlow (Edition 2)

35
Emerging
1687 EvanZhouDev/llm.pdf

Run LLMs inside a PDF file.

35
Emerging
1688 UCSC-REAL/TokenCleaning

[ICML 2025] Official implementation of paper "Token Cleaning: Fine-Grained...

35
Emerging
1689 modal-labs/stopwatch

A tool for benchmarking LLMs on Modal

35
Emerging
1690 Jagatmohan46/tiny-recursive-model

🚀 Implement the Tiny Recursive Model (TRM) for improved performance in...

35
Emerging
1691 wxjiao/ParroT

The ParroT framework to enhance and regulate the Translation Abilities...

35
Emerging
1692 gsarti/t5-flax-gcp

Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP

35
Emerging
1693 InternRobotics/PointLLM

[ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large...

35
Emerging
1694 misonsky/HiFT

memory-efficient fine-tuning; support 24G GPU memory fine-tuning 7B

35
Emerging
1695 nickduran/align2-linguistic-alignment

ALIGN 2.0: Modern Python package for multi-level linguistic alignment...

35
Emerging
1696 Tanveer81/ReVisionLLM

This is the official implementation of ReVisionLLM: Recursive...

35
Emerging
1697 amazon-science/recode

Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"

35
Emerging
1698 OnlyTerp/turboquant

First open-source implementation of Google TurboQuant (ICLR 2026) --...

35
Emerging
1699 Gurumurthy30/Stackformer

Modular PyTorch transformer library for building, training, and...

35
Emerging
1700 macabdul9/AnyGen

A Unified and Minimalist Pipeline for Generating Outputs with LLMs...

35
Emerging
« Prev 1 2 3 15 16 17 18 19 63 64 65 Next »