All Transformer Models

6,429 models ranked by quality score · Page 22 of 65

Showing 2101–2200 of 6,429
# Model Score Tier
2101 AIRI-Institute/Probing_framework

Framework for probing tasks

32
Emerging
2102 RishabSA/interp-refusal-tokens

We study whether categorical refusal tokens enable controllable and...

32
Emerging
2103 dirmacs/lancor

A Rust client library for llama.cpp's OpenAI-compatible API server

32
Emerging
2104 anthonyfoust/ai-stack-homelab

Complete AI automation stack optimized for Mac Mini M4, but can work in...

32
Emerging
2105 taesiri/ArXivQA

WIP - Automated Question Answering for ArXiv Papers with Large Language...

32
Emerging
2106 nsi319/Finetune-Transformers

Abstractive text summarization by fine-tuning seq2seq models.

32
Emerging
2107 AspirinCode/AlphaPPImd

Exploring the conformational ensembles of protein-protein complexes with...

32
Emerging
2108 deep-div/Fine-Tuning-LLMs-and-VisionModels

Fine-Tuning LLMs (Gemma, LLaMA, Mistral, etc.) A practical guide to...

32
Emerging
2109 sixfingerdev/-Sixfinger-API---10-20x-Faster-AI-Chat-API

# ⚡ Sixfinger API - 10-20x Faster AI Chat API. İncludes 9 models.

32
Emerging
2110 styfeng/TinyDialogues

Code & data for the EMNLP 2024 paper: Is Child-Directed Speech Effective...

31
Emerging
2111 NTU-SQUAD/transformers-coqa

Albert for Conversational Question Answering Challenge

31
Emerging
2112 titanml/takeoff-community

TitanML Takeoff Server is an optimization, compression and deployment...

31
Emerging
2113 codefuse-ai/GALLa

[ACL 2025] Graph Aligned Large Language Models for Improved Source Code Understanding

31
Emerging
2114 pagraf/Seabed-Net

Quick start guide for Seabed-Net

31
Emerging
2115 deep-symbolic-mathematics/Multimodal-Symbolic-Regression

[ICLR 2024 Spotlight] SNIP on Symbolic Regression: Deep Symbolic Regression...

31
Emerging
2116 wassemgtk/llm.scala

Extensible implementation of a Language Model (LLM) training framework in Scala.

31
Emerging
2117 dropbox/grallama-panel

GraLLAMA panel for LLAMA data

31
Emerging
2118 FranxYao/FlanT5-CoT-Specialization

Implementation of ICML 23 Paper: Specializing Smaller Language Models...

31
Emerging
2119 IParraMartin/An-Explanation-Is-All-You-Need

The original transformer implementation from scratch. It contains...

31
Emerging
2120 xiaoachen98/Open-LLaVA-NeXT

An open-source implementation for training LLaVA-NeXT.

31
Emerging
2121 SCRN-VRC/Language-Translation-with-Fragment-Shaders

EN to JP and JP to EN with transformer models

31
Emerging
2122 Chunjiang-Intelligence/Credal-Transformer

论文「Credal Transformer: A Principled Approach for Quantifying and Mitigating...

31
Emerging
2123 RaptorMai/MLLM-CompBench

[NeurIPS'25] MLLM-CompBench evaluates the comparative reasoning of MLLMs...

31
Emerging
2124 FudanDISC/ReForm-Eval

An benchmark for evaluating the capabilities of large vision-language models (LVLMs)

31
Emerging
2125 Yifan-Song793/ETO

Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents...

31
Emerging
2126 nlp-uoregon/Okapi

Okapi: Instruction-tuned Large Language Models in Multiple Languages with...

31
Emerging
2127 ivanovitchm/PPGEEC2318

Repository for EEC2318, a graduate course on PPgEEC about Machine Learning

31
Emerging
2128 TamSiuhin/LLM-UM-Reading

A list of large language models for user modeling (LLM-UM) papers, based on...

31
Emerging
2129 tongnie/ImputeFormer

[KDD 2024] "ImputeFormer: Low Rankness-Induced Transformers for...

31
Emerging
2130 smpanaro/coreml-llm-cli

CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.

31
Emerging
2131 makllama/makllama

MaK(Mac+Kubernetes)llama - Running and orchestrating large language models...

31
Emerging
2132 Relaxed-System-Lab/HexGen

[ICML 2024] Serving LLMs on heterogeneous decentralized clusters.

31
Emerging
2133 AGI-Edgerunners/LLM-Optimizers-Papers

Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic...

31
Emerging
2134 juzhengz/LoRI

[COLM 2025] LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation

31
Emerging
2135 QwenLM/PolyMath

[NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath:...

31
Emerging
2136 Saivineeth147/llm-testlab

Comprehensive Testing Tool for Large Language Models

31
Emerging
2137 miranthajayatilake/nanoQA

Question-answering on your own data with Large Language Models (LLMs)

31
Emerging
2138 ZongXR/8th-National-AI-Training-Competition

第八届全国职工职业技能大赛人工智能训练师赛项

31
Emerging
2139 frankluise5220/ComfyUI-Lorahelper

A professional automation toolkit for ComfyUI to prepare LoRA training data...

31
Emerging
2140 DomHudson/bert-in-production

A collection of resources on using BERT (https://arxiv.org/abs/1810.04805 )...

31
Emerging
2141 danieloquelis/natural-language-git

Offline LLM-powered Git CLI tool. NLGit interprets your natural language...

31
Emerging
2142 JonSnow1807/Medical-Prescription-OCR

OCR system for handwritten medical prescriptions using Donut transformer and...

31
Emerging
2143 vbario/sleeping-llm

A language model that forms persistent memories from conversation and...

31
Emerging
2144 OpenMOSS/LongLLaDA

[AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

31
Emerging
2145 singhsidhukuldeep/Text-Summarizer

Comparing state of the art models for text summary generation

31
Emerging
2146 RahulSChand/llama2.c-for-dummies

Step by step explanation/tutorial of llama2.c

31
Emerging
2147 KishanBagaria/dAbot

🤖 CLI tool to automate stuff on DeviantArt.com

31
Emerging
2148 xmed-lab/TAM

[ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs

31
Emerging
2149 EagleW/Stage-wise-Fine-tuning

Code for Stage-wise Fine-tuning for Graph-to-Text Generation

31
Emerging
2150 jshuadvd/LongRoPE

Implementation of the LongRoPE: Extending LLM Context Window Beyond 2...

31
Emerging
2151 alan-turing-institute/prompto

An open source library for asynchronous querying of LLM endpoints

31
Emerging
2152 HLTCHKUST/VG-GPLMs

The code repository for EMNLP 2021 paper "Vision Guided Generative...

31
Emerging
2153 Orion-AI-Lab/televit

Teleconnection-driven vision transformers for improved long-term forecasting

31
Emerging
2154 ryoungj/ObsScaling

[NeurIPS'24 Spotlight] Observational Scaling Laws

31
Emerging
2155 vmarinowski/infini-attention

An unofficial pytorch implementation of 'Efficient Infinite Context...

31
Emerging
2156 ant-louis/belgpt2

🇧🇪 BelGPT-2: the 1st GPT model pretrained in French.

31
Emerging
2157 raymin0223/fast_robust_early_exit

Fast and Robust Early-Exiting Framework for Autoregressive Language Models...

31
Emerging
2158 AlexIoannides/transformers-gen-ai

Developing generative language models using transformers.

31
Emerging
2159 iVishalr/GPT

A minimal and efficient Pytorch implementation of OpenAI's GPT (Generative...

31
Emerging
2160 mts-ai/OpenAutoNLU

An open-source pipeline for training natural language understanding models

31
Emerging
2161 otvam/pyscalexfmr

Optimization and Scaling of Medium-Frequency Transformers

31
Emerging
2162 yangjianxin1/LongQLoRA

LongQLoRA: Extent Context Length of LLMs Efficiently

31
Emerging
2163 Mmorgan-ML/Phase-Slip-Sampler

Phase-Slip is a stochastic intervention architecture that operates on the...

31
Emerging
2164 UIC-Liu-Lab/ContinualLM

An Extensible Continual Learning Framework Focused on Language Models (LMs)

31
Emerging
2165 kyegomez/MambaDecoderBlock

MambaDecoderBlock is a novel decoder architecture that replaces traditional...

31
Emerging
2166 ChanMeng666/interactive-story-generator

【Join our constellation of stargazers!⭐️】An interactive AI-powered story...

31
Emerging
2167 shikiw/Modality-Integration-Rate

[ICCV 2025] The official code of the paper "Deciphering Cross-Modal...

31
Emerging
2168 curtisgray/wingman

Wingman is the fastest and easiest way to run Llama models on your PC or Mac.

31
Emerging
2169 obss/turkish-question-generation

Automated question generation and question answering from Turkish texts...

31
Emerging
2170 ntropy-network/enrichment_models

This repository benchmark Ntropy API against different Large Language Models...

31
Emerging
2171 Utshav-paudel/LLM-Zero-to-Hero

This repo contains the resources, projects and documentation of mine while...

31
Emerging
2172 dsdanielpark/hf-transllm

LLMtranslator translates and generates text in multiple languages.

31
Emerging
2173 Kagamma/llama-pas

Free Pascal bindings for llama.cpp

31
Emerging
2174 qiqiApink/MotionGPT

The official PyTorch implementation of the paper "MotionGPT: Finetuned LLMs...

31
Emerging
2175 vipulraheja/coedit

Official implementation of the paper "CoEdIT: Text Editing by Task-Specific...

31
Emerging
2176 katanaml/table-query-model

Table Query with ML

31
Emerging
2177 Riko0/messenger_logger_callback

messenger-logger-callback — Send ML training logs to Telegram. Standalone...

31
Emerging
2178 luiskugel/AI-Writing-Assistant-for-Thunderbird

A Thunderbird extension that helps improve your email writing using various...

31
Emerging
2179 Phildram1/myantfarm-ai

Multi-Agent LLM Orchestration for High-Quality Incident Response - 100%...

31
Emerging
2180 LostBeard/SpawnDev.BlazorJS.TransformersJS

Use Transformers.js from Blazor WebAssembly to run pretrained models with...

31
Emerging
2181 Kirill-Kravtsov/drophead-pytorch

An implementation of drophead regularization for pytorch transformers

31
Emerging
2182 iboing/CorDA

CorDA: Context-Oriented Decomposition Adaptation of Large Language Models...

31
Emerging
2183 rohit901/VANE-Bench

[NAACL'25] Contains code and documentation for our VANE-Bench paper.

31
Emerging
2184 baldoarbol/BodyShapeGPT

Fine-tuned LLMs generate accurate 3D human avatars from textual descriptions...

31
Emerging
2185 black-roland/homeassistant-cloud-ru-ai

Cloud.ru Foundation Models — cloud-based AI assistants for Home Assistant

31
Emerging
2186 pdaicode/awesome-LLMs-finetuning

Collection of resources for finetuning Large Language Models (LLMs).

31
Emerging
2187 naity/finetune-esm

Scalable Protein Language Model Finetuning with Distributed Learning and...

31
Emerging
2188 yinizhilian/ICLR2025-Papers-with-Code

历年ICLR论文和开源项目合集,包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.

31
Emerging
2189 hscspring/llama.np

Inference Llama/Llama2/Llama3 Modes in NumPy

31
Emerging
2190 samestrin/llm-newsletter-generator

llm-newsletter-generator transforms a valid RSS feed into a "Newsletter"...

31
Emerging
2191 Roboflow-Universe/finetune-RF-DETR

Modular CLI pipeline for fine‑tuning RF‑DETR object detection models on...

31
Emerging
2192 shinomakoi/magi_llm_gui

A Qt GUI for large language models

31
Emerging
2193 zzz47zzz/codebase-for-incremental-learning-with-llm

[ACL2024] A Codebase for Incremental Learning with Large Language Models;...

31
Emerging
2194 princeton-pli/AdaptMI

[COLM 2025] Adaptive Skill-based In-context Math Instruction for Small...

31
Emerging
2195 prajjwal1/generalize_lm_nli

Code for the paper EMNLP 2021 workshop paper "Generalization in NLI: Ways...

31
Emerging
2196 dmis-lab/Outlier-Safe-Pre-Training

[ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large...

31
Emerging
2197 botisan-ai/sentence-transformers.js

Run sentence-transformers (SBERT) compatible models in Node.js or browser.

31
Emerging
2198 hao-ai-lab/d3LLM

d3LLM: Ultra-Fast Diffusion LLM 🚀

31
Emerging
2199 amin-tehrani/ollama-colab

Serve Ollama LLMs on Google Colab (free plan) using Ngrok

31
Emerging
2200 Zalexanninev15/GetFreeChat

Automatic collection of free instances of AI text models (ChatGPT, Claude,...

31
Emerging
« Prev 1 2 3 20 21 22 23 24 63 64 65 Next »