All Transformer Models

6,968 models ranked by quality score · Page 23 of 70

Showing 2201–2300 of 6,968
# Model Score Tier
2201 ManashJKonwar/NLP-Transformers

Transformer (BERT, GPT2, etc.) based Training Module for popular NLP tasks

34
Emerging
2202 Gen-Verse/ReasonFlux

[NeurIPS 2025 Spotlight] LLM post-training suite — featuring ReasonFlux,...

34
Emerging
2203 leliuga/cohere-configurations

Co:Here Inference configurations

34
Emerging
2204 Hon-Wong/VoRA

[Fully open] [Encoder-free MLLM] Vision as LoRA

34
Emerging
2205 nanowell/Differential-Transformer-PyTorch

PyTorch implementation of the Differential-Transformer architecture for...

34
Emerging
2206 X-iZhang/CCD

📷 CCD: Mitigating Hallucinations in Radiology MLLMs via Clinical Contrastive...

34
Emerging
2207 CLAIRE-Labo/quantile-reward-policy-optimization

Official codebase for "Quantile Reward Policy Optimization: Alignment with...

34
Emerging
2208 cifkao/context-probing

Black-box language model explanation by context length probing

34
Emerging
2209 nareshis21/Truelarge-RT

Android inference engine running 20B+ parameter LLMs on 4GB-8GB RAM devices....

34
Emerging
2210 Hamtech-ai/Persian-Image-Captioning

A Persian Image Captioning model based on Vision Encoder Decoder Models of...

34
Emerging
2211 dougeeai/llama-cpp-python-wheels

Pre-built wheels for llama-cpp-python across platforms and CUDA versions

34
Emerging
2212 forgi86/sysid-transformers

Code to reproduce the results of the paper In-context learning for...

34
Emerging
2213 starmpcc/CAMEL

Clinically Adapted Model Enhanced from LLaMA

34
Emerging
2214 davide-coccomini/MINTIME-Multi-Identity-size-iNvariant-TIMEsformer-for-Video-Deepfake-Detection

Code for Video Deepfake Detector from "MINTIME: Multi-Identity...

34
Emerging
2215 suyash/mlt

Multilingual Neural Machine Translation using Transformers with Conditional...

34
Emerging
2216 PKU-Alignment/beavertails

BeaverTails is a collection of datasets designed to facilitate research on...

34
Emerging
2217 AntonioGr7/pratical-llms

A collection of hand on notebook for LLMs practitioner

34
Emerging
2218 CEC-Agent/CEC

Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for...

34
Emerging
2219 fboulnois/llm-leaderboard-csv

CSVs of the Huggingface and LMArena LLM leaderboards, along with the code to...

34
Emerging
2220 jorgemunozl/Finetunning-Llama-Vision-11b

Inference and finnetunning of a VLM (LLama Vision 11b) using the Unsloth,...

34
Emerging
2221 jakobtroidl/neuron-shape-reasoning

PyTorch Implementation of Global Neuron Shape Reasoning with Point Affinity...

34
Emerging
2222 ASSERT-KTH/repairllama

RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program...

34
Emerging
2223 ModelTC/QLLM

[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate...

34
Emerging
2224 nestordemeure/stop_word

Huggingface transformers stopping criteria that halts the generation when a...

34
Emerging
2225 SqueezeAILab/KVQuant

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with...

34
Emerging
2226 henrikalbihn/gliner-as-a-service

GLiNER model in a FastAPI microservice.

34
Emerging
2227 Infini-AI-Lab/Sequoia

scalable and robust tree-based speculative decoding algorithm

34
Emerging
2228 sdpkjc/SATQuest

🏞 A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs

34
Emerging
2229 wang2226/Awesome-LLM-Decoding

📜 Paper list on decoding methods for LLMs and LVLMs

34
Emerging
2230 itsqyh/Awesome-LMMs-Mechanistic-Interpretability

A curated collection of resources focused on the Mechanistic...

34
Emerging
2231 NiuTrans/LaMaTE

Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine...

34
Emerging
2232 moritztng/fltr

Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.

34
Emerging
2233 DCQN-axiomatics/DCQN-Matrix-Axiomatik-LLM-Protocol

A strict, deterministic LLM protocol for loading, reading and activating the...

34
Emerging
2234 PathologyFoundation/plip

Pathology Language and Image Pre-Training (PLIP) is the first vision and...

34
Emerging
2235 ksm26/Open-Source-Models-with-Hugging-Face

"Open Source Models with Hugging Face" course empowers you with the skills...

34
Emerging
2236 MNoorFawi/curlora

The code repository for the CURLoRA research paper. Stable LLM continual...

34
Emerging
2237 CASE-Lab-UMD/Router-Tuning-Mixture-of-Depths

The open-source Mixture of Depths code and the official implementation of...

34
Emerging
2238 DestroyerDarkNess/fastvlm-webgpu

Real-time video captioning powered by FastVLM

34
Emerging
2239 zerovl/ZeroVL

[ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources

34
Emerging
2240 AkiRusProd/numpy-transformer

A numpy implementation of the Transformer model in "Attention is All You Need"

34
Emerging
2241 WayneMao/RoboMatrix

The Official Implementation of RoboMatrix

34
Emerging
2242 deep-div/PlotLLM

Data Visualization with LLM automatically analyzes data and generates...

34
Emerging
2243 antoninodimaggio/Hugging-Captions

Generate realistic Instagram captions using transformers 🤗

34
Emerging
2244 HaoAreYuDong/MachineLearningLM

Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML

34
Emerging
2245 google/curie

Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long...

34
Emerging
2246 michaelnny/QLoRA-LLM

A simple custom QLoRA implementation for fine-tuning a language model (LLM)...

34
Emerging
2247 Tebmer/Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of...

34
Emerging
2248 Nikityyy/lille

A powerful 130-million-parameter model trained from scratch as part of a...

34
Emerging
2249 hesamsheikh/llm-mechanics

Coding an LLM and its building blocks from scratch.

34
Emerging
2250 OneInterface/realtime-bakllava

llama.cpp with BakLLaVA model describes what does it see

34
Emerging
2251 RLHFlow/Online-RLHF

A recipe for online RLHF and online iterative DPO.

34
Emerging
2252 iKernels/transformers-lightning

A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses...

34
Emerging
2253 holarissun/RewardModelingBeyondBradleyTerry

official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models...

34
Emerging
2254 hpdps-group/ElasticMM

ElasticMM: Elastic and Efficient MLLM Serving System

34
Emerging
2255 rezazad68/transdeeplab

TransDeepLab: Convolution-Free Transformer-based DeepLab v3+ for Medical...

34
Emerging
2256 RAHB-REALTORS-Association/email-autodrafts

Email Auto-ReplAI is a Python tool that uses AI to automate drafting...

34
Emerging
2257 Pengxin-Guo/FedSA-LoRA

Selective Aggregation for Low-Rank Adaptation in Federated Learning [ICLR 2025]

34
Emerging
2258 jonrbates/turing

A PyTorch library for simulating Turing machines with neural networks, based...

33
Emerging
2259 Uralstech/vid-orca

Deploy LLaMA-2 Chat on Google Cloud.

33
Emerging
2260 srsawant34/efficient_instruction_learning

Code base for the paper "Instruction Tuned Models are Quick Learners".

33
Emerging
2261 Riccorl/llama-trainer

Llama Trainer Utility

33
Emerging
2262 hollobit/GenAI_LLM_timeline

ChatGPT, GenerativeAI and LLMs Timeline

33
Emerging
2263 Anjum48/commonlitreadabilityprize

4th Place solution for the Kaggle CommonLit Readability Prize

33
Emerging
2264 declare-lab/TEAM

Our EMNLP 2022 paper on MCQA

33
Emerging
2265 MLD3/steerability

An open-source evaluation framework for measuring LLM steerability.

33
Emerging
2266 Srijan-D/LangChain-v0.2-HuggingFace-Llama3

This project integrates LangChain v0.2.6, HuggingFace Serverless Inference...

33
Emerging
2267 elephantmipt/compressors

A small library with distillation, quantization and pruning pipelines

33
Emerging
2268 graphcore-research/jax-scalify

JAX Scalify: end-to-end scaled arithmetics

33
Emerging
2269 chrisjob1021/transformer-examples

A collection of educational toy implementations and examples of key...

33
Emerging
2270 UBC-MDS/fixml

LLM Tool for effective test evaluation of ML projects with curated...

33
Emerging
2271 smitkiri/news-qa

Reading comprehension based question-answering model for news articles.

33
Emerging
2272 IIT-DM/BattleofLLMs

Benchmarks of LLMs with Conversational QA datasets.

33
Emerging
2273 HariomJangra/project-lumen

A 128M parameter language model built from scratch for learning how large...

33
Emerging
2274 loretoparisi/bert_text_classifier

Text Classification with BERT

33
Emerging
2275 akanyaani/miniLLAMA

A simplified LLAMA implementation for training and inference tasks.

33
Emerging
2276 jseeio/gpt2-tfjs

GPT2 with Tensorflow.js

33
Emerging
2277 YuanGongND/ltu

Code, Dataset, and Pretrained Models for Audio and Speech Large Language...

33
Emerging
2278 haesleinhuepf/vlm-pictionary

Play pictionary with Vision Language Models!

33
Emerging
2279 Esmail-ibraheem/Tinyllamas-pytorch

Tinyllamas🦙 is an Extensible advanced language model framework, inspired by...

33
Emerging
2280 Nondzu/LlamaTor

LlamaTor: Decentralized AI model sharing via BitTorrent for efficient,...

33
Emerging
2281 telekom/transformer-tools

Transformers Training Tools

33
Emerging
2282 Ajax0564/VyomAI

VyomAI: state-of-the-art NLP LLM Vision MultiModel transformers ...

33
Emerging
2283 songxiaoshuai/progco

Official Implementation of "ProgCo: Program Helps Self-Correction of Large...

33
Emerging
2284 DoubleVII/lithft

Pretrain, finetune any LLMs from huggingface on your own data.

33
Emerging
2285 wangcongcong123/transection

Transection: Transformers for English to Chinese Translation

33
Emerging
2286 monk1337/NanoPeft

The simplest repository & Neat implementation of different Lora methods for...

33
Emerging
2287 pat-jj/KG-FIT

[NeurIPS'24] Knowledge Graph Fine-Tuning using LLMs

33
Emerging
2288 microsoft/MMLU-CF

A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]

33
Emerging
2289 jianzhnie/LLMToolkit

LLMToolkit is a toolkit for NLP(Natural Language Processing) and LLM(Large...

33
Emerging
2290 daskol/llama.py

Python bindings to llama.cpp

33
Emerging
2291 sail-sg/dice

Official implementation of Bootstrapping Language Models via DPO Implicit Rewards

33
Emerging
2292 detsutut/ama-bot

A modern and lightweight NLP interface for Question-Answering systems and...

33
Emerging
2293 yaojin17/Unlearning_LLM

[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large...

33
Emerging
2294 notAI-tech/Anuvaad

State of the art open-source translation for Indic languages.

33
Emerging
2295 rkinas/reasoning_models_how_to

This repository serves as a collection of research notes and resources on...

33
Emerging
2296 krishnapriya-18/COVID-19-Tweet-Classification-using-Roberta-and-Bert-Simple-Transformers

Rank 1 / 216

33
Emerging
2297 duyhominhnguyen/Exgra-Med

[NeurIPS 2025] ExGra-Med: Medical Multi-Modal LLM with Extended Context Alignment

33
Emerging
2298 hasanisaeed/C-Transformer

Implementation of the core Transformer architecture in pure C

33
Emerging
2299 SORRY-Bench/sorry-bench

Benchmark evaluation code for "SORRY-Bench: Systematically Evaluating Large...

33
Emerging
2300 WooooDyy/BAPO

Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for...

33
Emerging
« Prev 1 2 3 21 22 23 24 25 68 69 70 Next »