All Transformer Models
6,429 models ranked by quality score · Page 24 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 2301 | CognitiveAISystems/RATE · [ICLR 2026] Official implementation of Recurrent Action Transformer with... | | Emerging |
| 2302 | DAMO-NLP-SG/LLM-Multilingual-Knowledge-Boundaries · [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across... | | Emerging |
| 2303 | sandyresearch/chipmunk · 🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E... | | Emerging |
| 2304 | gpustack/gguf-packer-go · Deliver LLMs of GGUF format via Dockerfile. | | Emerging |
| 2305 | Harish25/StudyScreeningLanguageModel · Core LLM for M.A.R.S. (Model Assisted Review System). Utilizes fine-tuned... | | Emerging |
| 2306 | whucs21Mzy/Model-Phase-Transitions · Navigating Model Phase Transitions to Enable Extreme Lossless Compression: A... | | Emerging |
| 2307 | ahazeemi/dPrune · 🌿 dPrune: A Framework for Data Pruning | | Emerging |
| 2308 | partarstu/transformers-in-java · Experimental project for AI and NLP based on Transformer Architecture | | Emerging |
| 2309 | OSUPCVLab/MobileUNETR · Official Implementation of MobileUNETR: A Lightweight End-To-End Hybrid... | | Emerging |
| 2310 | gentaiscool/miners · MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual... | | Emerging |
| 2311 | mubingshen/MLC-SLM-Baseline · The project is associated with the recently-launched INTERSPEECH 2025... | | Emerging |
| 2312 | ulab-uiuc/Time-R1 · Time-R1: Framework and resources for endowing LLMs with comprehensive... | | Emerging |
| 2313 | DebeshJha/TransNetR · Official implementation of TransNetR: Transformer-based Residual Network for... | | Emerging |
| 2314 | gersteinlab/Struc-Bench · [NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating... | | Emerging |
| 2315 | bfilar/URLTran · PyTorch/HuggingFace Implementation of URLTran: Improving Phishing URL... | | Emerging |
| 2316 | ShinoharaHare/LLM-Training · A distributed training framework for large language models powered by Lightning. | | Emerging |
| 2317 | chziakas/redeval · A library for red-teaming LLM applications with LLMs. | | Emerging |
| 2318 | kyegomez/MC-ViT · Implementation of the model: "(MC-ViT)" from the paper: "Memory... | | Emerging |
| 2319 | trekhleb/homemade-gpt-js · A minimal TensorFlow.js re-implementation of Karpathy's minGPT (Generative... | | Emerging |
| 2320 | NVlabs/HMAR · [CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation | | Emerging |
| 2321 | LMLK-seal/HuggingGGUF · Hugging Face Model downloader and GGUF Converter. | | Emerging |
| 2322 | hukenovs/slovo · Slovo: Russian Sign Language Dataset and Models | | Emerging |
| 2323 | theosorus/GPT2-Hasktorch · GPT2 implementation in Haskell with the Hasktorch library, inspired by... | | Emerging |
| 2324 | westlake-repl/NRPStransformer · A Transformer-Based Predictor for Nonribosomal Peptide Synthetases (NRPS)... | | Emerging |
| 2325 | hsisaberi/single-trait-electra · A complete ELECTRA-based framework for Big Five personality trait... | | Emerging |
| 2326 | xmindflow/SSCT · [ICCV 2023] Self-supervised Semantic Segmentation: Consistency over Transformation | | Experimental |
| 2327 | seedatnabeel/CLLM · Curated LLM (ICML 2024) | | Experimental |
| 2328 | crux82/CLiC-it_2023_tutorial · This repository hosts materials from the CLiC-IT 2023 tutorial | | Experimental |
| 2329 | THU-KEG/WaterBench · [ACL2024-Main] Data and Code for WaterBench: Towards Holistic Evaluation of... | | Experimental |
| 2330 | declare-lab/MM-Align · [EMNLP 2022] This repository contains the official implementation of the... | | Experimental |
| 2331 | PromptMixerDev/prompt-mixer-ollama-connector · Ollama Connector | | Experimental |
| 2332 | saeeddhqan/tiny-transformer · Tiny transformer models implemented in PyTorch. | | Experimental |
| 2333 | qizhou000/UniEdit · [NeurIPS 2025 B & D] UniEdit: A Unified Knowledge Editing Benchmark for... | | Experimental |
| 2334 | OSU-STARLAB/Simul-LLM · [ACL 2024] An easily extensible framework for simultaneous, text-to-text... | | Experimental |
| 2335 | francoislanc/midistral · LLM finetuned for generating symbolic music | | Experimental |
| 2336 | ASSERT-KTH/agentic-evals-lab · Framework for training and evaluating LLMs with reinforcement learning in... | | Experimental |
| 2337 | wafflecomposite/langchain-ask-pdf-local · An AI-app that allows you to upload a PDF and ask questions about it. It... | | Experimental |
| 2338 | Aloereed/llama.cpp-server-ohos · Llama.cpp server for OpenHarmony | | Experimental |
| 2339 | antonyvigouret/Pay-Attention-to-MLPs · My implementation of the gMLP model from the paper "Pay Attention to MLPs". | | Experimental |
| 2340 | sashazykov/json-repair-rb · A simple Ruby gem designed to repair broken JSON strings | | Experimental |
| 2341 | Selozhd/FNet-tensorflow · TensorFlow Implementation of "FNet: Mixing Tokens with Fourier Transforms." | | Experimental |
| 2342 | amazon-science/llm-code-preference · Training and Benchmarking LLMs for Code Preference. | | Experimental |
| 2343 | GiannakopoulosIlias/vision-transformer-network-for-mr-electrical-properties-tomography · A 3D Vision Transformer-based neural network for reconstructing electrical... | | Experimental |
| 2344 | dev-sufyaan/Nexlify · Unified API platform for free access to enterprise-grade AI models from... | | Experimental |
| 2345 | krnel-ai/krnel-graph · Lightweight representation engineering dataflow operations for agent developers. | | Experimental |
| 2346 | lrusso/llama3pure · Three inference engines for Llama 3: pure C for desktop systems, pure... | | Experimental |
| 2347 | mehdihosseinimoghadam/AVA-Llama-3 · Fine-Tuned Llama 3 Persian Large Language Model LLM / Persian Llama 3 | | Experimental |
| 2348 | Ludobico/KakaoChatData · KakaoTalk conversation dataset | | Experimental |
| 2349 | zwhe99/X-SIR · [ACL 2024] Can Watermarks Survive Translation? On the Cross-lingual... | | Experimental |
| 2350 | wschella/llm-reliability · Code for the paper "Larger and more instructable language models become less... | | Experimental |
| 2351 | abenechehab/dicl · [ICLR 2025] Official implementation of DICL (Disentangled In-Context... | | Experimental |
| 2352 | corl-team/lime · Official implementation of the paper "You Do Not Fully Utilize Transformer's... | | Experimental |
| 2353 | GeorgeMichailidis/multi-task-mixed-freq · Code repository for "Multi-Task Encoder-Dual-Decoder Modeling Framework on... | | Experimental |
| 2354 | martin-wey/peft-llm-code · Replication package of the paper "Exploring Parameter-Efficient Fine-Tuning... | | Experimental |
| 2355 | frankaging/ReCOGS · ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of... | | Experimental |
| 2356 | Gary3410/TaPA · [arXiv 2023] Embodied Task Planning with Large Language Models | | Experimental |
| 2357 | Silvestre17/BDA_AmazonReviews_DatabricksPySparkAnalysis_MasterProject · 🛍️ Big Data project analyzing Amazon tech reviews using Databricks, PySpark,... | | Experimental |
| 2358 | pluja/maestro · Turn natural language into commands. Your CLI tasks, now as easy as a... | | Experimental |
| 2359 | sigeisler/reinforce-attacks-llms · REINFORCE Adversarial Attacks on Large Language Models: An Adaptive,... | | Experimental |
| 2360 | teddykoker/grokking · PyTorch implementation of "Grokking: Generalization Beyond Overfitting on... | | Experimental |
| 2361 | surrey-nlp/PLOD-AbbreviationDetection · This repository contains the PLOD Dataset for Abbreviation Detection... | | Experimental |
| 2362 | kyegomez/M2PT · Implementation of M2PT in PyTorch from the paper: "Multimodal Pathway:... | | Experimental |
| 2363 | antofuller/configaformers · A Python library for highly configurable transformers - easing model... | | Experimental |
| 2364 | DreamerGPT/DreamerGPT · 🌱 DreamerGPT: instruction fine-tuning for Chinese large language models | | Experimental |
| 2365 | WangRongsheng/Chinese-LLaMA-Alpaca-Usage · 📔 Usage guide and annotated core code for Chinese-LLaMA-Alpaca | | Experimental |
| 2366 | Praveengovianalytics/falcon-evaluate · Falcon Evaluate is an open-source Python library that aims to revolutionise the... | | Experimental |
| 2367 | sampathkethineedi/bert-topic-sentiment · Topic Based Sentiment Detection using BERT | | Experimental |
| 2368 | Am1n3e/active-learning-transformer · A hands-on tutorial on how to use Active Learning with Transformer models. | | Experimental |
| 2369 | mcbal/spin-model-transformers · Physics-inspired transformer modules based on mean-field dynamics of... | | Experimental |
| 2370 | PRIME-RL/Entropy-Mechanism-of-RL · The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning. | | Experimental |
| 2371 | kaist-cvml/I-HallA-v1.0 · [AAAI 2025] Official Implementation of I-HallA v1.0 | | Experimental |
| 2372 | avilum/llama-saas · A client/server for LLaMA (Large Language Model Meta AI) that can run ANYWHERE. | | Experimental |
| 2373 | camenduru/alpaca-lora-colab · Alpaca Lora | | Experimental |
| 2374 | fabienfrfr/tptt · 😊 TPTT: Transforming Pretrained Transformers into Titans | | Experimental |
| 2375 | MaxiDonkey/DelphiGroqCloud · The GroqCloud API wrapper for Delphi provides access to models from Meta,... | | Experimental |
| 2376 | AristotelisPap/Question-Answering-with-BERT-and-Knowledge-Distillation · Fine-tuned BERT on SQuAD 2.0 Dataset. Applied Knowledge Distillation (KD)... | | Experimental |
| 2377 | zjunlp/DynamicKnowledgeCircuits · [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits... | | Experimental |
| 2378 | shahriargolchin/time-travel-in-llms · The official repository for the paper entitled "Time Travel in LLMs: Tracing... | | Experimental |
| 2379 | teilomillet/retrain · A Python library that uses Reinforcement Learning (RL) to train LLMs. | | Experimental |
| 2380 | davidjosipovic/news-trend-analysis · Automated NLP pipeline for news analysis with sentiment detection, topic... | | Experimental |
| 2381 | cakshat/AlloyBERT · Introducing AlloyBERT: a transformer encoder-based model for predicting... | | Experimental |
| 2382 | ai-forever/model-zoo · NLP model zoo for Russian | | Experimental |
| 2383 | actypedef/ARCQuant · Code for the paper "ARCQuant: Boosting NVFP4 Quantization with Augmented... | | Experimental |
| 2384 | LoserCheems/WonderfulMatrices · Wonderful Matrices to Build Small Language Models | | Experimental |
| 2385 | abcsys/libem · Compound AI toolchain for fast and accurate entity matching, powered by LLMs. | | Experimental |
| 2386 | jinzhuoran/RWKU · RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language... | | Experimental |
| 2387 | zrr1999/emotion-recognition · Research on multimodal emotion recognition methods | | Experimental |
| 2388 | alphasecio/groq · A Streamlit chatbot with memory for running open-source text models on Groq. | | Experimental |
| 2389 | sitammeur/qwen2.5-web · Qwen2.5 Instruct, large language model, operates within web browsers via 🤗... | | Experimental |
| 2390 | xlang-ai/text2reward · [ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for... | | Experimental |
| 2391 | ananttripathi/Resume-Analyzer-MLOps · Resume Analyzer is an AI-powered MLOps platform that optimizes your resume... | | Experimental |
| 2392 | DataArcTech/ChartMoE · [ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for... | | Experimental |
| 2393 | astorfi/LLM-Alignment-Project · A comprehensive template for aligning large language models (LLMs) using... | | Experimental |
| 2394 | SALT-NLP/Adaptive-Compositional-Modules · Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive... | | Experimental |
| 2395 | GU-DataLab/stance-detection-KE-MLM · Official resource of the paper "Knowledge Enhanced Masked Language Model for... | | Experimental |
| 2396 | jaketae/vit-breast-cancer · Transfer learning pretrained vision transformers for breast histopathology | | Experimental |
| 2397 | affjljoo3581/Inverse-DALL-E-for-Optical-Character-Recognition · Inverse DALL-E for Optical Character Recognition | | Experimental |
| 2398 | MileBench/MileBench · This repo contains evaluation code for the paper "MileBench: Benchmarking... | | Experimental |
| 2399 | gentaiscool/few-shot-lm · The source code of "Language Models are Few-shot Multilingual Learners" (MRL... | | Experimental |
| 2400 | jordandeklerk/SwinViT · Modified Swin Transformer model in PyTorch on CIFAR-10 for image classification | | Experimental |