All Transformer Models
6,429 models ranked by quality score · Page 42 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 4101 | taherfattahi/MetaWorld-VLA-openai-clip-vit: A lightweight Vision-Language-Action (VLA) baseline for MetaWorld robot-arm... |  | Experimental |
| 4102 | theonesud/embedia: Create LLM-powered webapps with ease |  | Experimental |
| 4103 | horenbergerb/llamagotchi: A bunch of LLaMa model investigations, including recreating generative... |  | Experimental |
| 4104 | yejoon-lee/kr3: KR3: Korean Restaurant Review with Ratings / Experiments on... |  | Experimental |
| 4105 | ponderous-dustiness314/awesome-claude-skills: 📚 Discover essential Claude skills for tasks like document editing, data... |  | Experimental |
| 4106 | IsaacRodgz/Multimodal-Adapters: Adapter modules with support for multimodal fusion of information (text,... |  | Experimental |
| 4107 | itsShnik/adaptively-finetuning-transformers: Adaptively fine tuning transformer based models for multiple domains and... |  | Experimental |
| 4108 | Fortyseven/chit-v2: Chit is a lightweight privacy-focused web chat front-end for Ollama... |  | Experimental |
| 4109 | lucataco/cog-llama-3-vision-alpha: Cog wrapper for qresearch/llama-3-vision-alpha |  | Experimental |
| 4110 | coderonion/awesome-mojo-max-mlir: A collection of some awesome public MAX platform, Mojo programming language... |  | Experimental |
| 4111 | alphadl/OOP-eval: The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs |  | Experimental |
| 4112 | ai-center-kth/cuBERT-source-code-clustering: Fine-tuning cuBERT embeddings for clustering source code by functionality |  | Experimental |
| 4113 | kriskrisliu/PAT: [AAAI 2025] PAT: Pruning-Aware Tuning for Large Language Models |  | Experimental |
| 4114 | pr0ximaCent/Langchain-Chat-Driven-Expense-Tracker-: FinChain is an AI-powered, chat-driven expense tracker. Log your expenses in... |  | Experimental |
| 4115 | webnizam/alpaca-telegram-bot: Simplest way to host a local ChatGPT like model for Telegram. |  | Experimental |
| 4116 | pphuc25/distil-cd: Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive... |  | Experimental |
| 4117 | s-omranpour/Shirin-Sokhan: A Persian Poet Transformer! (finetuned GPT2 on Ganjoor data) |  | Experimental |
| 4118 | Andras7/gpt2-pytorch: Extremely simple and understandable GPT2 implementation with minor tweaks |  | Experimental |
| 4119 | codex-mohan/novaflow: An AI Assistant with Image and Voice Recognition and Modular Node-Based UI |  | Experimental |
| 4120 | codiceSpaghetti/numpyGPT: A from-scratch GPT built with NumPy and Python’s standard library. No... |  | Experimental |
| 4121 | aditeyabaral/maple: Implementation of the paper, MAPLE - MAsking words to generate blackout... |  | Experimental |
| 4122 | hipml/syllabify: a 5M parameter solution to a problem you could solve by counting on your fingers |  | Experimental |
| 4123 | rafaelvp-db/db-ancient-code-translation: Simple repo showing code-to-code and code-to-text capabilities using LLMs on... |  | Experimental |
| 4124 | yashjakhotiya/Adversarial-Attacks-On-Transformers: Exploring vulnerabilities of Transformers-based Malware Detectors to... |  | Experimental |
| 4125 | VoxDroid/llm-wikipedia: A project for fine-tuning large language models (LLMs) on curated Wikipedia... |  | Experimental |
| 4126 | VincLee8188/Spatio-temporal-forecasting-PyTorch: Leverage recent advances in graph convolution and sequence modeling to... |  | Experimental |
| 4127 | SharathHebbar/ML-Project-list: List of all ML projects |  | Experimental |
| 4128 | instavm/llm-token-visualizer: See How Big Exactly A 128k Token Text Is |  | Experimental |
| 4129 | inuwamobarak/nougat: Nougat is Meta AI's revolutionary OCR model designed to transcribe... |  | Experimental |
| 4130 | RubenCasal/owl_vit_detector: NanoOWL Detection System enables real-time open-vocabulary object detection... |  | Experimental |
| 4131 | koesan/Manga_Comic_Colorization_and_Translation_v1: AI-powered manga and comic translator using EasyOCR and Hugging Face... |  | Experimental |
| 4132 | M4D-MKLab-ITI/Crisis-Event-Detection-in-Short-Texts: Implementation of "Leveraging Transformer Self Attention Encoder for Crisis... |  | Experimental |
| 4133 | mhajder/llama.cpp-updater: A shell script to automatically update or build llama.cpp with optimal GPU... |  | Experimental |
| 4134 | Omid-Nejati/Locality-iN-Locality: Robust Transformer with Locality Inductive Bias and Feature Normalization... |  | Experimental |
| 4135 | AdamCoscia/iScore: Upload, score, and visually compare multiple LLM-graded summaries simultaneously! |  | Experimental |
| 4136 | didar00/Final-Project: SELFIES-Transformer: Learning the Representation of Chemical Space for... |  | Experimental |
| 4137 | Curtis-Wu/Equivariant-Graph-Transformer: A deep neural network with hybrid architecture (EGNN + Transformer) for... |  | Experimental |
| 4138 | IbrahimSobh/askpdf: In this tutorial we will see 💡 How to get answers from a PDF file using... |  | Experimental |
| 4139 | fshnkarimi/train_scheduling_assistant: This project utilizes a fine-tuned Large Language Model (LLM) to generate... |  | Experimental |
| 4140 | IbrahimSobh/askdoc: In this tutorial we will see 💡 How to get answers from documents using... |  | Experimental |
| 4141 | PRITHIVSAKTHIUR/Qwen-Image-LoRA-DLC: Qwen-Image model with various LoRA (Low-Rank Adaptation) styles. This tool... |  | Experimental |
| 4142 | Josephrp/SmolFactory: finetune gpt-oss and smollm3 on your data easily and cheaply |  | Experimental |
| 4143 | DrRuin/Lightweight-Fine-Tuning: Lightweight fine-tuning is one of the most important techniques for adapting... |  | Experimental |
| 4144 | Eric2i/LLM-MindMap: EMNLP 2025 - "Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning... |  | Experimental |
| 4145 | yuki-2025/llama3-8b-fine-tuning-math: Fine-Tuning Llama 3-8B for Structured Math Reasoning: Fine-tuning Llama3 8b... |  | Experimental |
| 4146 | mirzayasirabdullahbaig07/Fine-Tuning-LLaMA-3.2-3B-Using-PEFT-LoRA: This project showcases parameter-efficient fine-tuning of the LLaMA 3.2 (3B)... |  | Experimental |
| 4147 | hewei2001/ReachQA: [EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs |  | Experimental |
| 4148 | AmoghPradeep/abstractive-text-summarizer: Abstractive text summarization using BART. |  | Experimental |
| 4149 | AbdBarho/transformers-stack: A full stack solution for deploying a transformers model from HuggingFace |  | Experimental |
| 4150 | ahmed19999520-alt/Veronica-X-Pro-open-source-code-2.0: Advanced AI system with real quantum computing integration, sophisticated... |  | Experimental |
| 4151 | ahmedshahriar/restaurant-menu-pricing: Predict menu prices from 5M+ UberEats menus with an end-to-end MLOps... |  | Experimental |
| 4152 | KrishnanJothi/MT5_Language_identification_NLP: MT5-small is fine-tuned on the downstream task of Natural Language... |  | Experimental |
| 4153 | H0NEYP0T-466/Isabella: ⚙️ Isabella – a full-stack 🚀 conversational system built on FastAPI ✨... |  | Experimental |
| 4154 | mosh98/MMBT: Multimodal BiTransformer [Reimplementation] in PyTorch That Actually Works! |  | Experimental |
| 4155 | harshpimpale/LegalMind: A project that uses Large Language Models (LLMs) to assist users with legal... |  | Experimental |
| 4156 | Nikunj2003/Jira-Standup-Report-API: Automate daily scrum reports with AI-powered insights from Jira data |  | Experimental |
| 4157 | MIbnEKhalid/ChatAPI: A chatbot built using Node.js, Handlebars, and PostgreSQL, leveraging the... |  | Experimental |
| 4158 | longday1102/VietAI-experiment-LLaMA2: ⚡ LLaMA-2 model experiment |  | Experimental |
| 4159 | nehalvaghasiya/RecipeBot: AI chatbot that provides recipe suggestions and cooking instructions based... |  | Experimental |
| 4160 | Andrew2077/Alpaca: Simple Q/A app, where I created a UI for the Alpaca (fine-tuned LLaMA) model... |  | Experimental |
| 4161 | visresearch/SDMPrune: The official implementation of "SDMPrune: Self-Distillation MLP Pruning for... |  | Experimental |
| 4162 | amanongithub7/classical-music-generation: Comparing LSTM and Transformer-based deep learning approaches for classical... |  | Experimental |
| 4163 | NJX-njx/microgpt: 🔬 The most atomic GPT-2 implementation in 265 lines of pure Python & CUDA. A... |  | Experimental |
| 4164 | danelpeng/Awesome-Continual-Leaning-with-PTMs: This is a curated list of "Continual Learning with Pretrained Models" research. |  | Experimental |
| 4165 | Silvestre17/TM_StockTweetSentimentAnalysis_MasterProject: 📈 Master's project using NLP (FinTwitBERT, LSTMs) to classify stock market... |  | Experimental |
| 4166 | eason69113-source/Chat-HuanHuan: Based on Meta-Llama-3.1-8B-Instruct + 4-bit quantization + QLoRA; VRAM usage stays below 9 GB throughout training and inference, RTX... |  | Experimental |
| 4167 | hululuzhu/llama-lora-chinese-couplet: llama-lora e2e example to demo a Chinese Couplet AI in 10 mins. some... |  | Experimental |
| 4168 | zhengyima/knowqa: Baseline for the pretrained-model knowledge measurement competition, F1 0.35, BERTForMaskedLM |  | Experimental |
| 4169 | LlamaGenAI/llamagenai-openapi: LlamaGen.Ai REST API, LlamaGen is AI Comic Factory - Generate Comics with... |  | Experimental |
| 4170 | 1tangerine1day/chinese-QA-chatbot: A simple Chinese QA chatbot implemented with PyTorch and transformer trained... |  | Experimental |
| 4171 | guanlisheng/synochatgpt: Synology Chat + Ollama + Chatgpt => synochatgpt |  | Experimental |
| 4172 | lenticularis39/llama2.inferno: Inference Llama 2 in one file of pure Limbo |  | Experimental |
| 4173 | styfeng/SMERTI: Code for SMERTI for Semantic Text Exchange. |  | Experimental |
| 4174 | ia-labo/French-News-Clustering: Text classification and clustering using transformers and Denstream. |  | Experimental |
| 4175 | Zoclee/xojo-llama: A wrapper module to do local LLM inference on GGUF models using the... |  | Experimental |
| 4176 | Ebimsv/LLM-Lab: Pretraining and Finetuning Language Model |  | Experimental |
| 4177 | timvvvht/HKEX-Announcement-Classifier: A project on data exploration, analysis and using a neural network to... |  | Experimental |
| 4178 | dragonnomada/ipn-cic-diplomado-ia-2025: Diploma course in Artificial Intelligence at CIC / IPN |  | Experimental |
| 4179 | HKUNLP/multilingual-transfer: Code for paper "Language Versatilists vs. Specialists: An Empirical... |  | Experimental |
| 4180 | Rin313/StegLLM: Offline LLM text steganography program. |  | Experimental |
| 4181 | Jorffy/NoteMR: [CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with... |  | Experimental |
| 4182 | GPUforLLM/llm-vram-calculator: Accurate VRAM calculator for Local LLMs (Llama 4, DeepSeek V3, Qwen 2.5).... |  | Experimental |
| 4183 | SiemonCha/stock-ai: AI-powered stock prediction with complete MLOps: Deep Learning... |  | Experimental |
| 4184 | NS027/medical_chatbot_project_genAI: Multimodal AI-powered medical assistant with LLMs, speech, and image understanding. |  | Experimental |
| 4185 | DigitalHarborFoundation/FlexEval: FlexEval is an LLM evaluation tool designed for practical quantitative analysis. |  | Experimental |
| 4186 | ilanaliouchouche/KANBert: Implementation of an Encoder only MoE usable as an Embedding Model,... |  | Experimental |
| 4187 | davide-coccomini/Cross-Forgery-Analysis-of-Vision-Transformers-and-CNNs-for-Deepfake-Image-Detection: Code for the paper Cross Forgery Analysis of Vision Transformers and CNNs... |  | Experimental |
| 4188 | RUCKBReasoning/CodeRM: Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of... |  | Experimental |
| 4189 | afspies/attention-tutorial: Jupyter Notebook tutorial on Attention Mechanisms, Position Embeddings and... |  | Experimental |
| 4190 | lakshyaag/Deep-Learning-From-Scratch: Implementing popular deep learning papers in PyTorch |  | Experimental |
| 4191 | bassrehab/credit_risk: Forecast long sequence default/downgrade of corporate entities and financial... |  | Experimental |
| 4192 | capybara-brain346/moe-router: A small Mixture-of-Experts (MoE) Transformer trained from scratch to learn... |  | Experimental |
| 4193 | tensorchord/inference-benchmark: Benchmark for machine learning model online serving (LLM, embedding,... |  | Experimental |
| 4194 | tianzhaotju/LEAM: We propose a novel DL-based mutation technique (LEAM), which adapts the... |  | Experimental |
| 4195 | h3nock/ai-deep-dive: An open-source interactive learning platform for understanding LLMs through... |  | Experimental |
| 4196 | UNITES-Lab/HEXA-MoE: Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE... |  | Experimental |
| 4197 | Yash-Kavaiya/30-Days-LLM-Mastery-Course: 30-Days-LLM-Mastery-Course: A comprehensive, hands-on course diving deep... |  | Experimental |
| 4198 | elixpo/emoji_transnetv1: A Machine Learning Initiative Taken to fine tune MT5_SMALL to contextually... |  | Experimental |
| 4199 | zixi-liu/Transformers-Learning: Stanford CS25 - Transformer United and CS224n learning notes and code dump. |  | Experimental |
| 4200 | juancmacias/Small_Lenguage_Model: Bite-sized training session on SLM (Small Language Model) |  | Experimental |