All Transformer Models

6,429 models ranked by quality score · Page 42 of 65

Showing 4101–4200 of 6,429
# Model Score Tier
4101 taherfattahi/MetaWorld-VLA-openai-clip-vit

A lightweight Vision-Language-Action (VLA) baseline for MetaWorld robot-arm...

18
Experimental
4102 theonesud/embedia

Create LLM-powered webapps with ease

18
Experimental
4103 horenbergerb/llamagotchi

A bunch of LLaMa model investigations, including recreating generative...

18
Experimental
4104 yejoon-lee/kr3

KR3: Korean Restaurant Review with Ratings / Experiments on...

18
Experimental
4105 ponderous-dustiness314/awesome-claude-skills

📚 Discover essential Claude skills for tasks like document editing, data...

18
Experimental
4106 IsaacRodgz/Multimodal-Adapters

Adapter modules with support for multimodal fusion of information (text,...

18
Experimental
4107 itsShnik/adaptively-finetuning-transformers

Adaptively fine tuning transformer based models for multiple domains and...

18
Experimental
4108 Fortyseven/chit-v2

Chit is a lightweight privacy-focused web chat front-end for Ollama...

18
Experimental
4109 lucataco/cog-llama-3-vision-alpha

Cog wrapper for qresearch/llama-3-vision-alpha

18
Experimental
4110 coderonion/awesome-mojo-max-mlir

A collection of some awesome public MAX platform, Mojo programming language...

18
Experimental
4111 alphadl/OOP-eval

The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs

18
Experimental
4112 ai-center-kth/cuBERT-source-code-clustering

Fine-tuning cuBERT embeddings for clustering source code by functionality

18
Experimental
4113 kriskrisliu/PAT

[AAAI 2025] PAT: Pruning-Aware Tuning for Large Language Models

18
Experimental
4114 pr0ximaCent/Langchain-Chat-Driven-Expense-Tracker-

FinChain is an AI-powered, chat-driven expense tracker. Log your expenses in...

18
Experimental
4115 webnizam/alpaca-telegram-bot

Simplest way to host a local ChatGPT like model for Telegram.

18
Experimental
4116 pphuc25/distil-cd

Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive...

18
Experimental
4117 s-omranpour/Shirin-Sokhan

A Persian Poet Transformer! (finetuned GPT2 on Ganjoor data)

18
Experimental
4118 Andras7/gpt2-pytorch

Extremely simple and understandable GPT2 implementation with minor tweaks

18
Experimental
4119 codex-mohan/novaflow

An AI Assistant with Image and Voice Recognition and Modular Node-Based UI

18
Experimental
4120 codiceSpaghetti/numpyGPT

A from-scratch GPT built with NumPy and Python’s standard library. No...

18
Experimental
4121 aditeyabaral/maple

Implementation of the paper, MAPLE - MAsking words to generate blackout...

18
Experimental
4122 hipml/syllabify

a 5M parameter solution to a problem you could solve by counting on your fingers

18
Experimental
4123 rafaelvp-db/db-ancient-code-translation

Simple repo showing code-to-code and code-to-text capabilities using LLMs on...

18
Experimental
4124 yashjakhotiya/Adversarial-Attacks-On-Transformers

Exploring vulnerabilities of Transformers-based Malware Detectors to...

18
Experimental
4125 VoxDroid/llm-wikipedia

A project for fine-tuning large language models (LLMs) on curated Wikipedia...

18
Experimental
4126 VincLee8188/Spatio-temporal-forecasting-PyTorch

Leverage on recent advances in graph convolution and sequence modeling to...

18
Experimental
4127 SharathHebbar/ML-Project-list

List of all ML projects

18
Experimental
4128 instavm/llm-token-visualizer

See How Big Exactly A 128k Token Text Is

18
Experimental
4129 inuwamobarak/nougat

Nougat is a Meta AI's revolutionary OCR model designed to transcribe...

18
Experimental
4130 RubenCasal/owl_vit_detector

NanoOWL Detection System enables real-time open-vocabulary object detection...

18
Experimental
4131 koesan/Manga_Comic_Colorization_and_Translation_v1

AI-powered manga and comic translator using EasyOCR and Hugging Face...

18
Experimental
4132 M4D-MKLab-ITI/Crisis-Event-Detection-in-Short-Texts

implementation of "Leveraging Transformer Self Attention Encoder for Crisis...

18
Experimental
4133 mhajder/llama.cpp-updater

A shell script to automatically update or build llama.cpp with optimal GPU...

18
Experimental
4134 Omid-Nejati/Locality-iN-Locality

Robust Transformer with Locality Inductive Bias and Feature Normalization...

18
Experimental
4135 AdamCoscia/iScore

Upload, score, and visually compare multiple LLM-graded summaries simultaneously!

18
Experimental
4136 didar00/Final-Project

SELFIES-Transformer: Learning the Representation of Chemical Space for...

18
Experimental
4137 Curtis-Wu/Equivariant-Graph-Transformer

A deep neural network with hybrid architecture (EGNN + Transformer) for...

18
Experimental
4138 IbrahimSobh/askpdf

In this tutorial we will see 💡 How to get answers from a PDF file using...

18
Experimental
4139 fshnkarimi/train_scheduling_assistant

This project utilizes a fine-tuned Large Language Model (LLM) to generate...

18
Experimental
4140 IbrahimSobh/askdoc

In this tutorial we will see 💡 How to get answers from documents using...

18
Experimental
4141 PRITHIVSAKTHIUR/Qwen-Image-LoRA-DLC

Qwen-Image model with various LoRA (Low-Rank Adaptation) styles. This tool...

18
Experimental
4142 Josephrp/SmolFactory

finetune gpt-oss and smollm3 on your data easily and cheaply

18
Experimental
4143 DrRuin/Lightweight-Fine-Tuning

Lightweight fine-tuning is one of the most important techniques for adapting...

18
Experimental
4144 Eric2i/LLM-MindMap

EMNLP 2025 - "Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning...

18
Experimental
4145 yuki-2025/llama3-8b-fine-tuning-math

Fine-Tuning Llama 3-8B for Structured Math Reasoning: Fine-tuning Llama3 8b...

18
Experimental
4146 mirzayasirabdullahbaig07/Fine-Tuning-LLaMA-3.2-3B-Using-PEFT-LoRA

This project showcases parameter-efficient fine-tuning of the LLaMA 3.2 (3B)...

18
Experimental
4147 hewei2001/ReachQA

[EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs

18
Experimental
4148 AmoghPradeep/abstractive-text-summarizer

Abstractive text summarization using BART.

18
Experimental
4149 AbdBarho/transformers-stack

A full stack solution for deploying a transformers model from HuggingFace

18
Experimental
4150 ahmed19999520-alt/Veronica-X-Pro-open-source-code-2.0

Advanced AI system with real quantum computing integration, sophisticated...

18
Experimental
4151 ahmedshahriar/restaurant-menu-pricing

Predict menu prices from 5M+ UberEats menus with an end-to-end MLOps...

18
Experimental
4152 KrishnanJothi/MT5_Language_identification_NLP

MT5-small is fine-tuned on the downstream task of Natural Language...

18
Experimental
4153 H0NEYP0T-466/Isabella

⚙️ Isabella – a full-stack 🚀 conversational system built on FastAPI ✨...

18
Experimental
4154 mosh98/MMBT

Multi modal BiTransformer [ Reimplementation ] in Pytorch That Acutally Works !

17
Experimental
4155 harshpimpale/LegalMind

A project that uses Large Language Models (LLMs) to assist users with legal...

17
Experimental
4156 Nikunj2003/Jira-Standup-Report-API

Automate daily scrum reports with AI-powered insights from Jira data

17
Experimental
4157 MIbnEKhalid/ChatAPI

A chatbot built using Node.js, Handlebars, and PostgreSQL, leveraging the...

17
Experimental
4158 longday1102/VietAI-experiment-LLaMA2

⚡ LLaMA-2 model experiment

17
Experimental
4159 nehalvaghasiya/RecipeBot

AI chatbot that provides recipe suggestions and cooking instructions based...

17
Experimental
4160 Andrew2077/Alpaca

Simple Q/A app, where i created a UI for alpaca (fine tuned LLAMA) model...

17
Experimental
4161 visresearch/SDMPrune

The official implementation of "SDMPrune: Self-Distillation MLP Pruning for...

17
Experimental
4162 amanongithub7/classical-music-generation

Comparing LSTM and Transformer-based deep learning approaches for classical...

17
Experimental
4163 NJX-njx/microgpt

🔬 The most atomic GPT-2 implementation in 265 lines of pure Python & CUDA. A...

17
Experimental
4164 danelpeng/Awesome-Continual-Leaning-with-PTMs

This is a curated list of "Continual Learning with Pretrained Models" research.

17
Experimental
4165 Silvestre17/TM_StockTweetSentimentAnalysis_MasterProject

📈 Master's project using NLP (FinTwitBERT, LSTMs) to classify stock market...

17
Experimental
4166 eason69113-source/Chat-HuanHuan

基于 Meta-Llama-3.1-8B-Instruct + 4-bit 量化 + QLoRA,训练与推理全程显存占用 < 9 GB,RTX...

17
Experimental
4167 hululuzhu/llama-lora-chinese-couplet

llama-lora e2e example to demo a Chinese Couplet AI in 10 mins. some...

17
Experimental
4168 zhengyima/knowqa

预训练模型知识量度量竞赛 Baseline F1 0.35 BERTForMaskedLM

17
Experimental
4169 LlamaGenAI/llamagenai-openapi

LlamaGen.Ai REST API, LlamaGen is AI Comic Factory - Generate Comics with...

17
Experimental
4170 1tangerine1day/chinese-QA-chatbot

A simple chinese QA chatbot implement with pytorch and transformer trained...

17
Experimental
4171 guanlisheng/synochatgpt

Synology Chat + Ollama + Chatgpt => synochatgpt

17
Experimental
4172 lenticularis39/llama2.inferno

Inference Llama 2 in one file of pure Limbo

17
Experimental
4173 styfeng/SMERTI

Code for SMERTI for Semantic Text Exchange.

17
Experimental
4174 ia-labo/French-News-Clustering

Text classification and clustering using transformers and Denstream.

17
Experimental
4175 Zoclee/xojo-llama

A wrapper module to do local LLM inference on GGUF models using the...

17
Experimental
4176 Ebimsv/LLM-Lab

Pretraining and Finetuning Language Model

17
Experimental
4177 timvvvht/HKEX-Announcement-Classifier

A project on data exploration, analysis and using a neural network to...

17
Experimental
4178 dragonnomada/ipn-cic-diplomado-ia-2025

Diplomado en Inteligencia Artificial del CIC / IPN

17
Experimental
4179 HKUNLP/multilingual-transfer

Code for paper ”Language Versatilists vs. Specialists: An Empirical...

17
Experimental
4180 Rin313/StegLLM

离线的LLM文本隐写程序。Offline LLM text steganography program.

17
Experimental
4181 Jorffy/NoteMR

[CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with...

17
Experimental
4182 GPUforLLM/llm-vram-calculator

Accurate VRAM calculator for Local LLMs (Llama 4, DeepSeek V3, Qwen 2.5)....

17
Experimental
4183 SiemonCha/stock-ai

AI-powered stock prediction with complete MLOps: Deep Learning...

17
Experimental
4184 NS027/medical_chatbot_project_genAI

Multimodal AI-powered medical assistant with LLMs, speech, and image understanding.

17
Experimental
4185 DigitalHarborFoundation/FlexEval

FlexEval is an LLM evaluation tool designed for practical quantitative analysis.

17
Experimental
4186 ilanaliouchouche/KANBert

Implementation of an Encoder only MoE usable as an Embedding Model,...

17
Experimental
4187 davide-coccomini/Cross-Forgery-Analysis-of-Vision-Transformers-and-CNNs-for-Deepfake-Image-Detection

Code for the paper Cross Forgery Analysis of Vision Transformers and CNNs...

17
Experimental
4188 RUCKBReasoning/CodeRM

Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of...

17
Experimental
4189 afspies/attention-tutorial

Jupyter Notebook tutorial on Attention Mechanisms, Position Embeddings and...

17
Experimental
4190 lakshyaag/Deep-Learning-From-Scratch

Implementing popular deep learning papers in PyTorch

17
Experimental
4191 bassrehab/credit_risk

Forecast long sequence default/downgrade of corporate entities and financial...

17
Experimental
4192 capybara-brain346/moe-router

A small Mixture-of-Experts (MoE) Transformer trained from scratch to learn...

17
Experimental
4193 tensorchord/inference-benchmark

Benchmark for machine learning model online serving (LLM, embedding,...

17
Experimental
4194 tianzhaotju/LEAM

We propose a novel DL-based mutation technique (LEAM), which adapts the...

17
Experimental
4195 h3nock/ai-deep-dive

An open-source interactive learning platform for understanding LLMs through...

17
Experimental
4196 UNITES-Lab/HEXA-MoE

Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE...

17
Experimental
4197 Yash-Kavaiya/30-Days-LLM-Mastery-Course

30-Days-LLM-Mastery-Course: A comprehensive, hands-on course diving deep...

17
Experimental
4198 elixpo/emoji_transnetv1

A Machine Learning Initiative Taken to fine tune MT5_SMALL to contextually...

17
Experimental
4199 zixi-liu/Transformers-Learning

Stanford CS25 - Transformer United and CS224n learning notes and code dump.

17
Experimental
4200 juancmacias/Small_Lenguage_Model

Píldora formativa sobre SLM (Small Lenguage Model)

17
Experimental
« Prev 1 2 3 40 41 42 43 44 63 64 65 Next »