All Transformer Models
6,429 models ranked by quality score · Page 38 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 3701 |
OpenNLG/OpenBA-v2
OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing... |
|
Experimental |
| 3702 |
CameLLM/CameLLM
Run your favourite LLMs locally on macOS from Swift |
|
Experimental |
| 3703 |
hitz-zentroa/This-is-not-a-Dataset
We introduce a large semi-automatically generated dataset of ~400,000... |
|
Experimental |
| 3704 |
LookUpMark/dylem-grid
DYLEM-GRID is a deep learning project for dynamic hand gesture recognition... |
|
Experimental |
| 3705 |
yyy01/PAC
The official implementation of the paper "Data Contamination Calibration for... |
|
Experimental |
| 3706 |
thevasudevgupta/transformers-adapters
This repositary hosts my experiments for the project, I did with OffNote Labs. |
|
Experimental |
| 3707 |
TRISTAN-ORF/RiboTIE
Scripts and instructions to apply RiboTIE on Ribo-seq data |
|
Experimental |
| 3708 |
gersongerardcruz/extractive_and_abstractive_text_summarization
A combination of extractive and abstractive text summarization for... |
|
Experimental |
| 3709 |
UCSC-VLAA/Sight-Beyond-Text
[TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal... |
|
Experimental |
| 3710 |
DanHrmti/SenTransformer-VAE-pytorch
Sentence VAE using the Transformer encoder-decoder architecture. |
|
Experimental |
| 3711 |
AntonioVFranco/elamonica
Production-ready test-time compute optimization framework for LLM inference.... |
|
Experimental |
| 3712 |
dunktra/attention-binding-a11y
Code for tracking concept emergence via attention-head binding (EB*). Pythia... |
|
Experimental |
| 3713 |
chizkidd/fastai
Implementation of fast.ai deep learning courses: "Practical Deep Learning... |
|
Experimental |
| 3714 |
plutonium-239/memsave_torch
Lowering PyTorch's Memory Consumption for Selective Differentiation |
|
Experimental |
| 3715 |
CanvaChen/chinese-llama-tokenizer
目标:构建一个更符合语言学的小而美的 llama 分词器,支持中英日三国语言 |
|
Experimental |
| 3716 |
vbuyel/PointerLM-AI-File-Assistant
Your helpful file and web chatbot assistant. Made with DDD architecture |
|
Experimental |
| 3717 |
navamai/navamai
Use NavamAI to supercharge your productivity and workflow with personal,... |
|
Experimental |
| 3718 |
januverma/transformers-for-sequential-recommendation
Notebooks on using transformers for sequential recommendation tasks |
|
Experimental |
| 3719 |
Traffic-Alpha/VLMLight
Official implementation of VLMLight |
|
Experimental |
| 3720 |
StarLight1212/LLM-and-Generative-Models-Community
AI Community Tutorial, including: LoRA/Qlora LLM fine-tuning, Training GPT-2... |
|
Experimental |
| 3721 |
DDDOH/LLM_News
LOLA_ LLM-Assisted Online Learning Algorithm for Content Experiments |
|
Experimental |
| 3722 |
noah-hein/mazeGPT
AI model for making mazes that extends OpenAIs GPT2 model |
|
Experimental |
| 3723 |
llap4585/T5-Refiner-DomainFocus-TrainOnly
This project provides code for fine-tuning T5/mT5 models on data... |
|
Experimental |
| 3724 |
Orfeous/llamacpp.net
C#/.NET binding of llama.cpp |
|
Experimental |
| 3725 |
waybarrios/dgx-spark-finetune-llm
LLM fine-tuning with LoRA + NVFP4/MXFP8 on NVIDIA DGX Spark (Blackwell GB10) |
|
Experimental |
| 3726 |
LSquaredM/mutual_info_scaling_law
(NeurIPS 2025) Official Code for L²M: Mutual Information Scaling Law for... |
|
Experimental |
| 3727 |
illoonego/gemma-finetune-emails
LoRA fine-tuning pipeline for Google’s Gemma-2B language model to classify... |
|
Experimental |
| 3728 |
sandeeppanem/qwen3-resume-extraction
Fine-tune Qwen3-0.6B for resume parsing using LoRA |
|
Experimental |
| 3729 |
mamounyosef/commit-message-llm
Fine-tuning Qwen2.5-Coder-0.5B LLM using QLoRA (4-bit quantization + LoRA)... |
|
Experimental |
| 3730 |
mazurkin/ptn
train own virtual "PTN" LLM model |
|
Experimental |
| 3731 |
jdleo/tinysafe-2
141M param safety model (not much better than v1, but a great learning) |
|
Experimental |
| 3732 |
GGBond2424648901/transformers-29-tasks
🎨 Transformers实战训练项目 -... |
|
Experimental |
| 3733 |
tobifinn/ensemble_transformer
Official PyTorch implementation of "Self-Attentive Ensemble Transformer:... |
|
Experimental |
| 3734 |
gsarti/pecore
Materials for "Quantifying the Plausibility of Context Reliance in Neural... |
|
Experimental |
| 3735 |
JamesVorder/python-tddpp
This LLM generates code based on tests, and makes sure they pass. |
|
Experimental |
| 3736 |
twitter-research/lmsoc
Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining |
|
Experimental |
| 3737 |
leonjovanovic/keywords-extraction
Keyword extraction using Scake, KeyBERT, Fine-tuning Transformer BERT-like... |
|
Experimental |
| 3738 |
AlexIoannides/llm-regression
Exploring the classical regression capabilities of LLMs. |
|
Experimental |
| 3739 |
rachel-pai/T5Elasticsearch
Elasticsearch with T5/Bert/Other models provided by huggingface Transfomers. |
|
Experimental |
| 3740 |
MitulNakrani003/AI-Enhanced-IR-System
AI-enhanced search pipeline using hybrid retrieval + transformer models for... |
|
Experimental |
| 3741 |
MysterionRise/transformers-nlp-suite
Enterprise NLP Platform - Production REST API with auth, rate limiting,... |
|
Experimental |
| 3742 |
ThaminduR/mt5-simplification
Scripts related to training and predicting Google's mt5 model |
|
Experimental |
| 3743 |
isaacus-dev/terge
An easy-to-use Python library for merging PyTorch models. |
|
Experimental |
| 3744 |
fbotathome/butia_speech
This package provides some tools to make the robot DoRIS speak and listen.... |
|
Experimental |
| 3745 |
pszemraj/decoder-pytorch-template
Hackable PyTorch template for decoder-only transformer architecture... |
|
Experimental |
| 3746 |
Sarah111-AHM/ZakeyTeam-arabic-qa-system-arabert
an AI powered Arabic Question Answering system built by fine tuning the... |
|
Experimental |
| 3747 |
oooranz/Baby-CoThought
🍼 Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models... |
|
Experimental |
| 3748 |
TheDarkchip/nfp
Lean 4 library + CLI for rigorous bounds in transformer computations... |
|
Experimental |
| 3749 |
Praful932/llmsearch
Find better generation parameters for your LLM |
|
Experimental |
| 3750 |
abhilashpuli98/Deep-Learning-Paper-Implementations
A collection of paper implementations using the PyTorch framework |
|
Experimental |
| 3751 |
mohsenMahmoodzadeh/image-and-text-classifier
Deep learning models(CNN, LSTM, BERT) for image and text classification task... |
|
Experimental |
| 3752 |
loryanstrant/ha-transformers-theme
A Transformers theme for Home Assistant |
|
Experimental |
| 3753 |
cronenberg64/SciBERT-CTFT
SciBERT-based scientific abstract classification using SetFit framework with... |
|
Experimental |
| 3754 |
givkashi/Awesome-unet-like-transformers
Awesome UNet with Transformer |
|
Experimental |
| 3755 |
JuliusScheuerer/nlp-job-classifier
Text classification with fine-tuned DistilBERT — FastAPI + Streamlit |
|
Experimental |
| 3756 |
jankstar/pydocu
fastapi server for classification of documents and extraction of data |
|
Experimental |
| 3757 |
atomlayer/llama_cute_voice_assistant
Llama cute voice assistant |
|
Experimental |
| 3758 |
eigencore/Tlama_124M
Tlama (124M) is a language model based on LlaMa3 (127M) optimized by... |
|
Experimental |
| 3759 |
ImMohammadHosseini/MKP-RL
:sparkles: Solve multi_dimensional multiple knapsack problem using... |
|
Experimental |
| 3760 |
bagh2178/GC-VLN
[CoRL 2025] GC-VLN: Instruction as Graph Constraints for Training-free... |
|
Experimental |
| 3761 |
HROlive/Advanced-Deep-Learning-with-Transformers
Workshop that will take you from Graph Neural Networks (GNNs) to... |
|
Experimental |
| 3762 |
lrusso/LlamaWebServer
Web server implementation of Llama |
|
Experimental |
| 3763 |
januverma/transformers-stuff
Codes, scripts, and notebooks on various aspects of transformer models. |
|
Experimental |
| 3764 |
sreekarvamsi/vehiclebert
Domain-Specific NLP for Automotive Entities |
|
Experimental |
| 3765 |
devsynck/llama-panel
A web-based control panel for managing and monitoring local llama.cpp server... |
|
Experimental |
| 3766 |
Timothy-Logan/Sentiment-Analyzer
Text sentiment analysis CLI tool using pre-trained transformers. Classify... |
|
Experimental |
| 3767 |
daedalus/distiller
Model distiller automator |
|
Experimental |
| 3768 |
newfull5/NLLB-200-Distilled-350M-en-ko
nllb-200 distilled 350M for English to Korean translation |
|
Experimental |
| 3769 |
Hyun-Ryu/clover
Official code for "Divide and Translate: Compositional First-Order Logic... |
|
Experimental |
| 3770 |
othmanelhoufi/LM-for-FactChecking
An automated solution for fact-checking using available claims and fake-news... |
|
Experimental |
| 3771 |
James-Crockett/Support-Ticket-Auto-Triage
An automated ticket classification system using NLP. Compares traditional... |
|
Experimental |
| 3772 |
alipay/fin_domain_llm
Implementation of the paper: WeaverBird: Empowering Financial... |
|
Experimental |
| 3773 |
Arlchoose-code/Indonesian-LLM-Starter
A starter kit for building your own Indonesian Large Language Model (LLM)... |
|
Experimental |
| 3774 |
svn05/vietnamese-nmt
Vietnamese-English-Japanese NMT with fine-tuned NLLB-200, beam search, and... |
|
Experimental |
| 3775 |
MusfiqDehan/Llama2-Finetuned-for-Translation
Fine-Tuned Llama-2 For Machine Translation |
|
Experimental |
| 3776 |
rivas-lab/Smiles2Dock
Smiles2Dock: an open large-scale multi-task dataset for ML-based molecular... |
|
Experimental |
| 3777 |
isaaccorley/segmenter-pytorch
PyTorch implementation of "Segmenter: Transformer for Semantic Segmentation"... |
|
Experimental |
| 3778 |
cui-shaobo/causal-strength
evaluating the causal strength between cause and effect |
|
Experimental |
| 3779 |
ZifanL/TSDS
Implementation of TSDS: Data Selection for Task-Specific Model Finetuning.... |
|
Experimental |
| 3780 |
Adam-maz/GenAI-assisted-tool-for-Virtual-Screening
This repository introduces a proof-of-concept toolkit for generative virtual... |
|
Experimental |
| 3781 |
bandirevanth/Verbix
AI Grammar Scoring Engine with Evaluation & Feedback |
|
Experimental |
| 3782 |
kozodoi/Text_Readability_Prediction
Predicting text reading complexity with transformers (top-9% Kaggle solution... |
|
Experimental |
| 3783 |
nishantb06/smolLM
Reverse Engineering SmolLM2 model and training it from scratch |
|
Experimental |
| 3784 |
johnbrodowski/DuckAiChatAPI
A .NET console application for chatting with DuckDuckGo's AI Chat service... |
|
Experimental |
| 3785 |
csm9493/efficient-llm-unlearning
Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs (ICLR 2025) |
|
Experimental |
| 3786 |
CeMOS-IS/GenFormer
[ICPR 2024] Official repository of the paper "GenFormer - Generated Images... |
|
Experimental |
| 3787 |
snexus/nlp-question-answering-system
Question answering system with transformers |
|
Experimental |
| 3788 |
khairulislam/Timeseries-Explained
Interpreting Deep Learning timeseries models using Local Interpretation methods |
|
Experimental |
| 3789 |
aswinvinodd/emotion-detection-system
AI-based Emotion Detection and Sentiment Analysis System using NLP and Streamlit |
|
Experimental |
| 3790 |
Manohara-Ai/Reinforcement_Learning_Framework_to_Prevent_Jailbreaks
A reinforcement learning-based system designed to detect and prevent... |
|
Experimental |
| 3791 |
Ankit6174/Domain-Specific-Mini-LLM-for-Genomic-Mutation-Prediction
An encoder only transformer model that accurately predicts the type, genomic... |
|
Experimental |
| 3792 |
skyline-GTRr32/OKI-TRACE
OKI TRACE: Local LLM observability. See step-by-step, layer-by-layer what... |
|
Experimental |
| 3793 |
AbineshSivakumar/Llama-2-7B-QLoRA-Vicuna
This repository contains code to fine-tune a Llama-7B-Uncensored model using... |
|
Experimental |
| 3794 |
YASSER-27/LLMs
A high-performance, cross-platform desktop application for chatting with... |
|
Experimental |
| 3795 |
kyegomez/MLXTransformer
Simple Implementation of a Transformer in the new framework MLX by Apple |
|
Experimental |
| 3796 |
himanshu231204/hk-devbrain
HK-DevBrain is a lightweight AI developer assistant built on Llama 3.2 (3B)... |
|
Experimental |
| 3797 |
IsmaelMousa/TTL
Full-stack simulator for a todo task list application using FastAPI, I built... |
|
Experimental |
| 3798 |
mytechnotalent/mechanistic_interpretability
Mechanistic Interpretability (MI) is a subfield of AI alignment and safety... |
|
Experimental |
| 3799 |
li-plus/nanoRLHF
Train a tiny LLaMA model from scratch to repeat your words using... |
|
Experimental |
| 3800 |
theanasuddin/Advanced-Deep-Learning
Computer exercises for Advanced Deep Learning. Includes implementations of... |
|
Experimental |