All Transformer Models
6,429 models ranked by quality score · Page 20 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 1901 | duyhominhnguyen/Exgra-Med - [NeurIPS 2025] ExGra-Med: Medical Multi-Modal LLM with Extended Context Alignment | | Emerging |
| 1902 | BIDS-Xu-Lab/Me-LLaMA - A novel medical large language model family with 13/70B parameters, which... | | Emerging |
| 1903 | asahi417/lm-vocab-trimmer - Vocabulary Trimming (VT) is a model compression technique, which reduces a... | | Emerging |
| 1904 | ai-glimpse/toyllm - ToyLLM: Learning LLM from Scratch | | Emerging |
| 1905 | sajjjadayobi/ParsBigBird - Persian Bert For Long-Range Sequences | | Emerging |
| 1906 | zake7749/Kyara - [Kaggle-2nd] Lightweight yet Effective Chinese LLM. | | Emerging |
| 1907 | Nondzu/LlamaTor - LlamaTor: Decentralized AI model sharing via BitTorrent for efficient,... | | Emerging |
| 1908 | akanyaani/miniLLAMA - A simplified LLAMA implementation for training and inference tasks. | | Emerging |
| 1909 | kyegomez/MM1 - PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from... | | Emerging |
| 1910 | yuecao0119/MMFuser - The official implementation of the paper "MMFuser: Multimodal Multi-Layer... | | Emerging |
| 1911 | adithya-s-k/CompanionLLM - CompanionLLM - A framework to finetune LLMs to be your own sentient... | | Emerging |
| 1912 | benitomartin/food-images-finetuning - Fine-tuning of LiquidAI LFM2-VL vision-language models on food image... | | Emerging |
| 1913 | rkinas/reasoning_models_how_to - This repository serves as a collection of research notes and resources on... | | Emerging |
| 1914 | horseee/LLaMA-Pruning - Structural Pruning for LLaMA | | Emerging |
| 1915 | poloclub/tsr-convstem - High-Performance Transformers for Table Structure Recognition Need Early Convolutions | | Emerging |
| 1916 | AlexandrosChrtn/llama-fine-tune-guide - Fine-tune the newly released Llama-3.2 lightweight models. | | Emerging |
| 1917 | olaflaitinen/llm-proteomics-hallucination - Systematic evaluation of hallucination risks in Large Language Models... | | Emerging |
| 1918 | cokeshao/HoliTom - [NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models | | Emerging |
| 1919 | dhpollack/huggingface_libtorch - Minimal example of using a traced huggingface transformers model with libtorch | | Emerging |
| 1920 | sytelus/nanuGPT - Simple, reliable and well tested training code for quick experiments with... | | Emerging |
| 1921 | YunzeMan/Lexicon3D - [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D... | | Emerging |
| 1922 | Beomi/KcBERT-Finetune - KcBERT/KcELECTRA Fine Tune Benchmarks code (forked from... | | Emerging |
| 1923 | pat-jj/KG-FIT - [NeurIPS'24] Knowledge Graph Fine-Tuning using LLMs | | Emerging |
| 1924 | casinca/LLM-quest - Verbose implementations of LLMs architectures, techniques and research... | | Emerging |
| 1925 | UCDvision/NOLA - Code for NOLA, an implementation of "nola: Compressing LoRA using Linear... | | Emerging |
| 1926 | RhinoDevel/mt_llm - Pure C wrapper library to use llama.cpp with Linux and Windows as simple as... | | Emerging |
| 1927 | xuanlinli17/large_vlm_distillation_ood - Distilling Large Vision-Language Model with Out-of-Distribution... | | Emerging |
| 1928 | psmarter/mini-infer - A high-performance LLM inference engine with PagedAttention \|... | | Emerging |
| 1929 | bloomberg/minilmv2.bb - Our open source implementation of MiniLMv2... | | Emerging |
| 1930 | Yog-Sotho/LLM-fine-tuner - Powerful no-code LLM fine-tuner: upload data → train → deploy in minutes.... | | Emerging |
| 1931 | muhtalhakhan/Hacktoberfest2024 - Hacktoberfest 2024 OPEN FIRST Pull Request | | Emerging |
| 1932 | SuperBianC/scMulan - Repository for paper scMulan: a multitask generative pre-trained language... | | Emerging |
| 1933 | mcbal/deep-implicit-attention - Implementation of deep implicit attention in PyTorch | | Emerging |
| 1934 | BauplanLabs/Making-Databases-Faster-with-LLM-Evolutionary-Sampling - Repository hosting code to reproduce our paper (with Stanford and... | | Emerging |
| 1935 | luciusssss/ZhuangBench - [ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly | | Emerging |
| 1936 | microsoft/MMLU-CF - A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025] | | Emerging |
| 1937 | Anjum48/commonlitreadabilityprize - 4th Place solution for the Kaggle CommonLit Readability Prize | | Emerging |
| 1938 | Scicrop/llm-vision-basics - Educational notebooks that demystify Large Language Models and Computer... | | Emerging |
| 1939 | lennartpollvogt/ollama-instructor - Python library for the instruction and reliable validation of structured... | | Emerging |
| 1940 | r1cc4rd0m4zz4/traNsLatorLaB - translatorlab: a machine translation tool that uses artificial intelligence... | | Emerging |
| 1941 | HKUDS/RecLM - [ACL2025] "RecLM: Recommendation Instruction Tuning" | | Emerging |
| 1942 | anyscale/llm-router - Tutorial for building LLM router | | Emerging |
| 1943 | monologg/korean-hate-speech-koelectra - Bias, Hate classification with KoELECTRA | | Emerging |
| 1944 | darkwebdesign/symfony-addon-pack - Symfony Add-on Pack | | Emerging |
| 1945 | hollobit/GenAI_LLM_timeline - ChatGPT, GenerativeAI and LLMs Timeline | | Emerging |
| 1946 | deep-div/Custom-Transformer-Pytorch - A clean, ground-up implementation of the Transformer architecture in... | | Emerging |
| 1947 | WooooDyy/BAPO - Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for... | | Emerging |
| 1948 | babycommando/neuralgraffiti - Live-bending a foundation model's output at neural network level. | | Emerging |
| 1949 | yaojin17/Unlearning_LLM - [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large... | | Emerging |
| 1950 | kreasof-ai/OpenFormer - A hackable library for running and fine-tuning modern transformer models on... | | Emerging |
| 1951 | nightdessert/Retrieval_Head - open-source code for paper: Retrieval Head Mechanistically Explains... | | Emerging |
| 1952 | ZigeW/data_management_LLM - Collection of training data management explorations for large language models | | Emerging |
| 1953 | Orlando-CS/Awesome-VLA - Latest advancements in VLA models (Vision Language Action) | | Emerging |
| 1954 | UBC-NLP/marbert - UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic | | Emerging |
| 1955 | ArchAIve-Project/Backend - A complex Flask API system empowered by custom ML models, LLMs and... | | Emerging |
| 1956 | josStorer/llama.cpp-unicode-windows - llama.cpp with unicode (windows) support | | Emerging |
| 1957 | Sahaj33-op/StudySage-Offline-Online-AI-Note-Assistant - StudySage: An offline, AI-powered note assistant that helps students... | | Emerging |
| 1958 | guoriyue/LangCommand - LangCommand is a local inference command-line tool that transforms natural... | | Emerging |
| 1959 | sisinflab/Ducho - Ducho is a Python framework aimed to extract multimodal features used in... | | Emerging |
| 1960 | kyegomez/VortexFusion - Transformers + Mambas + LSTMS All in One Model | | Emerging |
| 1961 | innightwolfsleep/old_llm_telegram_bot - Connect llama-cpp, transformers or text-generation-webui to telegram bot api. | | Emerging |
| 1962 | tanishqgautam/Image-Captioning - Implemented 3 different architectures to tackle the Image Caption problem,... | | Emerging |
| 1963 | SORRY-Bench/sorry-bench - Benchmark evaluation code for "SORRY-Bench: Systematically Evaluating Large... | | Emerging |
| 1964 | allenai/x-lxmert - PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer... | | Emerging |
| 1965 | Wang-ML-Lab/multimodal-needle-in-a-haystack - [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking... | | Emerging |
| 1966 | levashi/reprobe - Phase-aware LLM activation steering and linear probing. A memory-efficient,... | | Emerging |
| 1967 | adapter-hub/efficient-task-transfer - Research code for "What to Pre-Train on? Efficient Intermediate Task... | | Emerging |
| 1968 | xinyanghuang7/Basic-Visual-Language-Model - Build a simple basic multimodal large model from scratch. | | Emerging |
| 1969 | Sunona-AI-labs/sunona - Sunona: Next-generation voice AI infrastructure. Orchestrate intelligent,... | | Emerging |
| 1970 | Shannon-Labs/shannon-control-unit - Shannon Control Unit: Adaptive regularization via control theory for LLM training | | Emerging |
| 1971 | YuanGongND/ltu - Code, Dataset, and Pretrained Models for Audio and Speech Large Language... | | Emerging |
| 1972 | sail-sg/dice - Official implementation of Bootstrapping Language Models via DPO Implicit Rewards | | Emerging |
| 1973 | abaheti95/LoL-RL - Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving... | | Emerging |
| 1974 | urmzd/md-classifier - A deep learning system combining transformers and CNNs to classify diseases... | | Emerging |
| 1975 | augustwester/transformer-xl - A lightweight PyTorch implementation of the Transformer-XL architecture... | | Emerging |
| 1976 | alantess/gtrxl-torch - Gated Transformer Model for Computer Vision | | Emerging |
| 1977 | bipinKrishnan/ml-recipe-book - A book containing step by step instructions to train deep learning models... | | Emerging |
| 1978 | kyegomez/SSM-As-VLM-Bridge - An exploration into leveraging SSM's as Bridge/Adapter Layers for VLM | | Emerging |
| 1979 | mkofinas/neural-graphs - Official source code for "Graph Neural Networks for Learning Equivariant... | | Emerging |
| 1980 | YJiangcm/LTE - [ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing | | Emerging |
| 1981 | tanulsingh/Humour.ai-Language-model-that-can-crack-Jokes - Language Model that makes you Laugh. | | Emerging |
| 1982 | nolancacheux/advanced-machine-learning-implementations - Comprehensive machine learning implementations covering neural networks,... | | Emerging |
| 1983 | uiuctml/Localize-and-Stitch - Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic | | Emerging |
| 1984 | AlenVelocity/langchain-llama - Run LLAMA LLMs in Node with Langchain | | Emerging |
| 1985 | zatevakhin/obsidian-local-llm - Obsidian Local LLM is a plugin for Obsidian that provides access to a... | | Emerging |
| 1986 | hanouticelina/deformable-DETR - Implementation of the paper: Deformable DETR: Deformable Transformers for... | | Emerging |
| 1987 | sanjibnarzary/awesome-llm - Curated list of open source and openly accessible large language models | | Emerging |
| 1988 | Jackksonns/CoVALend - CoVALend: a compliance-aware micro-lending default prediction pipeline with... | | Emerging |
| 1989 | kyegomez/AudioMamba - Implementation of the paper: "Audio Mamba: Bidirectional State Space Model... | | Emerging |
| 1990 | avsrma/LLM-based-AI-Assistant - A general purpose AI voice assistant built using GPT-4. | | Emerging |
| 1991 | martin-wey/CodeUltraFeedback - CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025) | | Emerging |
| 1992 | ROIM1998/APT - [ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models... | | Emerging |
| 1993 | vardhin/Humanizer - AI text humanization tool with detection capabilities. Transform... | | Emerging |
| 1994 | Justus0405/LLM-Bot - A Discord chatbot compatible with OpenAI, Ollama, and Llama.cpp | | Emerging |
| 1995 | peacelwh/VT-FSL - [NeurIPS 2025] VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning | | Emerging |
| 1996 | shoppollama/shoppollama - Open Source Agentic Commerce Platform built on Ollama and Stripe - Run... | | Emerging |
| 1997 | KishanBagaria/OCLB - One Click Llama Button for DeviantArt.com | | Emerging |
| 1998 | AdamCoscia/KnowledgeVIS - Visually compare fill-in-the-blank LLM prompts to uncover learned biases and... | | Emerging |
| 1999 | lxe/llavavision - A simple "Be My Eyes" web app with a llama.cpp/llava backend | | Emerging |
| 2000 | Md-Emon-Hasan/InformaTruth - Fine-tuned roberta-base classifier on the LIAR dataset. Accepts multiple... | | Emerging |