All Transformer Models

6,429 models ranked by quality score · Page 20 of 65

Showing 1901–2000 of 6,429
# Model Score Tier
1901 duyhominhnguyen/Exgra-Med

[NeurIPS 2025] ExGra-Med: Medical Multi-Modal LLM with Extended Context Alignment

33
Emerging
1902 BIDS-Xu-Lab/Me-LLaMA

A novel medical large language model family with 13/70B parameters, which...

33
Emerging
1903 asahi417/lm-vocab-trimmer

Vocabulary Trimming (VT) is a model compression technique, which reduces a...

33
Emerging
1904 ai-glimpse/toyllm

ToyLLM: Learning LLM from Scratch

33
Emerging
1905 sajjjadayobi/ParsBigBird

Persian Bert For Long-Range Sequences

33
Emerging
1906 zake7749/Kyara

[Kaggle-2nd] Lightweight yet Effective Chinese LLM.

33
Emerging
1907 Nondzu/LlamaTor

LlamaTor: Decentralized AI model sharing via BitTorrent for efficient,...

33
Emerging
1908 akanyaani/miniLLAMA

A simplified LLAMA implementation for training and inference tasks.

33
Emerging
1909 kyegomez/MM1

PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from...

33
Emerging
1910 yuecao0119/MMFuser

The official implementation of the paper "MMFuser: Multimodal Multi-Layer...

33
Emerging
1911 adithya-s-k/CompanionLLM

CompanionLLM - A framework to finetune LLMs to be your own sentient...

33
Emerging
1912 benitomartin/food-images-finetuning

Fine-tuning of LiquidAI LFM2-VL vision-language models on food image...

33
Emerging
1913 rkinas/reasoning_models_how_to

This repository serves as a collection of research notes and resources on...

33
Emerging
1914 horseee/LLaMA-Pruning

Structural Pruning for LLaMA

33
Emerging
1915 poloclub/tsr-convstem

High-Performance Transformers for Table Structure Recognition Need Early Convolutions

33
Emerging
1916 AlexandrosChrtn/llama-fine-tune-guide

Fine-tune the newly released Llama-3.2 lightweight models.

33
Emerging
1917 olaflaitinen/llm-proteomics-hallucination

Systematic evaluation of hallucination risks in Large Language Models...

33
Emerging
1918 cokeshao/HoliTom

[NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models

33
Emerging
1919 dhpollack/huggingface_libtorch

Minimal example of using a traced huggingface transformers model with libtorch

33
Emerging
1920 sytelus/nanuGPT

Simple, reliable and well tested training code for quick experiments with...

33
Emerging
1921 YunzeMan/Lexicon3D

[NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D...

33
Emerging
1922 Beomi/KcBERT-Finetune

KcBERT/KcELECTRA Fine Tune Benchmarks code (forked from...

33
Emerging
1923 pat-jj/KG-FIT

[NeurIPS'24] Knowledge Graph Fine-Tuning using LLMs

33
Emerging
1924 casinca/LLM-quest

Verbose implementations of LLMs architectures, techniques and research...

33
Emerging
1925 UCDvision/NOLA

Code for NOLA, an implementation of "nola: Compressing LoRA using Linear...

33
Emerging
1926 RhinoDevel/mt_llm

Pure C wrapper library to use llama.cpp with Linux and Windows as simple as...

33
Emerging
1927 xuanlinli17/large_vlm_distillation_ood

Distilling Large Vision-Language Model with Out-of-Distribution...

33
Emerging
1928 psmarter/mini-infer

A high-performance LLM inference engine with PagedAttention |...

33
Emerging
1929 bloomberg/minilmv2.bb

Our open source implementation of MiniLMv2...

33
Emerging
1930 Yog-Sotho/LLM-fine-tuner

Powerful no-code LLM fine-tuner: upload data โ†’ train โ†’ deploy in minutes....

33
Emerging
1931 muhtalhakhan/Hacktoberfest2024

Hacktoberfest 2024 ๐Ÿง‘๐Ÿปโ€๐Ÿ’ป OPEN FIRST Pull Request ๐ŸŽ‰

33
Emerging
1932 SuperBianC/scMulan

Repository for paper scMulan: a multitask generative pre-trained language...

33
Emerging
1933 mcbal/deep-implicit-attention

Implementation of deep implicit attention in PyTorch

33
Emerging
1934 BauplanLabs/Making-Databases-Faster-with-LLM-Evolutionary-Sampling

Repository hosting code to reproduce our paper (with Stanford and...

33
Emerging
1935 luciusssss/ZhuangBench

[ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly

33
Emerging
1936 microsoft/MMLU-CF

A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]

33
Emerging
1937 Anjum48/commonlitreadabilityprize

4th Place solution for the Kaggle CommonLit Readability Prize

33
Emerging
1938 Scicrop/llm-vision-basics

Educational notebooks that demystify Large Language Models and Computer...

33
Emerging
1939 lennartpollvogt/ollama-instructor

Python library for the instruction and reliable validation of structured...

33
Emerging
1940 r1cc4rd0m4zz4/traNsLatorLaB

translatorlab: a machine translation tool that uses artificial intelligence...

33
Emerging
1941 HKUDS/RecLM

[ACL2025] "RecLM: Recommendation Instruction Tuning"

33
Emerging
1942 anyscale/llm-router

Tutorial for building LLM router

33
Emerging
1943 monologg/korean-hate-speech-koelectra

Bias, Hate classification with KoELECTRA ๐Ÿ‘ฟ

33
Emerging
1944 darkwebdesign/symfony-addon-pack

Symfony Add-on Pack

33
Emerging
1945 hollobit/GenAI_LLM_timeline

ChatGPT, GenerativeAI and LLMs Timeline

33
Emerging
1946 deep-div/Custom-Transformer-Pytorch

A clean, ground-up implementation of the Transformer architecture in...

33
Emerging
1947 WooooDyy/BAPO

Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for...

33
Emerging
1948 babycommando/neuralgraffiti

Live-bending a foundation modelโ€™s output at neural network level.

33
Emerging
1949 yaojin17/Unlearning_LLM

[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large...

33
Emerging
1950 kreasof-ai/OpenFormer

A hackable library for running and fine-tuning modern transformer models on...

33
Emerging
1951 nightdessert/Retrieval_Head

open-source code for paper: Retrieval Head Mechanistically Explains...

33
Emerging
1952 ZigeW/data_management_LLM

Collection of training data management explorations for large language models

33
Emerging
1953 Orlando-CS/Awesome-VLA

โœจโœจlatest advancements in VLA models(VIsion Language Action)

33
Emerging
1954 UBC-NLP/marbert

UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic

33
Emerging
1955 ArchAIve-Project/Backend

A complex Flask API system empowered by custom ML models, LLMs and...

33
Emerging
1956 josStorer/llama.cpp-unicode-windows

llama.cpp with unicode (windows) support

33
Emerging
1957 Sahaj33-op/StudySage-Offline-Online-AI-Note-Assistant

StudySage ๐Ÿง  โ€“ An offline, AI-powered note assistant that helps students...

33
Emerging
1958 guoriyue/LangCommand

LangCommand is a local inference command-line tool that transforms natural...

33
Emerging
1959 sisinflab/Ducho

Ducho is a Python framework aimed to extract multimodal features used in...

33
Emerging
1960 kyegomez/VortexFusion

Transformers + Mambas + LSTMS All in One Model

33
Emerging
1961 innightwolfsleep/old_llm_telegram_bot

Connect llama-cpp, transformers or text-generation-webui to telegram bot api.

33
Emerging
1962 tanishqgautam/Image-Captioning

Implemented 3 different architectures to tackle the Image Caption problem,...

33
Emerging
1963 SORRY-Bench/sorry-bench

Benchmark evaluation code for "SORRY-Bench: Systematically Evaluating Large...

33
Emerging
1964 allenai/x-lxmert

PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer...

33
Emerging
1965 Wang-ML-Lab/multimodal-needle-in-a-haystack

[NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking...

33
Emerging
1966 levashi/reprobe

Phase-aware LLM activation steering and linear probing. A memory-efficient,...

33
Emerging
1967 adapter-hub/efficient-task-transfer

Research code for "What to Pre-Train on? Efficient Intermediate Task...

33
Emerging
1968 xinyanghuang7/Basic-Visual-Language-Model

Build a simple basic multimodal large model from scratch. ไปŽ้›ถๆญๅปบไธ€ไธช็ฎ€ๅ•็š„ๅŸบ็ก€ๅคšๆจกๆ€ๅคงๆจกๅž‹๐Ÿค–

33
Emerging
1969 Sunona-AI-labs/sunona

Sunona: Next-generation voice AI infrastructure. Orchestrate intelligent,...

33
Emerging
1970 Shannon-Labs/shannon-control-unit

Shannon Control Unit: Adaptive regularization via control theory for LLM training

33
Emerging
1971 YuanGongND/ltu

Code, Dataset, and Pretrained Models for Audio and Speech Large Language...

33
Emerging
1972 sail-sg/dice

Official implementation of Bootstrapping Language Models via DPO Implicit Rewards

33
Emerging
1973 abaheti95/LoL-RL

Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving...

33
Emerging
1974 urmzd/md-classifier

A deep learning system combining transformers and CNNs to classify diseases...

33
Emerging
1975 augustwester/transformer-xl

A lightweight PyTorch implementation of the Transformer-XL architecture...

33
Emerging
1976 alantess/gtrxl-torch

Gated Transformer Model for Computer Vision

33
Emerging
1977 bipinKrishnan/ml-recipe-book

A book containing step by step instructions to train deep learning models...

33
Emerging
1978 kyegomez/SSM-As-VLM-Bridge

An exploration into leveraging SSM's as Bridge/Adapter Layers for VLM

33
Emerging
1979 mkofinas/neural-graphs

Official source code for "Graph Neural Networks for Learning Equivariant...

33
Emerging
1980 YJiangcm/LTE

[ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing

33
Emerging
1981 tanulsingh/Humour.ai-Language-model-that-can-crack-Jokes

Language Model that makes you Laugh .

33
Emerging
1982 nolancacheux/advanced-machine-learning-implementations

Comprehensive machine learning implementations covering neural networks,...

33
Emerging
1983 uiuctml/Localize-and-Stitch

Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic

33
Emerging
1984 AlenVelocity/langchain-llama

Run LLAMA LLMs in Node with Langchain

33
Emerging
1985 zatevakhin/obsidian-local-llm

Obsidian Local LLM is a plugin for Obsidian that provides access to a...

33
Emerging
1986 hanouticelina/deformable-DETR

Implementation of the paper : Deformable DETR: Deformable Transformers for...

33
Emerging
1987 sanjibnarzary/awesome-llm

Curated list of open source and openly accessible large language models

33
Emerging
1988 Jackksonns/CoVALend

CoVALend: a compliance-aware micro-lending default prediction pipeline with...

33
Emerging
1989 kyegomez/AudioMamba

Implementation of the paper: "Audio Mamba: Bidirectional State Space Model...

33
Emerging
1990 avsrma/LLM-based-AI-Assistant

A general purpose AI voice assistant built using GPT-4.

33
Emerging
1991 martin-wey/CodeUltraFeedback

CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)

33
Emerging
1992 ROIM1998/APT

[ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models...

33
Emerging
1993 vardhin/Humanizer

AI text humanization tool with detection capabilities. Transform...

32
Emerging
1994 Justus0405/LLM-Bot

๐Ÿ“Ž A Discord chatbot compatible with OpenAI, Ollama, and Llama.cpp

32
Emerging
1995 peacelwh/VT-FSL

[NeurIPS 2025] VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning

32
Emerging
1996 shoppollama/shoppollama

Open Source Agentic Commerce Platform built on Ollama and Stripe โ€” Run...

32
Emerging
1997 KishanBagaria/OCLB

๐Ÿฆ™ One Click Llama Button for DeviantArt.com

32
Emerging
1998 AdamCoscia/KnowledgeVIS

Visually compare fill-in-the-blank LLM prompts to uncover learned biases and...

32
Emerging
1999 lxe/llavavision

A simple "Be My Eyes" web app with a llama.cpp/llava backend

32
Emerging
2000 Md-Emon-Hasan/InformaTruth

Fine-tuned roberta-base classifier on the LIAR dataset. Aaccepts multiple...

32
Emerging
« Prev 1 2 3 18 19 20 21 22 63 64 65 Next »