All Transformer Models

6,429 models ranked by quality score · Page 20 of 65

Showing 1901–2000 of 6,429

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
1901	duyhominhnguyen/Exgra-Med [NeurIPS 2025] ExGra-Med: Medical Multi-Modal LLM with Extended Context Alignment	33	Emerging	clinical-llm-tools	41	Python
1902	BIDS-Xu-Lab/Me-LLaMA A novel medical large language model family with 13/70B parameters, which...	33	Emerging	multilingual-llm-adaptation	167	Python
1903	asahi417/lm-vocab-trimmer Vocabulary Trimming (VT) is a model compression technique, which reduces a...	33	Emerging	llm-compression-optimization	63	Python
1904	ai-glimpse/toyllm ToyLLM: Learning LLM from Scratch	33	Emerging	llm-implementation-tutorials	25	Python
1905	sajjjadayobi/ParsBigBird Persian Bert For Long-Range Sequences	33	Emerging	korean-language-models	63	Jupyter Notebook
1906	zake7749/Kyara [Kaggle-2nd] Lightweight yet Effective Chinese LLM.	33	Emerging	llm-frameworks-libraries	53	Jupyter Notebook
1907	Nondzu/LlamaTor LlamaTor: Decentralized AI model sharing via BitTorrent for efficient,...	33	Emerging	llama-model-implementations	58	Python
1908	akanyaani/miniLLAMA A simplified LLAMA implementation for training and inference tasks.	33	Emerging	llama-model-implementations	36	Python
1909	kyegomez/MM1 PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from...	33	Emerging	vision-language-models	26	Python
1910	yuecao0119/MMFuser The official implementation of the paper "MMFuser: Multimodal Multi-Layer...	33	Emerging	multimodal-vision-language	64	Python
1911	adithya-s-k/CompanionLLM CompanionLLM - A framework to finetune LLMs to be your own sentient...	33	Emerging	lora-qlora-fine-tuning	50	Jupyter Notebook
1912	benitomartin/food-images-finetuning Fine-tuning of LiquidAI LFM2-VL vision-language models on food image...	33	Emerging	lora-qlora-fine-tuning	7	Jupyter Notebook
1913	rkinas/reasoning_models_how_to This repository serves as a collection of research notes and resources on...	33	Emerging	llm-reasoning-research	132	Python
1914	horseee/LLaMA-Pruning Structural Pruning for LLaMA	33	Emerging	llm-pruning-compression	54	Python
1915	poloclub/tsr-convstem High-Performance Transformers for Table Structure Recognition Need Early Convolutions	33	Emerging	academic-thesis-repositories	45	Python
1916	AlexandrosChrtn/llama-fine-tune-guide Fine-tune the newly released Llama-3.2 lightweight models.	33	Emerging	llm-fine-tuning	22	Python
1917	olaflaitinen/llm-proteomics-hallucination Systematic evaluation of hallucination risks in Large Language Models...	33	Emerging	llm-finetuning-frameworks	9	Python
1918	cokeshao/HoliTom [NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models	33	Emerging	multimodal-vision-language	72	Python
1919	dhpollack/huggingface_libtorch Minimal example of using a traced huggingface transformers model with libtorch	33	Emerging	machine-translation-transformers	35	C++
1920	sytelus/nanuGPT Simple, reliable and well tested training code for quick experiments with...	33	Emerging	gpt2-pretraining-fine-tuning	13	Python
1921	YunzeMan/Lexicon3D [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D...	33	Emerging	multimodal-vision-language	100	Python
1922	Beomi/KcBERT-Finetune KcBERT/KcELECTRA Fine Tune Benchmarks code (forked from...	33	Emerging	bert-model-implementations	47	Python
1923	pat-jj/KG-FIT [NeurIPS'24] Knowledge Graph Fine-Tuning using LLMs	33	Emerging	llm-knowledge-graph-generation	130	Python
1924	casinca/LLM-quest Verbose implementations of LLMs architectures, techniques and research...	33	Emerging	llm-finetuning-frameworks	12	Python
1925	UCDvision/NOLA Code for NOLA, an implementation of "nola: Compressing LoRA using Linear...	33	Emerging	lora-qlora-fine-tuning	57	Python
1926	RhinoDevel/mt_llm Pure C wrapper library to use llama.cpp with Linux and Windows as simple as...	33	Emerging	llm-docker-deployments	14	C++
1927	xuanlinli17/large_vlm_distillation_ood Distilling Large Vision-Language Model with Out-of-Distribution...	33	Emerging	domain-adaptation-frameworks	61	Python
1928	psmarter/mini-infer A high-performance LLM inference engine with PagedAttention \|...	33	Emerging	llm-inference-engines	61	Python
1929	bloomberg/minilmv2.bb Our open source implementation of MiniLMv2...	33	Emerging	llm-implementation-from-scratch	61	Python
1930	Yog-Sotho/LLM-fine-tuner Powerful no-code LLM fine-tuner: upload data → train → deploy in minutes....	33	Emerging	llm-fine-tuning	13	Python
1931	muhtalhakhan/Hacktoberfest2024 Hacktoberfest 2024 🧑🏻‍💻 OPEN FIRST Pull Request 🎉	33	Emerging	ai-powered-saas-startups	8	HTML
1932	SuperBianC/scMulan Repository for paper scMulan: a multitask generative pre-trained language...	33	Emerging	llm-learning-resources	62	Jupyter Notebook
1933	mcbal/deep-implicit-attention Implementation of deep implicit attention in PyTorch	33	Emerging	transformer-architecture-tutorials	65	Python
1934	BauplanLabs/Making-Databases-Faster-with-LLM-Evolutionary-Sampling Repository hosting code to reproduce our paper (with Stanford and...	33	Emerging	llm-compression-optimization	18	Python
1935	luciusssss/ZhuangBench [ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly	33	Emerging	llm-scaling-architecture	25	Python
1936	microsoft/MMLU-CF A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]	33	Emerging	llm-interpretability-explainability	123	—
1937	Anjum48/commonlitreadabilityprize 4th Place solution for the Kaggle CommonLit Readability Prize	33	Emerging	essay-scoring-grading	38	Jupyter Notebook
1938	Scicrop/llm-vision-basics Educational notebooks that demystify Large Language Models and Computer...	33	Emerging	defect-detection-quality-forensics	18	Jupyter Notebook
1939	lennartpollvogt/ollama-instructor Python library for the instruction and reliable validation of structured...	33	Emerging	llm-docker-deployments	77	Python
1940	r1cc4rd0m4zz4/traNsLatorLaB translatorlab: a machine translation tool that uses artificial intelligence...	33	Emerging	neural-machine-translation	2	Python
1941	HKUDS/RecLM [ACL2025] "RecLM: Recommendation Instruction Tuning"	33	Emerging	llm-recommendation-systems	109	Python
1942	anyscale/llm-router Tutorial for building LLM router	33	Emerging	llm-request-routing	246	Python
1943	monologg/korean-hate-speech-koelectra Bias, Hate classification with KoELECTRA 👿	33	Emerging	korean-language-models	27	Python
1944	darkwebdesign/symfony-addon-pack Symfony Add-on Pack	33	Emerging	php-ai-sdks	6	PHP
1945	hollobit/GenAI_LLM_timeline ChatGPT, GenerativeAI and LLMs Timeline	33	Emerging	prompt-engineering-security	956	—
1946	deep-div/Custom-Transformer-Pytorch A clean, ground-up implementation of the Transformer architecture in...	33	Emerging	transformer-architecture-tutorials	16	Jupyter Notebook
1947	WooooDyy/BAPO Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for...	33	Emerging	rlhf-alignment-training	91	Python
1948	babycommando/neuralgraffiti Live-bending a foundation model’s output at neural network level.	33	Emerging	lora-qlora-fine-tuning	273	Jupyter Notebook
1949	yaojin17/Unlearning_LLM [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large...	33	Emerging	rlhf-alignment-training	66	Python
1950	kreasof-ai/OpenFormer A hackable library for running and fine-tuning modern transformer models on...	33	Emerging	transformer-architecture-tutorials	28	Python
1951	nightdessert/Retrieval_Head open-source code for paper: Retrieval Head Mechanistically Explains...	33	Emerging	llm-bias-evaluation	236	Python
1952	ZigeW/data_management_LLM Collection of training data management explorations for large language models	33	Emerging	llm-scaling-architecture	337	—
1953	Orlando-CS/Awesome-VLA ✨✨latest advancements in VLA models(VIsion Language Action)	33	Emerging	multimodal-vision-language-models	109	—
1954	UBC-NLP/marbert UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic	33	Emerging	bert-model-frameworks	114	—
1955	ArchAIve-Project/Backend A complex Flask API system empowered by custom ML models, LLMs and...	33	Emerging	ml-api-deployment	2	Python
1956	josStorer/llama.cpp-unicode-windows llama.cpp with unicode (windows) support	33	Emerging	interactive-ai-chat-uis	54	C
1957	Sahaj33-op/StudySage-Offline-Online-AI-Note-Assistant StudySage 🧠 – An offline, AI-powered note assistant that helps students...	33	Emerging	study-aid-generators	6	Python
1958	guoriyue/LangCommand LangCommand is a local inference command-line tool that transforms natural...	33	Emerging	llm-terminal-automation	118	C++
1959	sisinflab/Ducho Ducho is a Python framework aimed to extract multimodal features used in...	33	Emerging	multimodal-fusion-transformers	26	Python
1960	kyegomez/VortexFusion Transformers + Mambas + LSTMS All in One Model	33	Emerging	multimodal-fusion-transformers	14	Python
1961	innightwolfsleep/old_llm_telegram_bot Connect llama-cpp, transformers or text-generation-webui to telegram bot api.	33	Emerging	messaging-platform-chatbots	28	Python
1962	tanishqgautam/Image-Captioning Implemented 3 different architectures to tackle the Image Caption problem,...	33	Emerging	image-captioning-transformers	40	Jupyter Notebook
1963	SORRY-Bench/sorry-bench Benchmark evaluation code for "SORRY-Bench: Systematically Evaluating Large...	33	Emerging	domain-specific-benchmarks	77	Jupyter Notebook
1964	allenai/x-lxmert PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer...	33	Emerging	text-to-image-generation	50	Python
1965	Wang-ML-Lab/multimodal-needle-in-a-haystack [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking...	33	Emerging	multimodal-vision-language	54	Python
1966	levashi/reprobe Phase-aware LLM activation steering and linear probing. A memory-efficient,...	33	Emerging	mathematical-reasoning-transformers	2	Python
1967	adapter-hub/efficient-task-transfer Research code for "What to Pre-Train on? Efficient Intermediate Task...	33	Emerging	parameter-efficient-adapters	37	Python
1968	xinyanghuang7/Basic-Visual-Language-Model Build a simple basic multimodal large model from scratch. 从零搭建一个简单的基础多模态大模型🤖	33	Emerging	multimodal-vision-language	47	Python
1969	Sunona-AI-labs/sunona Sunona: Next-generation voice AI infrastructure. Orchestrate intelligent,...	33	Emerging	conversational-chatbot-applications	15	Python
1970	Shannon-Labs/shannon-control-unit Shannon Control Unit: Adaptive regularization via control theory for LLM training	33	Emerging	lora-qlora-fine-tuning	6	Python
1971	YuanGongND/ltu Code, Dataset, and Pretrained Models for Audio and Speech Large Language...	33	Emerging	llm-scaling-architecture	472	Python
1972	sail-sg/dice Official implementation of Bootstrapping Language Models via DPO Implicit Rewards	33	Emerging	direct-preference-optimization	47	Python
1973	abaheti95/LoL-RL Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving...	33	Emerging	variational-autoencoders-nlp	26	Python
1974	urmzd/md-classifier A deep learning system combining transformers and CNNs to classify diseases...	33	Emerging	medical-image-diagnosis-transformers	2	Jupyter Notebook
1975	augustwester/transformer-xl A lightweight PyTorch implementation of the Transformer-XL architecture...	33	Emerging	machine-translation-transformers	37	Python
1976	alantess/gtrxl-torch Gated Transformer Model for Computer Vision	33	Emerging	vision-language-models	25	Python
1977	bipinKrishnan/ml-recipe-book A book containing step by step instructions to train deep learning models...	33	Emerging	ml-foundations-curricula	37	HTML
1978	kyegomez/SSM-As-VLM-Bridge An exploration into leveraging SSM's as Bridge/Adapter Layers for VLM	33	Emerging	vision-language-models	2	Python
1979	mkofinas/neural-graphs Official source code for "Graph Neural Networks for Learning Equivariant...	33	Emerging	graph-transformers	82	Python
1980	YJiangcm/LTE [ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing	33	Emerging	rlhf-alignment-training	37	Python
1981	tanulsingh/Humour.ai-Language-model-that-can-crack-Jokes Language Model that makes you Laugh .	33	Emerging	gpt2-pretraining-fine-tuning	41	Python
1982	nolancacheux/advanced-machine-learning-implementations Comprehensive machine learning implementations covering neural networks,...	33	Emerging	ml-foundations-curricula	2	Jupyter Notebook
1983	uiuctml/Localize-and-Stitch Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic	33	Emerging	diffusion-language-models	32	Python
1984	AlenVelocity/langchain-llama Run LLAMA LLMs in Node with Langchain	33	Emerging	local-llm-deployment	39	TypeScript
1985	zatevakhin/obsidian-local-llm Obsidian Local LLM is a plugin for Obsidian that provides access to a...	33	Emerging	local-llm-deployment	135	TypeScript
1986	hanouticelina/deformable-DETR Implementation of the paper : Deformable DETR: Deformable Transformers for...	33	Emerging	object-detection-transformers	27	Python
1987	sanjibnarzary/awesome-llm Curated list of open source and openly accessible large language models	33	Emerging	multilingual-llm-adaptation	25	—
1988	Jackksonns/CoVALend CoVALend: a compliance-aware micro-lending default prediction pipeline with...	33	Emerging	llm-training-experimentation	2	Python
1989	kyegomez/AudioMamba Implementation of the paper: "Audio Mamba: Bidirectional State Space Model...	33	Emerging	3d-vision-transformers	14	Shell
1990	avsrma/LLM-based-AI-Assistant A general purpose AI voice assistant built using GPT-4.	33	Emerging	conversational-chatbot-applications	33	Python
1991	martin-wey/CodeUltraFeedback CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)	33	Emerging	math-reasoning-datasets	73	Python
1992	ROIM1998/APT [ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models...	33	Emerging	llm-knowledge-distillation	47	Python
1993	vardhin/Humanizer AI text humanization tool with detection capabilities. Transform...	32	Emerging	ai-content-detection	6	Python
1994	Justus0405/LLM-Bot 📎 A Discord chatbot compatible with OpenAI, Ollama, and Llama.cpp	32	Emerging	messaging-platform-chatbots	1	JavaScript
1995	peacelwh/VT-FSL [NeurIPS 2025] VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning	32	Emerging	multimodal-vision-language	31	Python
1996	shoppollama/shoppollama Open Source Agentic Commerce Platform built on Ollama and Stripe — Run...	32	Emerging	interactive-ai-chat-uis	1	Elixir
1997	KishanBagaria/OCLB 🦙 One Click Llama Button for DeviantArt.com	32	Emerging	interactive-ai-chat-uis	17	JavaScript
1998	AdamCoscia/KnowledgeVIS Visually compare fill-in-the-blank LLM prompts to uncover learned biases and...	32	Emerging	llm-benchmark-leaderboards	7	JavaScript
1999	lxe/llavavision A simple "Be My Eyes" web app with a llama.cpp/llava backend	32	Emerging	interactive-ai-chat-uis	493	JavaScript
2000	Md-Emon-Hasan/InformaTruth Fine-tuned roberta-base classifier on the LIAR dataset. Aaccepts multiple...	32	Emerging	fake-news-detection	1	Jupyter Notebook

« Prev 1 2 3 … 18 19 20 21 22 … 63 64 65 Next »