All Transformer Models

6,427 models ranked by quality score · Page 2 of 65

Showing 101–200 of 6,427

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
101	NX-AI/xlstm Official repository of the xLSTM.	63	Established	llm-frameworks-libraries	2,124	Python
102	inseq-team/inseq Interpretability for sequence generation models 🐛 🔍	63	Established	transformer-interpretability-mechanistic	462	Python
103	csinva/imodelsX Interpret text data with LLMs (sklearn compatible).	63	Established	explainability-interpretability-frameworks	175	Python
104	EricLBuehler/mistral.rs Fast, flexible LLM inference	62	Established	rust-llm-infrastructure	6,681	Rust
105	sauravpanda/BrowserAI Run local LLMs like llama, deepseek-distill, kokoro and more inside your browser	62	Established	llm-terminal-automation	1,381	TypeScript
106	NVlabs/MambaVision [CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid...	62	Established	3d-vision-transformers	2,060	Python
107	lucidrains/dreamer4 Implementation of Danijar's latest iteration for his Dreamer line of work	62	Established	transformer-architecture-tutorials	165	Python
108	NVIDIA/sphinx-llm LLM extensions for Sphinx Documentation	62	Established	llm-function-calling	16	Python
109	RBLN-SW/optimum-rbln ⚡ A seamless integration of HuggingFace Transformers & Diffusers with RBLN...	61	Established	transformer-training-optimization	15	Python
110	sintel-dev/sigllm Using Large Language Models for Time Series Anomaly Detection	61	Established	prompt-engineering-techniques	85	Jupyter Notebook
111	cyberchitta/llm-context.py Share code with LLMs via Model Context Protocol or clipboard. Rule-based...	61	Established	llm-orchestration-routing	295	Python
112	huggingface/alignment-handbook Robust recipes to align language models with human and AI preferences	61	Established	rlhf-alignment-training	5,523	Python
113	hassancs91/SimplerLLM Simplify interactions with Large Language Models	61	Established	llm-framework-abstractions	192	Python
114	deeppavlov/AutoIntent Automated machine learning for text classification	60	Established	named-entity-recognition	53	Python
115	jncraton/languagemodels Explore large language models in 512MB of RAM	60	Established	llm-scaling-architecture	1,197	HTML
116	Michael-A-Kuykendall/shimmy ⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF +...	60	Established	local-llm-deployment	3,793	Rust
117	UbiquitousLearning/mllm Fast Multimodal LLM on Mobile Devices	60	Established	local-llm-deployment	1,429	C++
118	om-ai-lab/VLM-R1 Solve Visual Understanding with Reinforced VLMs	60	Established	multimodal-vision-language	5,864	Python
119	skyzh/tiny-llm A course of learning LLM inference serving on Apple Silicon for systems...	60	Established	llm-inference-serving	3,935	Python
120	kaito-project/aikit 🏗️ Fine-tune, build, and deploy open-source LLMs easily!	60	Established	local-llm-deployment	512	Go
121	zjunlp/EasyEdit [ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.	60	Established	rlhf-alignment-training	2,744	Jupyter Notebook
122	stas00/ml-engineering Machine Learning Engineering Open Book	60	Established	ml-foundations-curricula	17,380	Python
123	mybigday/llama.rn React Native binding of llama.cpp	60	Established	local-llm-deployment	851	C++
124	ModelTC/LightCompress [EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models...	60	Established	llm-compression-optimization	688	Python
125	poloclub/transformer-explainer Transformer Explained Visually: Learn How LLM Transformer Models Work with...	59	Established	gpt-model-fine-tuning	6,916	JavaScript
126	FastFlowLM/FastFlowLM Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but...	59	Established	llm-inference-engines	942	C++
127	arcee-ai/mergekit Tools for merging pretrained large language models.	59	Established	llm-training-experimentation	6,857	Python
128	structuredllm/syncode Efficient and general syntactical decoding for Large Language Models	59	Established	speculative-decoding-algorithms	328	Python
129	zhihu/ZhiLight A highly optimized LLM inference acceleration engine for Llama and its variants.	59	Established	llm-inference-engines	905	C++
130	OptimalScale/LMFlow An Extensible Toolkit for Finetuning and Inference of Large Foundation...	59	Established	llm-fine-tuning	8,489	Python
131	changyeyu/LLM-RL-Visualized 🌟100+ 原创 LLM / RL 原理图📚，《大模型算法》作者巨献！💥（100+ LLM/RL Algorithm Maps ）	58	Established	llm-training-experimentation	3,766	Python
132	peremartra/optipfair Structured pruning and bias visualization for Large Language Models. Tools...	58	Established	llm-pruning-compression	29	Python
133	eole-nlp/eole Open language modeling toolkit based on PyTorch	58	Established	transformer-training-optimization	176	Python
134	lucidrains/simple-hierarchical-transformer Experiments around a simple idea for inducing multiple hierarchical...	58	Established	transformer-architecture-tutorials	225	Python
135	roboflow/maestro streamline the fine-tuning process for multimodal models: PaliGemma 2,...	58	Established	lora-qlora-fine-tuning	2,661	Python
136	argosopentech/argos-translate Open-source offline translation library written in Python	58	Established	neural-machine-translation	5,745	Python
137	EleutherAI/gpt-neox An implementation of model parallel autoregressive transformers on GPUs,...	58	Established	gpt2-pretraining-fine-tuning	7,399	Python
138	BlinkDL/RWKV-LM RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can...	57	Established	llm-quantization-methods	14,414	Python
139	azukds/tubular Python package implementing ML feature engineering and pre-processing for...	57	Established	huggingface-learning-resources	93	Python
140	HandsOnLLM/Hands-On-Large-Language-Models Official code repo for the O'Reilly Book - "Hands-On Large Language Models"	57	Established	llm-learning-resources	23,351	Jupyter Notebook
141	kyegomez/BitNet Implementation of "BitNet: Scaling 1-bit Transformers for Large Language...	57	Established	loss-function-implementations	1,898	Python
142	huggingface/optimum-habana Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)	57	Established	transformer-training-optimization	207	Python
143	clusterzx/paperless-ai An automated document analyzer for Paperless-ngx using OpenAI API, Ollama,...	57	Established	ocr-document-extraction	5,410	JavaScript
144	mlabonne/llm-course Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.	57	Established	llm-learning-resources	76,573	—
145	microsoft/unilm Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities	57	Established	llm-scaling-architecture	22,042	Python
146	xaviviro/python-toon 🐍 TOON for Python (Token-Oriented Object Notation) Encoder/Decoder - Reduce...	57	Established	llm-serialization-formats	331	Python
147	NexaAI/nexa-sdk Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and...	57	Established	llm-inference-engines	7,797	Kotlin
148	thu-ml/SageAttention [ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves...	57	Established	sparse-attention-optimization	3,213	Cuda
149	sign-language-translator/sign-language-translator Python library & framework to build custom translators for the...	57	Established	3d-vision-transformers	329	Python
150	nyu-mll/jiant jiant is an nlp toolkit	56	Established	bert-model-implementations	1,674	Python
151	scaleapi/llm-engine Scale LLM Engine public repository	56	Established	llm-knowledge-distillation	821	Python
152	kyegomez/LFM2 A simple and minimal open source implementation of "Introducing LFM2: The...	56	Established	llm-training-experimentation	23	Python
153	NVIDIA-NeMo/Automodel Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging...	56	Established	llm-inference-engines	366	Python
154	OpenNMT/CTranslate2 Fast inference engine for Transformer models	56	Established	ml-inference-benchmarking	4,354	C++
155	BradyFU/Awesome-Multimodal-Large-Language-Models :sparkles::sparkles:Latest Advances on Multimodal Large Language Models	56	Established	multimodal-vision-language-models	17,448	—
156	niedev/RTranslator Open source real-time translation app for Android that runs locally	56	Established	neural-machine-translation	9,686	C++
157	microsoft/mup maximal update parametrization (µP)	56	Established	transformer-training-optimization	1,689	Jupyter Notebook
158	mistralai/mistral-inference Official inference library for Mistral models	56	Established	mistral-ai-tools	10,705	Jupyter Notebook
159	labmlai/annotated_deep_learning_paper_implementations 🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side...	56	Established	ml-foundations-curricula	65,913	Python
160	rickiepark/nlp-with-transformers <트랜스포머를 활용한 자연어 처리> 예제 코드를 위한 저장소입니다.	56	Established	huggingface-learning-resources	144	Jupyter Notebook
161	Picovoice/picollm On-device LLM Inference Powered by X-Bit Quantization	55	Established	llm-quantization-methods	305	Python
162	ManuelSLemos/RabbitLLM Run 70B+ LLMs on a single 4GB GPU — no quantization required.	55	Established	llm-cuda-optimization	38	Python
163	explosion/spacy-llm 🦙 Integrating LLMs into structured NLP pipelines	55	Established	llm-fine-tuning-frameworks	1,367	Python
164	fashn-AI/fashn-human-parser Human parsing model for fashion and virtual try-on applications	55	Established	3d-vision-transformers	24	Python
165	b4rtaz/distributed-llama Distributed LLM inference. Connect home devices into a powerful cluster to...	55	Established	apple-silicon-llm-inference	2,856	C++
166	Freed-Wu/translate-shell Translate text by google, bing, youdaozhiyun, haici, stardict, openai, large...	55	Established	neural-machine-translation	48	Python
167	CrazyBoyM/llama3-Chinese-chat Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。	55	Established	multilingual-llm-adaptation	4,154	Python
168	BeastByteAI/scikit-llm Seamlessly integrate LLMs into scikit-learn.	55	Established	llm-training-experimentation	3,490	Python
169	NVIDIA/kvpress LLM KV cache compression made easy	55	Established	llm-quantization-methods	954	Python
170	jakobdylanc/llmcord Make Discord your LLM frontend - Supports any OpenAI compatible API (Ollama,...	54	Established	messaging-platform-chatbots	767	Python
171	GradientHQ/parallax Parallax is a distributed model serving framework that lets you build your...	54	Established	multilingual-llm-adaptation	1,152	Python
172	TinyLLaVA/TinyLLaVA_Factory A Framework of Small-scale Large Multimodal Models	54	Established	vision-language-instruction-tuning	962	Python
173	mdsrqbl/omnihuman AI model that understands text & humanoids.	54	Established	ml-foundations-curricula	134	Python
174	label-sleuth/label-sleuth Open source no-code system for text annotation and building of text classifiers	54	Established	blip-image-captioning	271	Python
175	nrl-ai/llama-assistant AI-powered assistant to help you with your daily tasks, powered by Llama 3,...	54	Established	conversational-chatbot-applications	530	Python
176	cheahjs/free-llm-api-resources A list of free LLM inference resources accessible via API.	54	Established	local-llm-deployment	15,475	Python
177	Tiiny-AI/PowerInfer High-speed Large Language Model Serving for Local Deployment	54	Established	llm-inference-engines	8,808	C++
178	analyticalrohit/AI-ML-Cheatsheets All Stanford Cheatsheets: Artificial Intelligence, Transformers, LLMs, Deep...	54	Established	ml-foundations-curricula	817	—
179	OpenMachine-ai/transformer-tricks A collection of tricks and tools to speed up transformer models	54	Established	gpt-model-fine-tuning	197	TeX
180	quic/efficient-transformers This library empowers users to seamlessly port pretrained models and...	54	Established	llm-cuda-optimization	87	Python
181	peremartra/Large-Language-Model-Notebooks-Course Practical course about Large Language Models.	54	Established	nlp-fundamentals-tutorials	1,777	Jupyter Notebook
182	ericmjl/llamabot Pythonic class-based interface to LLMs	54	Established	chatglm-fine-tuning	179	Python
183	albertan017/LLM4Decompile Reverse Engineering: Decompiling Binary Code with Large Language Models	54	Established	llm-scaling-architecture	6,407	Python
184	Shivanandroy/simpleT5 simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets...	54	Established	t5-mt5-fine-tuning	400	Python
185	huggingface/audio-transformers-course The Hugging Face Course on Transformers for Audio	54	Established	huggingface-learning-resources	486	MDX
186	MattyB95/Jabberjay 🦜 Synthetic Voice Detection	53	Established	wav2vec2-speech-recognition	5	Python
187	sgl-project/ome Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU...	53	Established	local-llm-deployment	393	Go
188	muxi-ai/onellm Unified interface for interacting with various LLMs hundreds of models,...	53	Established	llm-orchestration-platforms	44	Python
189	ServerlessLLM/ServerlessLLM Serverless LLM Serving for Everyone.	53	Established	llm-inference-serving	663	Python
190	floneum/floneum Instant, controllable, local pre-trained AI models in Rust	53	Established	local-llm-deployment	2,153	Rust
191	underneathall/pinferencia Python + Inference - Model Deployment library in Python. Simplest model...	53	Established	llm-inference-engines	545	Python
192	davidpirogov/toon-llm Token-Oriented Object Notation (TOON) is an LLM-optimized data serialization...	53	Established	llm-learning-resources	9	Python
193	lucidrains/locoformer LocoFormer - Generalist Locomotion via Long-Context Adaptation	53	Established	transformer-architecture-tutorials	102	Python
194	avilum/minrlm Token-efficient Recursive Language Model. 3.6x fewer tokens than vanilla...	53	Established	llm-framework-abstractions	31	Python
195	PKU-Alignment/align-anything Align Anything: Training All-modality Model with Feedback	53	Established	rlhf-alignment-training	4,635	Python
196	shibing624/textgen TextGen: Implementation of Text Generation models, include LLaMA, BLOOM,...	53	Established	gpt2-pretraining-fine-tuning	979	Python
197	GeeeekExplorer/nano-vllm Nano vLLM	53	Established	llm-inference-engines	12,189	Python
198	mlabonne/llm-datasets Curated list of datasets and tools for post-training.	53	Established	llm-domain-datasets	4,319	—
199	HowieHwong/TrustLLM [ICML 2024] TrustLLM: Trustworthiness in Large Language Models	52	Established	safety-robustness-evaluation	619	Python
200	Mobile-Artificial-Intelligence/llama_sdk lcpp is a dart implementation of llama.cpp used by the mobile artificial...	52	Established	local-llm-deployment	115	C++

« Prev 1 2 3 4 … 63 64 65 Next »