All Transformer Models

6,427 models ranked by quality score · Page 3 of 65

Showing 201–300 of 6,427

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
201	OpenVoiceOS/ovos-audio-transformer-plugin-ggwave data over sound plugin	52	Established	text-to-speech-tts	2	Python
202	ScrapeGraphAI/toonify Toonify: Compact data format reducing LLM token usage by 30-60%	52	Established	llm-serialization-formats	322	Python
203	PRIME-RL/TTRL [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning	52	Established	llm-reasoning-research	1,014	Python
204	nerdai/llms-from-scratch-rs A comprehensive Rust translation of the code from Sebastian Raschka's Build...	52	Established	rust-llm-infrastructure	306	Rust
205	avikumart/LLM-GenAI-Transformers-Notebooks An repository containing all the LLM notebooks with tutorial and projects	52	Established	ml-foundations-curricula	135	Jupyter Notebook
206	mgonzs13/llama_ros llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2	52	Established	llm-orchestration-platforms	245	C++
207	TharinduDR/TransQuest Transformer based translation quality estimation	51	Established	neural-machine-translation	114	Python
208	jadore801120/attention-is-all-you-need-pytorch A PyTorch implementation of the Transformer model in "Attention is All You Need".	51	Established	attention-mechanism-implementations	9,651	Python
209	PacktPublishing/Mastering-NLP-from-Foundations-to-LLMs Mastering NLP from Foundations to LLMs, Published by Packt	51	Established	llm-learning-resources	124	Jupyter Notebook
210	explosion/curated-transformers 🤖 A PyTorch library of curated Transformer models and their composable components	51	Established	transformer-frameworks-wrappers	894	Python
211	ai-decentralized/BloomBee Decentralized LLMs fine-tuning and inference with offloading	51	Established	llm-inference-engines	111	Python
212	SalesforceAIResearch/uni2ts Unified Training of Universal Time Series Forecasting Transformers	51	Established	time-series-forecasting-transformers	1,436	Jupyter Notebook
213	ServiceNow/TACTiS TACTiS-2: Better, Faster, Simpler Attentional Copulas for Multivariate Time...	51	Established	time-series-forecasting-transformers	139	Python
214	fixie-ai/ultravox A fast multimodal LLM for real-time voice	51	Established	multimodal-vision-language	4,377	Python
215	helpmefindaname/transformer-smaller-training-vocab Temporary remove unused tokens during training to save ram and speed.	51	Established	transformer-architecture-tutorials	23	Python
216	google/deepconsensus DeepConsensus uses gap-aware sequence transformers to correct errors in...	50	Established	transformer-frameworks-wrappers	256	Python
217	stanfordnlp/axbench Stanford NLP Python library for benchmarking the utility of LLM...	50	Established	domain-specific-benchmarks	175	Python
218	UKPLab/gpl Powerful unsupervised domain adaptation method for dense retrieval. Requires...	50	Established	mathematical-reasoning-transformers	340	Python
219	mindspore-lab/step_into_llm MindSpore online courses: Step into LLM	50	Established	llm-training-experimentation	484	Jupyter Notebook
220	alesanfra/toons A high-performance TOON (Token Oriented Object Notation) parser and...	50	Established	llm-serialization-formats	11	Rust
221	adithya-s-k/AI-Engineering.academy Mastering Applied AI, One Concept at a Time	50	Established	llm-fine-tuning	2,140	Jupyter Notebook
222	jsksxs360/How-to-use-Transformers Transformers 库快速入门教程	50	Established	transformer-frameworks-wrappers	1,850	Python
223	huggingface/transformers.js-examples A collection of 🤗 Transformers.js demos and example applications	50	Established	browser-based-ml-inference	1,987	JavaScript
224	dvgodoy/FineTuningLLMs Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with...	50	Established	lora-qlora-fine-tuning	786	Jupyter Notebook
225	moment-timeseries-foundation-model/moment MOMENT: A Family of Open Time-series Foundation Models, ICML'24	50	Established	time-series-forecasting-transformers	723	TypeScript
226	ridgerchu/matmulfreellm Implementation for MatMul-free LM.	50	Established	llm-implementation-tutorials	3,058	Python
227	Omid-Nejati/MedViTV2 MedViTV2: Medical Image Classification with KAN-Integrated Transformers and...	50	Established	medical-image-diagnosis-transformers	91	Jupyter Notebook
228	minggnim/nlp-models A repository for training transformer based models	50	Established	model-evaluation-diagnostics	2	Jupyter Notebook
229	yjg30737/pyqt-openai VividNode: Multi-purpose Text & Image Generation Desktop Chatbot (supporting...	50	Established	multi-provider-llm-interfaces	150	Python
230	ruanchaves/hashformers Accurate word segmentation for hashtags and text, powered by Transformers...	50	Established	transformer-frameworks-wrappers	77	Python
231	serge-chat/serge A web interface for chatting with Alpaca through llama.cpp. Fully...	50	Established	interactive-ai-chat-uis	5,741	Svelte
232	ggml-org/llama.vscode VS Code extension for LLM-assisted code/text completion	50	Established	code-completion-copilots	1,197	TypeScript
233	kyegomez/MambaTransformer Integrating Mamba/SSMs with Transformer for Enhanced Long Context and...	50	Established	state-space-model-architectures	215	Python
234	hyunwoongko/nanoRLHF nanoRLHF: from-scratch journey into how LLMs and RLHF really work.	50	Established	rlhf-alignment-training	168	Python
235	SafeAILab/EAGLE Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and...	50	Established	speculative-decoding-algorithms	2,213	Python
236	tattn/LocalLLMClient Swift package to run local LLMs on iOS, macOS, Linux	50	Established	local-llm-deployment	168	Swift
237	Strvm/meta-ai-api Llama 3 API 70B & 405B (MetaAI Reverse Engineered)	50	Established	local-llm-deployment	396	Python
238	higgsfield-ai/higgsfield Fault-tolerant, highly scalable GPU orchestration, and a machine learning...	49	Emerging	llm-inference-engines	3,558	Jupyter Notebook
239	iusztinpaul/hands-on-llms 🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training,...	49	Emerging	llm-training-experimentation	3,401	Jupyter Notebook
240	lucidrains/alphagenome Implementation of AlphaGenome, Deepmind's updated genomic attention model	49	Emerging	protein-transformers-ml	97	Jupyter Notebook
241	IntelLabs/nlp-architect A model library for exploring state-of-the-art deep learning topologies and...	49	Emerging	model-evaluation-diagnostics	2,935	Python
242	mukel/llama3.java Practical Llama 3 inference in Java	49	Emerging	local-llm-deployment	800	Java
243	bodaay/HuggingFaceModelDownloader Simple go utility to download HuggingFace Models and Datasets	49	Emerging	llm-quantization-methods	915	Go
244	abelriboulot/onnxt5 Summarization, translation, sentiment-analysis, text-generation and more at...	49	Emerging	text-summarization-transformers	256	Python
245	yuanzhoulvpi2017/zero_nlp 中文nlp解决方案(大模型、数据、模型、训练、推理)	49	Emerging	model-evaluation-diagnostics	3,783	Jupyter Notebook
246	louisfb01/start-llms A complete guide to start and improve your LLM skills in 2026 with little...	49	Emerging	llm-learning-resources	949	—
247	intel/ipex-llm Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM,...	49	Emerging	llm-inference-engines	8,724	Python
248	KimMeen/Time-LLM [ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting...	49	Emerging	multimodal-vision-language	2,563	Python
249	sapientinc/HRM Hierarchical Reasoning Model Official Release	49	Emerging	llm-reasoning-research	12,358	Python
250	CLUEbenchmark/CLUE 中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets,...	49	Emerging	multilingual-llm-adaptation	4,237	Python
251	galilai-group/stable-pretraining Reliable, minimal and scalable library for pretraining foundation and world models	49	Emerging	mathematical-reasoning-transformers	133	Python
252	kossisoroyce/timber Ollama for classical ML models. AOT compiler that turns XGBoost, LightGBM,...	49	Emerging	distributed-training-frameworks	636	Python
253	kyegomez/Jamba PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"	49	Emerging	3d-vision-transformers	208	Python
254	kyegomez/MambaByte Implementation of MambaByte in "MambaByte: Token-free Selective State Space...	49	Emerging	state-space-model-architectures	125	Python
255	maziyarpanahi/openmed open-source healthcare ai	49	Emerging	therapeutic-chatbot-applications	343	Python
256	DashyDashOrg/pandas-llm Pandas-LLM	49	Emerging	llm-frameworks-libraries	46	Python
257	AXERA-TECH/ax-llm Explore LLM model deployment based on AXera's AI chips	49	Emerging	llm-inference-serving	142	C++
258	jhkchan/translategemma-cli Local CLI for Google's TranslateGemma translation models with multi-platform...	49	Emerging	machine-translation-systems	21	Python
259	TIGER-AI-Lab/MMLU-Pro The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task...	49	Emerging	math-reasoning-datasets	347	Python
260	ZHZisZZ/dllm dLLM: Simple Diffusion Language Modeling	49	Emerging	diffusion-language-models	2,193	Python
261	multimodal-art-projection/YuE YuE: Open Full-song Music Generation Foundation Model, something similar to...	49	Emerging	music-generation-transformers	6,083	Python
262	telekom/mltb2 Machine Learning Toolbox 2	49	Emerging	ml-foundations-curricula	13	Python
263	dbiir/UER-py Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo	49	Emerging	nlp-learning-resources	3,106	Python
264	kyegomez/LFM An open source implementation of LFMs from Liquid AI: Liquid Foundation Models	49	Emerging	llm-training-experimentation	207	Python
265	eth-sri/matharena Evaluation of LLMs on latest math competitions	48	Emerging	evaluation-frameworks-metrics	229	Python
266	ddh0/easy-llama Python package wrapping llama.cpp for on-device LLM inference	48	Emerging	llm-quantization-methods	101	Python
267	TIGER-AI-Lab/VLM2Vec This repo contains the code for "VLM2Vec: Training Vision-Language Models...	48	Emerging	multimodal-rag-systems	592	Python
268	edwko/OuteTTS Interface for OuteTTS models.	48	Emerging	text-to-speech-tts	1,429	Python
269	DadaNanjesha/AI-Text-Humanizer-App Transform AI-generated text into formal, human-like, and academic writing...	48	Emerging	ai-content-detection	189	Python
270	UdbhavPrasad072300/Transformer-Implementations Library - Vanilla, ViT, DeiT, BERT, GPT	48	Emerging	vit-image-classification	69	Jupyter Notebook
271	ymcui/Chinese-LLaMA-Alpaca 中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)	48	Emerging	multilingual-llm-adaptation	18,970	Python
272	Facico/Chinese-Vicuna Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model ——...	48	Emerging	multilingual-llm-adaptation	4,136	C
273	ggml-org/llama.vim Vim plugin for LLM-assisted code/text completion	48	Emerging	code-completion-copilots	1,913	Vim Script
274	guinmoon/LLMFarm llama and other large language models on iOS and MacOS offline using GGML library.	48	Emerging	local-llm-deployment	1,994	C
275	lone-cloud/gerbil A desktop app for running Large Language Models locally.	48	Emerging	llm-terminal-automation	445	TypeScript
276	tensorchord/modelz-llm OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and...	48	Emerging	llm-function-calling	276	Python
277	socialfoundations/folktexts Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on...	48	Emerging	llm-training-experimentation	25	Jupyter Notebook
278	google-deepmind/long-form-factuality Benchmarking long-form factuality in large language models. Original code...	48	Emerging	llm-bias-evaluation	672	Python
279	MadryLab/context-cite Attribute (or cite) statements generated by LLMs back to in-context information.	48	Emerging	llm-interpretability-explainability	325	Jupyter Notebook
280	OFA-Sys/Chinese-CLIP Chinese version of CLIP which achieves Chinese cross-modal retrieval and...	48	Emerging	clip-image-embeddings	5,820	Jupyter Notebook
281	megagonlabs/ginza-transformers Use custom tokenizers in spacy-transformers	48	Emerging	tokenizer-libraries	16	Python
282	NVIDIA/FasterTransformer Transformer related optimization, including BERT, GPT	48	Emerging	transformer-architecture-education	6,398	C++
283	AdityaNG/kan-gpt The PyTorch implementation of Generative Pre-trained Transformers (GPTs)...	48	Emerging	gpt2-pretraining-fine-tuning	725	Python
284	datawhalechina/llms-from-scratch-cn 仅需Python基础，从0构建大语言模型；从0逐步构建GLM4\Llama3\RWKV6，深入理解大模型原理	48	Emerging	llm-implementation-from-scratch	4,010	Jupyter Notebook
285	CASE-Lab-UMD/LLM-Drop The official implementation of the paper "Uncovering the Redundancy in...	48	Emerging	llm-implementation-tutorials	189	Python
286	autonomousvision/transfuser [PAMI'23] TransFuser: Imitation with Transformer-Based Sensor Fusion for...	48	Emerging	3d-vision-transformers	1,516	Python
287	yotambraun/APDTFlow APDTFlow is a modern and extensible forecasting framework for time series...	48	Emerging	time-series-forecasting-transformers	43	Python
288	AI-Hypercomputer/JetStream JetStream is a throughput and memory optimized engine for LLM inference on...	48	Emerging	llm-inference-engines	415	Python
289	kyegomez/attn_res A clean, single-file PyTorch implementation of Attention Residuals (Kimi...	48	Emerging	transformer-architecture-tutorials	8	Python
290	BiomedSciAI/biomed-multi-omic Build foundation model for RNA or DNA data	48	Emerging	protein-transformers-ml	56	Jupyter Notebook
291	mirpo/fastapi-gen Build LLM-enabled FastAPI applications without build configuration.	48	Emerging	local-llm-deployment	11	Python
292	beehive-lab/GPULlama3.java GPU-accelerated Llama3.java inference in pure Java using TornadoVM.	48	Emerging	llm-docker-deployments	238	Java
293	MiniMax-AI/MiniMax-01 The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model...	48	Emerging	llm-frameworks-libraries	3,363	Python
294	belladoreai/llama3-tokenizer-js JS tokenizer for LLaMA 3 and LLaMA 3.1	48	Emerging	local-llm-deployment	117	JavaScript
295	NiuTrans/LaTeXTrans A tool for translating the content of LaTeX documents into various other...	48	Emerging	llm-translation-tools	443	TeX
296	LoicGrobol/zeldarose Train transformer-based models.	48	Emerging	model-evaluation-diagnostics	28	Python
297	Kohulan/DECIMER-Image_Transformer DECIMER Image Transformer is a deep-learning-based tool designed for...	48	Emerging	vision-transformer-implementations	345	Python
298	YerbaPage/LongCodeZip LongCodeZip: Compress Long Context for Code Language Models [ASE2025]	48	Emerging	llm-compression-optimization	142	Python
299	haizelabs/verdict Inference-time scaling for LLMs-as-a-judge.	48	Emerging	llm-scaling-architecture	332	Jupyter Notebook
300	zjunlp/EasyInstruct [ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.	48	Emerging	vision-language-instruction-tuning	409	Python

« Prev 1 2 3 4 5 … 63 64 65 Next »