Trending Transformer Models

Models with the biggest quality score improvements over the last 6 days.

#	Model	Change	Score	Tier	Category	Stars
1	kha-white/manga-ocr Optical character recognition for Japanese text, with the main focus being...	+17	64	Established	ocr-document-extraction	2,582
2	SwanHubX/SwanLab ⚡️SwanLab - an open-source, modern-design AI training tracking and...	+17	89	Verified	ml-foundations-curricula	3,670
3	OpenVoiceOS/ovos-audio-transformer-plugin-ggwave data over sound plugin	+17	52	Established	text-to-speech-tts	2
4	Riko0/messenger_logger_callback messenger-logger-callback — Send ML training logs to Telegram. Standalone...	+15	31	Emerging	conversational-chatbot-applications	2
5	rxn4chemistry/rxn-onmt-models Training of OpenNMT-based RXN models	+14	45	Emerging	molecular-generation-transformers	2
6	lpalbou/model-quantizer Effortlessly quantize, benchmark, and publish Hugging Face models with...	+14	25	Experimental	llm-quantization-methods	2
7	ndoll1998/active-transformers Active Learning for Transformer with focus on Sequence Tagging tasks	+13	24	Experimental	transformer-frameworks-wrappers	2
8	kmaurinjones/AllMeans Automatic topic modelling using minimal external input and computational resources	+13	30	Emerging	text-clustering-topic-modeling	2
9	yingding/applyllm A python package for applying LLM with LangChain and Hugging Face on local...	+13	30	Emerging	llm-inference-engines	2
10	Blaizzy/mlx-vlm MLX-VLM is a package for inference and fine-tuning of Vision Language Models...	+12	89	Verified	apple-silicon-llm-inference	2,287
11	touhi99/askagent Simple mac/unix terminal assistant with LLM agents capable of various tasks	+12	35	Emerging	multi-agent-orchestration	2
12	mim-solutions/mim_nlp A Python package with ready-to-use models for various NLP tasks and text...	+12	23	Experimental	nlp-learning-coursework	2
13	sagorbrur/fillblank Fill The Blank	+12	23	Experimental	bert-model-implementations	2
14	duck4i/retro-ui Retro Llama	+11	14	Experimental	interactive-ai-chat-uis	2
15	BradyFU/Awesome-Multimodal-Large-Language-Models :sparkles::sparkles:Latest Advances on Multimodal Large Language Models	+11	56	Established	multimodal-vision-language-models	17,448
16	argosopentech/argos-translate Open-source offline translation library written in Python	+10	58	Established	neural-machine-translation	5,745
17	cui-shaobo/causal-strength evaluating the causal strength between cause and effect	+9	20	Experimental	mathematical-reasoning-transformers	2
18	earthai-tech/fusionlab-learn fusionlab-learn: Igniting Next-Gen Temporal Fusion Architectures	+9	33	Emerging	time-series-forecasting-transformers	2
19	ash-01xor/Imgcap A CLI to generate captions for images	+9	12	Experimental	blip-image-captioning	2
20	changyeyu/LLM-RL-Visualized 🌟100+ 原创 LLM / RL 原理图📚，《大模型算法》作者巨献！💥（100+ LLM/RL Algorithm Maps ）	+9	58	Established	llm-training-experimentation	3,766
21	levashi/reprobe Phase-aware LLM activation steering and linear probing. A memory-efficient,...	+9	33	Emerging	mathematical-reasoning-transformers	2
22	ThilinaRajapakse/simpletransformers Transformers for Information Retrieval, Text Classification, NER, QA,...	+8	75	Verified	transformer-frameworks-wrappers	4,234
23	AutoGPTQ/AutoGPTQ An easy-to-use LLMs quantization package with user-friendly apis, based on...	+7	46	Emerging	llm-quantization-methods	5,033
24	labmlai/annotated_deep_learning_paper_implementations 🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side...	+7	56	Established	ml-foundations-curricula	65,913
25	stas00/ml-engineering Machine Learning Engineering Open Book	+7	60	Established	ml-foundations-curricula	17,380
26	xorbitsai/inference Swap GPT for any LLM by changing a single line of code. Xinference lets you...	+7	89	Verified	llm-inference-engines	9,129
27	hiyouga/LlamaFactory Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)	+7	70	Verified	lora-qlora-fine-tuning	68,347
28	LLMBook-zh/LLMBook-zh.github.io 《大语言模型》作者：赵鑫，李军毅，周昆，唐天一，文继荣	+7	39	Emerging	llm-learning-resources	4,371
29	unslothai/unsloth Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss,...	+7	94	Verified	lora-qlora-fine-tuning	53,879
30	AI-Hypercomputer/maxtext A simple, performant and scalable Jax LLM!	+7	92	Verified	llm-implementation-tutorials	2,169
31	mosaicml/llm-foundry LLM training code for Databricks foundation models	+7	71	Verified	llm-implementation-tutorials	4,397
32	h2oai/h2o-llmstudio H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs....	+7	66	Established	lora-qlora-fine-tuning	4,897
33	IDEA-CCNL/Fengshenbang-LM Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系，成为中文AIGC和认知智能的基础设施。	+7	46	Emerging	multilingual-llm-adaptation	4,149
34	MiniMax-AI/MiniMax-01 The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model...	+7	48	Emerging	llm-frameworks-libraries	3,363
35	deepseek-ai/Janus Janus-Series: Unified Multimodal Understanding and Generation Models	+7	47	Emerging	multimodal-vision-language	17,708
36	fixie-ai/ultravox A fast multimodal LLM for real-time voice	+7	51	Established	multimodal-vision-language	4,377
37	datawhalechina/llm-cookbook 面向开发者的 LLM 入门教程，吴恩达大模型系列课程中文版	+7	41	Emerging	llm-learning-resources	23,496
38	multimodal-art-projection/YuE YuE: Open Full-song Music Generation Foundation Model, something similar to...	+7	49	Emerging	music-generation-transformers	6,083
39	b4rtaz/distributed-llama Distributed LLM inference. Connect home devices into a powerful cluster to...	+7	55	Established	apple-silicon-llm-inference	2,856
40	qingsongedu/time-series-transformers-review A professionally curated list of awesome resources (paper, code, data, etc.)...	+7	46	Emerging	time-series-forecasting-transformers	2,968
41	EleutherAI/gpt-neo An implementation of model parallel GPT-2 and GPT-3-style models using the...	+7	47	Emerging	gpt2-pretraining-fine-tuning	8,286
42	EleutherAI/gpt-neox An implementation of model parallel autoregressive transformers on GPUs,...	+7	58	Established	gpt2-pretraining-fine-tuning	7,399
43	0hq/WebGPT Run GPT model on the browser with WebGPU. An implementation of GPT inference...	+7	44	Emerging	gpt2-pretraining-fine-tuning	3,784
44	jingyaogong/minimind 🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!	+7	64	Established	gpt-implementation-tutorials	41,159
45	VainF/Torch-Pruning [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision...	+7	69	Established	llm-pruning-compression	3,267
46	OFA-Sys/Chinese-CLIP Chinese version of CLIP which achieves Chinese cross-modal retrieval and...	+7	48	Emerging	clip-image-embeddings	5,820
47	cmhungsteve/Awesome-Transformer-Attention An ultimately comprehensive paper list of Vision Transformer/Attention,...	+7	38	Emerging	vision-transformer-implementations	5,022
48	rasbt/LLMs-from-scratch Implement a ChatGPT-like LLM in PyTorch from scratch, step by step	+7	69	Established	llm-implementation-from-scratch	87,892
49	huggingface/transformers.js State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly...	+7	68	Established	browser-based-ml-inference	15,538
50	tensorzero/tensorzero TensorZero is an open-source stack for industrial-grade LLM applications. It...	+7	89	Verified	llm-inference-engines	11,080
51	mistralai/mistral-inference Official inference library for Mistral models	+7	56	Established	mistral-ai-tools	10,705
52	OpenRLHF/OpenRLHF An Easy-to-use, Scalable and High-performance Agentic RL Framework based on...	+7	69	Established	rlhf-alignment-training	9,158
53	bitsandbytes-foundation/bitsandbytes Accessible large language models via k-bit quantization for PyTorch.	+7	90	Verified	llm-quantization-techniques	8,033
54	NexaAI/nexa-sdk Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and...	+7	57	Established	llm-inference-engines	7,797
55	transformerlab/transformerlab-app The open source research environment for AI researchers to seamlessly train,...	+7	71	Verified	power-transformer-design	4,820
56	albertan017/LLM4Decompile Reverse Engineering: Decompiling Binary Code with Large Language Models	+7	54	Established	llm-scaling-architecture	6,407
57	modelscope/ms-swift Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5,...	+7	91	Verified	lora-qlora-fine-tuning	13,105
58	vllm-project/vllm A high-throughput and memory-efficient inference and serving engine for LLMs	+7	100	Verified	llm-inference-engines	73,007
59	mudler/LocalAI :robot: The free, Open Source alternative to OpenAI, Claude and others....	+7	70	Verified	local-llm-deployment	43,530
60	gpustack/gpustack Performance-optimized AI inference on your GPUs. Unlock superior throughput...	+7	71	Verified	llm-inference-engines	4,630
61	mlabonne/llm-datasets Curated list of datasets and tools for post-training.	+7	53	Established	llm-domain-datasets	4,319
62	rasbt/reasoning-from-scratch Implement a reasoning LLM in PyTorch from scratch, step by step	+7	71	Verified	llm-implementation-tutorials	3,452
63	huggingface/optimum 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and...	+7	90	Verified	transformer-training-optimization	3,325
64	pytorch/ao PyTorch native quantization and sparsity for training and inference	+7	74	Verified	llm-quantization-methods	2,729
65	AI4Finance-Foundation/FinGPT FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We...	+7	82	Verified	financial-ai-agents	18,815
66	huggingface/transformers 🤗 Transformers: the model-definition framework for state-of-the-art machine...	+7	100	Verified	transformer-architecture-education	157,811
67	haotian-liu/LLaVA [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V...	+7	47	Emerging	vision-language-instruction-tuning	24,554
68	DAMO-NLP-SG/Video-LLaMA [EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language...	+7	46	Emerging	vision-language-instruction-tuning	3,134
69	Instruction-Tuning-with-GPT-4/GPT-4-LLM Instruction Tuning with GPT-4	+7	45	Emerging	vision-language-instruction-tuning	4,339
70	CLUEbenchmark/CLUE 中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets,...	+7	49	Emerging	multilingual-llm-adaptation	4,237
71	PhoebusSi/Alpaca-CoT We unified the interfaces of instruction-tuning data (e.g., CoT data),...	+7	45	Emerging	multilingual-llm-adaptation	2,801
72	HqWu-HITCS/Awesome-Chinese-LLM 整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。	+7	40	Emerging	multilingual-llm-adaptation	22,371
73	LianjiaTech/BELLE BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）	+7	46	Emerging	multilingual-llm-adaptation	8,284
74	Facico/Chinese-Vicuna Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model ——...	+7	48	Emerging	multilingual-llm-adaptation	4,136
75	datawhalechina/llms-from-scratch-cn 仅需Python基础，从0构建大语言模型；从0逐步构建GLM4\Llama3\RWKV6，深入理解大模型原理	+7	48	Emerging	llm-implementation-from-scratch	4,010
76	cocktailpeanut/dalai The simplest way to run LLaMA on your local machine	+7	38	Emerging	local-llm-deployment	12,980
77	ashishpatel26/LLM-Finetuning LLM Finetuning with peft	+7	45	Emerging	lora-qlora-fine-tuning	2,827
78	alibaba/MNN MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba,...	+7	93	Verified	llm-inference-engines	14,526
79	higgsfield-ai/higgsfield Fault-tolerant, highly scalable GPU orchestration, and a machine learning...	+7	49	Emerging	llm-inference-engines	3,558
80	Tiiny-AI/PowerInfer High-speed Large Language Model Serving for Local Deployment	+7	54	Established	llm-inference-engines	8,808
81	run-llama/LlamaIndexTS Data framework for your LLM applications. Focus on server side solution	+7	65	Established	interactive-ai-chat-uis	3,066
82	sgl-project/sglang SGLang is a high-performance serving framework for large language models and...	+7	100	Verified	llm-inference-engines	24,410
83	HandsOnLLM/Hands-On-Large-Language-Models Official code repo for the O'Reilly Book - "Hands-On Large Language Models"	+7	57	Established	llm-learning-resources	23,351
84	OptimalScale/LMFlow An Extensible Toolkit for Finetuning and Inference of Large Foundation...	+7	59	Established	llm-fine-tuning	8,489
85	fla-org/flash-linear-attention 🚀 Efficient implementations of state-of-the-art linear attention models	+7	89	Verified	sparse-attention-optimization	4,549
86	intel/neural-compressor SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity;...	+7	90	Verified	llm-quantization-techniques	2,597
87	huggingface/text-generation-inference Large Language Model Text Generation Inference	+7	82	Verified	gpt-model-fine-tuning	10,802
88	baichuan-inc/Baichuan-7B A large-scale 7B pretraining language model developed by BaiChuan-Inc.	+7	45	Emerging	multilingual-llm-adaptation	5,680
89	oumi-ai/oumi Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any...	+7	88	Verified	lora-qlora-fine-tuning	8,907
90	EricLBuehler/mistral.rs Fast, flexible LLM inference	+7	62	Established	rust-llm-infrastructure	6,681
91	jadore801120/attention-is-all-you-need-pytorch A PyTorch implementation of the Transformer model in "Attention is All You Need".	+7	51	Established	attention-mechanism-implementations	9,651
92	OpenNMT/OpenNMT-py Open Source Neural Machine Translation and (Large) Language Models in PyTorch	+7	76	Verified	neural-machine-translation	7,000
93	linkedin/Liger-Kernel Efficient Triton Kernels for LLM Training	+7	90	Verified	lora-qlora-fine-tuning	6,206
94	NielsRogge/Transformers-Tutorials This repository contains demos I made with the Transformers library by HuggingFace.	+7	64	Established	huggingface-learning-resources	11,519
95	zyds/transformers-code 手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube	+7	40	Emerging	huggingface-learning-resources	3,853
96	huggingface/course The Hugging Face course on Transformers	+7	67	Established	huggingface-learning-resources	3,771
97	ymcui/Chinese-LLaMA-Alpaca 中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)	+7	48	Emerging	multilingual-llm-adaptation	18,970
98	LlamaFamily/Llama-Chinese Llama中文社区，实时汇总最新Llama学习资料，构建最好的中文Llama大模型开源生态，完全开源可商用	+7	39	Emerging	multilingual-llm-adaptation	14,737
99	ymcui/Chinese-LLaMA-Alpaca-2 中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs...	+7	47	Emerging	multilingual-llm-adaptation	7,163
100	yangjianxin1/Firefly Firefly:...	+7	37	Emerging	multilingual-llm-adaptation	6,644