All Transformer Models

6,429 models ranked by quality score · Page 7 of 65

Showing 601–700 of 6,429

#	Model	Score	Tier	Category	Stars	Language
601	young-geng/EasyLM Large language models (LLMs) made easy, EasyLM is a one stop solution for...	46	Emerging	llm-training-experimentation	2,522	Python
602	ItsPi3141/alpaca-electron The simplest way to run Alpaca (and other LLaMA-based local LLMs) on your...	46	Emerging	interactive-ai-chat-uis	1,313	JavaScript
603	MahmoudWahdan/dialog-nlu Tensorflow and Keras implementation of the state of the art researches in...	46	Emerging	model-evaluation-diagnostics	100	Jupyter Notebook
604	WangRongsheng/CareGPT 🌞 CareGPT...	46	Emerging	multilingual-llm-adaptation	1,009	Python
605	FoundationVision/Liquid (Accepted by IJCV) Liquid: Language Models are Scalable and Unified...	46	Emerging	multimodal-vision-language-models	640	Python
606	Chongjie-Si/Subspace-Tuning A generalized framework for subspace tuning methods in parameter efficient...	46	Emerging	lora-qlora-fine-tuning	177	Python
607	xNul/chat-llama-discord-bot A Discord Bot for chatting with LLaMA, Vicuna, Alpaca, MPT, or any other...	46	Emerging	messaging-platform-chatbots	120	Python
608	replit/ReplitLM Inference code and configs for the ReplitLM model family	46	Emerging	llm-inference-serving	1,042	Python
609	LianjiaTech/BELLE BELLE: Be Everyone's Large Language model Engine (open-source Chinese dialogue LLM)	46	Emerging	multilingual-llm-adaptation	8,284	HTML
610	SCIR-HI/Huatuo-Llama-Med-Chinese Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large...	46	Emerging	multilingual-llm-adaptation	4,938	Python
611	DAMO-NLP-SG/Video-LLaMA [EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language...	46	Emerging	vision-language-instruction-tuning	3,134	Python
612	THU-SI/Spatial-MLLM [NeurIPS 2025] Official implementation of Spatial-MLLM: Boosting MLLM...	46	Emerging	multimodal-vision-language	447	Python
613	Paranioar/Awesome_Matching_Pretraining_Transfering The Paper List of Large Multi-Modality Model (Perception, Generation,...	46	Emerging	multimodal-vision-language-models	445	—
614	AutoGPTQ/AutoGPTQ An easy-to-use LLMs quantization package with user-friendly apis, based on...	46	Emerging	llm-quantization-methods	5,033	Python
615	zai-org/CogView Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView:...	46	Emerging	text-to-image-generation	1,796	Python
616	deepreinforce-ai/CUDA-L2 CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through...	46	Emerging	llm-cuda-optimization	472	Cuda
617	bytedance/byteir A model compilation solution for various hardware	46	Emerging	llm-inference-engines	465	MLIR
618	KB-AI-Research/KB-ALBERT A Korean ALBERT model specialized for the economics/finance domain, provided by KB Kookmin Bank	46	Emerging	korean-language-models	241	Python
619	skylight-org/sparse-attention-hub Advancing the frontier of efficient AI	46	Emerging	sparse-attention-optimization	54	Python
620	intel/intel-extension-for-transformers ⚡ Build your chatbot within minutes on your favorite device; offer SOTA...	46	Emerging	llm-chat-interfaces	2,177	Python
621	kmeng01/memit Mass-editing thousands of facts into a transformer memory (ICLR 2023)	46	Emerging	llm-knowledge-editing	543	Python
622	voidful/TFkit 🤖📇 handling multiple nlp task in one pipeline	46	Emerging	bert-model-implementations	57	Python
623	dvmazur/mixtral-offloading Run Mixtral-8x7B models in Colab or consumer desktops	46	Emerging	mistral-ai-tools	2,327	Python
624	Cognitive-AI-Systems/MAPF-GPT-DDG [IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding...	46	Emerging	mathematical-reasoning-transformers	61	Python
625	JIA-Lab-research/LongLoRA Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)	46	Emerging	llm-fine-tuning	2,694	Python
626	j-min/VL-T5 PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)	46	Emerging	multimodal-fusion-transformers	374	Python
627	HumanSignal/label-studio-transformers Label data using HuggingFace's transformers and automatically get a...	46	Emerging	text-clustering-topic-modeling	194	Python
628	bradyz/cross_view_transformers Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral)	46	Emerging	semantic-segmentation-techniques	573	Python
629	OctoberChang/X-Transformer X-Transformer: Taming Pretrained Transformers for eXtreme Multi-label Text...	46	Emerging	text-classification-transformers	142	C++
630	synacktraa/tool-parse Making LLM Tool-Calling Simpler.	46	Emerging	llm-function-calling	30	Jupyter Notebook
631	huggingface/optimum-graphcore Blazing fast training of 🤗 Transformers on Graphcore IPUs	46	Emerging	transformer-training-optimization	87	Python
632	Czi24/Awesome-MLLM-LLM-Colab Happy experimenting with MLLM and LLM models!	46	Emerging	llm-learning-resources	129	Jupyter Notebook
633	yuanzhoulvpi2017/quick_sentence_transformers sentence-transformers to ONNX: makes SBERT model inference faster	46	Emerging	model-evaluation-diagnostics	166	Python
634	naru-project/naru Neural Relation Understanding: neural cardinality estimators for tabular data	46	Emerging	power-transformer-design	104	Python
635	quantium-ai/research Research experiments exploring uncommon quant techniques.	46	Emerging	ml-foundations-curricula	34	Jupyter Notebook
636	patil-suraj/onnx_transformers Accelerated NLP pipelines for fast inference on CPU. Built with Transformers...	46	Emerging	transformer-training-optimization	127	Jupyter Notebook
637	LowinLi/fastgpt ⚡ boost inference speed of GPT models in transformers by onnxruntime	46	Emerging	transformer-training-optimization	52	Python
638	AviSoori1x/makeMoE From scratch implementation of a sparse mixture of experts language model...	46	Emerging	mixture-of-experts-llms	793	Jupyter Notebook
639	chaitjo/learning-tsp Code for the paper 'Learning TSP Requires Rethinking Generalization' (CP 2021)	46	Emerging	mathematical-reasoning-transformers	241	Jupyter Notebook
640	tintn/vision-transformer-from-scratch A Simplified PyTorch Implementation of Vision Transformer (ViT)	46	Emerging	vit-image-classification	241	Jupyter Notebook
641	icon-lab/ResViT Official Implementation of ResViT: Residual Vision Transformers for...	46	Emerging	vit-image-classification	177	Python
642	qubvel/transformers-notebooks Inference and fine-tuning examples for vision models from 🤗 Transformers	46	Emerging	vision-transformer-implementations	165	Jupyter Notebook
643	davidiommi/Pytorch--3D-Medical-Images-Segmentation--SALMON Segmentation deep learning ALgorithm based on MONai toolbox: single and...	46	Emerging	medical-image-segmentation-transformers	124	Python
644	dddzg/up-detr [TPAMI 2022 & CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object...	46	Emerging	object-detection-transformers	489	Python
645	ai4co/routefinder [TMLR 2025 + ICML 2024 FM-Wild Oral] RouteFinder: Towards Foundation Models...	46	Emerging	mathematical-reasoning-transformers	111	Python
646	jmisilo/clip-gpt-captioning CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2.	46	Emerging	clip-vision-language	118	Python
647	THUDM/ProteinLM Protein Language Model	46	Emerging	protein-language-models	122	Python
648	USC-FORTIS/AD-LLM [ACL Findings 2025] A benchmark for anomaly detection using large language...	46	Emerging	llm-research-curation	41	Python
649	deveix/react-native-apple-llm React Native Apple LLM plugin using Foundation Models	46	Emerging	ios-nlp-frameworks	319	Swift
650	Emmi-AI/noether Deep-learning framework for Engineering AI. Built on transformer building...	46	Emerging	transformer-architecture-tutorials	131	Python
651	KristiyanVachev/Leaf-Question-Generation Easy to use and understand multiple-choice question generation algorithm...	46	Emerging	question-answering-systems	139	Jupyter Notebook
652	thu-nics/MoA [CoLM'25] The official implementation of the paper	46	Emerging	mixture-of-experts-llms	156	Python
653	Graphlet-AI/eridu Deep fuzzy matching people and company names for multilingual entity...	46	Emerging	named-entity-recognition	3	Python
654	cli99/llm-analysis Latency and Memory Analysis of Transformer Models for Training and Inference	46	Emerging	llm-benchmark-leaderboards	479	Python
655	mbzuai-oryx/LLMVoX LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM	46	Emerging	text-to-speech-tts	299	Python
656	inboxpraveen/LLM-Minutes-of-Meeting 🎤📄 An innovative tool that transforms audio or video files into text...	46	Emerging	text-to-speech-tts	163	Python
657	qingsongedu/time-series-transformers-review A professionally curated list of awesome resources (paper, code, data, etc.)...	46	Emerging	time-series-forecasting-transformers	2,968	—
658	AIoT-MLSys-Lab/SVD-LLM [ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2	46	Emerging	diffusion-language-models	284	Python
659	RLHFlow/RLHF-Reward-Modeling Recipes to train reward model for RLHF.	46	Emerging	rlhf-alignment-training	1,520	Python
660	sinanuozdemir/oreilly-optimizing-llms Optimizing LLMs with Fine-Tuning and Prompt Engineering	46	Emerging	llm-scaling-architecture	88	Jupyter Notebook
661	verifai/multiLLM 🚀 Invoke multiple large language models concurrently and then rank the results....	46	Emerging	llm-frameworks-libraries	83	Python
662	FudanDISC/DISC-LawLLM [Chinese legal LLM] DISC-LawLLM: an intelligent legal system powered by large language...	46	Emerging	legal-document-analysis	874	Python
663	mit-han-lab/lite-transformer [ICLR 2020] Lite Transformer with Long-Short Range Attention	46	Emerging	machine-translation-transformers	610	Python
664	TigerResearch/TigerBot TigerBot: A multi-language multi-task LLM	45	Emerging	llm-frameworks-libraries	2,263	Python
665	zhvng/open-musiclm Implementation of MusicLM, a text to music model published by Google...	45	Emerging	ai-music-generation	562	Python
666	FareedKhan-dev/train-llama4 Building LLaMA 4 MoE from Scratch	45	Emerging	llm-implementation-tutorials	72	Jupyter Notebook
667	Deep-Spark/DeepSparkInference DeepSparkInference has selected 216 inference models of both small and large...	45	Emerging	llm-inference-engines	28	Python
668	FasterDecoding/Medusa Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads	45	Emerging	llm-frameworks-libraries	2,717	Jupyter Notebook
669	kyegomez/PALM-E Implementation of "PaLM-E: An Embodied Multimodal Language Model"	45	Emerging	vision-language-models	335	Python
670	hiyouga/Dual-Contrastive-Learning Code for our paper "Dual Contrastive Learning: Text Classification via...	45	Emerging	text-clustering-topic-modeling	167	Python
671	Wang-ML-Lab/bayesian-peft Bayesian Low-Rank Adaptation of LLMs: BLoB [NeurIPS 2024] and TFB [NeurIPS 2025]	45	Emerging	llm-knowledge-distillation	35	Python
672	imoneoi/openchat OpenChat: Advancing Open-source Language Models with Imperfect Data	45	Emerging	multi-provider-llm-interfaces	5,476	Python
673	lyuchenyang/Macaw-LLM Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text...	45	Emerging	vision-language-models	1,593	Python
674	InternLM/SIM-CoT [ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit...	45	Emerging	chain-of-thought-reasoning	185	Python
675	X-PLUG/mPLUG-Owl mPLUG-Owl: The Powerful Multi-modal Large Language Model Family	45	Emerging	vision-language-instruction-tuning	2,540	Python
676	FairyFali/SLMs-Survey Survey of Small Language Models from Penn State, ...	45	Emerging	llm-research-curation	248	—
677	gabeur/mmt Multi-Modal Transformer for Video Retrieval	45	Emerging	multimodal-visual-grounding	265	Python
678	domschl/HuggingFaceGuidedTourForMac A guided tour on how to use HuggingFace large language models on Macs with...	45	Emerging	huggingface-learning-resources	201	Jupyter Notebook
679	danielzuegner/code-transformer Implementation of the paper "Language-agnostic representation learning of...	45	Emerging	power-transformer-design	173	Python
680	jxiw/MambaInLlama [NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and...	45	Emerging	diffusion-language-models	238	Python
681	snwfdhmp/llm Use any LLM from the command line.	45	Emerging	llm-terminal-automation	59	JavaScript
682	IBM/regression-transformer Regression Transformer (2023; Nature Machine Intelligence)	45	Emerging	transformer-architecture-education	159	Python
683	ashishpatel26/LLM-Finetuning LLM Finetuning with peft	45	Emerging	lora-qlora-fine-tuning	2,827	Jupyter Notebook
684	JIA-Lab-research/LISA Project Page for "LISA: Reasoning Segmentation via Large Language Model"	45	Emerging	llm-scaling-architecture	2,604	Python
685	deepglint/unicom Large-Scale Visual Representation Model	45	Emerging	multimodal-vision-language	704	Python
686	QData/C-Tran General Multi-label Image Classification with Transformers	45	Emerging	vision-transformer-classification	280	Python
687	VarunGumma/IndicTransToolkit A simple, consistent and extendable toolkit for IndicTrans2. (Pypi:...	45	Emerging	indic-language-translation	38	Cython
688	DarshanDeshpande/jax-models Unofficial JAX implementations of deep learning research papers	45	Emerging	transformer-frameworks-wrappers	161	Python
689	THUDM/LongBench LongBench v2 and LongBench (ACL 25'&24')	45	Emerging	domain-specific-benchmarks	1,113	Python
690	marella/ctransformers Python bindings for the Transformer models implemented in C/C++ using GGML library.	45	Emerging	transformer-architecture-tutorials	1,882	C
691	microsoft/LLF-Bench A benchmark for evaluating learning agents based on just language feedback	45	Emerging	domain-specific-benchmarks	95	Python
692	PhoebusSi/Alpaca-CoT We unified the interfaces of instruction-tuning data (e.g., CoT data),...	45	Emerging	multilingual-llm-adaptation	2,801	Jupyter Notebook
693	sobelio/llm-chain `llm-chain` is a powerful rust crate for building chains in large language...	45	Emerging	local-llm-deployment	1,593	Rust
694	open-mmlab/Multimodal-GPT Multimodal-GPT	45	Emerging	vision-language-instruction-tuning	1,517	Python
695	rxn4chemistry/rxn-onmt-models Training of OpenNMT-based RXN models	45	Emerging	molecular-generation-transformers	2	Python
696	Yangyi-Chen/Multimodal-AND-Large-Language-Models Paper list about multimodal and large language models, only used to record...	45	Emerging	multimodal-vision-language-models	756	—
697	donderom/llm4s Scala 3 bindings for llama.cpp 🦙	45	Emerging	local-llm-deployment	65	Scala
698	YJiangcm/FollowBench [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following...	45	Emerging	domain-specific-benchmarks	119	Python
699	rishikksh20/convolution-vision-transformers PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers	45	Emerging	vision-transformer-implementations	226	Python
700	RWKV/rwkv.cpp INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model	45	Emerging	llm-inference-engines	1,563	C++
