All Transformer Models

6,429 models ranked by quality score · Page 9 of 65

Showing 801–900 of 6,429

#	Model	Score	Tier	Category	Stars	Language
801	yuchenlin/LLM-Blender [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to...	44	Emerging	llm-frameworks-libraries	976	Python
802	tommasomncttn/mergenetic Flexible library for merging large language models (LLMs) via evolutionary...	44	Emerging	llm-compression-optimization	100	Jupyter Notebook
803	hiyouga/FastEdit 🩹Editing large language models within 10 seconds⚡	44	Emerging	rlhf-alignment-training	1,359	Python
804	monologg/transformers-android-demo 📲 Transformers android examples (Tensorflow Lite & Pytorch Mobile)	44	Emerging	transformer-frameworks-wrappers	83	Java
805	1b5d/llm-api Run any Large Language Model behind a unified API	44	Emerging	llm-inference-engines	171	Python
806	poloclub/llm-landscape NeurIPS'24 - LLM Safety Landscape	44	Emerging	prompt-engineering-techniques	39	Python
807	cdpierse/transformers-interpret Model explainability that works seamlessly with 🤗 transformers. Explain your...	44	Emerging	transformer-interpretability-mechanistic	1,413	Jupyter Notebook
808	iaalm/llama-api-server An OpenAI API-compatible REST server for llama.	44	Emerging	local-llm-deployment	209	Python
809	gluonfield/enchanted Enchanted is an iOS and macOS app for chatting with private self hosted...	44	Emerging	interactive-ai-chat-uis	5,838	Swift
810	srgtuszy/llama-cpp-swift Swift bindings for llama-cpp library	44	Emerging	llm-docker-deployments	67	Swift
811	gitabtion/BertBasedCorrectionModels PyTorch implementations of BERT-based Spelling Error Correction Models. ...	44	Emerging	bert-model-implementations	279	Python
812	freshllms/freshqa Data and code for FreshLLMs (https://arxiv.org/abs/2310.03214)	44	Emerging	llm-learning-resources	389	Jupyter Notebook
813	hao-ai-lab/Dynasor [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model...	44	Emerging	llm-reasoning-research	224	Python
814	vijaydwivedi75/gnn-lspe Source code for GNN-LSPE (Graph Neural Networks with Learnable Structural...	44	Emerging	graph-transformers	267	Python
815	uclaml/SPIN The official implementation of Self-Play Fine-Tuning (SPIN)	44	Emerging	rlhf-alignment-training	1,235	Python
816	monologg/KoBERT-KorQuAD Korean MRC (KorQuAD) with KoBERT	44	Emerging	korean-language-models	65	Python
817	SqueezeAILab/LLMCompiler [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling	44	Emerging	llm-inference-engines	1,828	Python
818	0hq/WebGPT Run GPT model on the browser with WebGPU. An implementation of GPT inference...	44	Emerging	gpt2-pretraining-fine-tuning	3,784	JavaScript
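819	joyehuang/minimind-notes 🚀 [Building an LLM from scratch] A minimalist guide to the principles and practice of large-model training. Includes core code and controlled experiments for Transformer, Pretraining, and SFT. \| A...	44	Emerging	llm-implementation-tutorials	67	Python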
820	LibreTranslate/Locomotive Toolkit for training/converting LibreTranslate compatible language models 🚂	44	Emerging	machine-translation-transformers	79	Python
821	FMInference/FlexLLMGen Running large language models on a single GPU for throughput-oriented scenarios.	44	Emerging	llm-compression-optimization	9,380	Python
822	RManLuo/reasoning-on-graphs Official Implementation of ICLR 2024 paper: "Reasoning on Graphs: Faithful...	44	Emerging	graph-language-models	497	Python
823	anseryuer/Local_LLM_Deployment_Guide_Chinese Chinese-language tutorial on deploying large language models locally	44	Emerging	llm-finetuning-frameworks	43	—
824	powerserve-project/PowerServe High-speed and easy-use LLM serving framework for local deployment	44	Emerging	llm-inference-engines	146	C++
825	CASE-Lab-UMD/Unified-MoE-Compression The official implementation of the paper "Towards Efficient Mixture of...	44	Emerging	mixture-of-experts-llms	89	Python
826	SakanaAI/text-to-lora Hypernetworks that adapt LLMs for specific benchmark tasks using only...	44	Emerging	llm-fine-tuning	1,214	Python
827	AI-Hypercomputer/jetstream-pytorch PyTorch/XLA integration with JetStream (https://github.com/google/JetStream)...	44	Emerging	llm-inference-engines	79	Python
828	Arunprakash-A/DL-Pytorch-Workshop Develop DL models using Pytorch and Hugging Face	44	Emerging	huggingface-learning-resources	42	—
829	boyiwei/alignment-attribution-code [ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and...	44	Emerging	llm-knowledge-editing	89	Python
830	NX-AI/mlstm_kernels Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels.	44	Emerging	sparse-attention-optimization	87	Jupyter Notebook
831	git-disl/Vaccine This is the official code for the paper "Vaccine: Perturbation-aware...	44	Emerging	llm-hallucination-mitigation	49	Shell
832	mdrokz/rust-llama.cpp LLama.cpp rust bindings	44	Emerging	local-llm-deployment	416	Rust
833	lxe/simple-llm-finetuner Simple UI for LLM Model Finetuning	44	Emerging	lora-qlora-fine-tuning	2,062	Jupyter Notebook
834	salesforce/ETSformer PyTorch code for ETSformer: Exponential Smoothing Transformers for...	44	Emerging	time-series-forecasting-transformers	306	Python
835	deep-symbolic-mathematics/TPSR [NeurIPS 2023] This is the official code for the paper "TPSR:...	44	Emerging	mathematical-reasoning-transformers	81	Python
836	SkalskiP/vlms-zero-to-hero This series will take you on a journey from the fundamentals of NLP and...	44	Emerging	nlp-education-courses	1,158	Jupyter Notebook
837	JetRunner/BERT-of-Theseus ⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT...	43	Emerging	bert-model-implementations	315	Python
838	dipanjanS/adv_nlp_workshop_odsc_europe22 Extensive tutorials for the Advanced NLP Workshop in Open Data Science...	43	Emerging	nlp-learning-coursework	51	Jupyter Notebook
839	sinanuozdemir/oreilly-pytorch-dl Code for Deep Learning for Modern AI	43	Emerging	huggingface-learning-resources	49	Jupyter Notebook
840	IST-DASLab/marlin FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up...	43	Emerging	llm-cuda-optimization	1,039	Python
841	cdpierse/script_buddy_v2 Script Buddy v2 is a film script text generation tool built using film...	43	Emerging	gpt2-pretraining-fine-tuning	47	Jupyter Notebook
842	snap-stanford/relgt Relational Graph Transformer	43	Emerging	graph-transformers	65	Python
843	stay-leave/enhance_llm Practice notes on large language models	43	Emerging	llm-frameworks-libraries	158	Python
844	IlyaGusev/rulm Language modeling and instruction tuning for Russian	43	Emerging	llm-learning-resources	465	Jupyter Notebook
845	armbues/SiLLM SiLLM simplifies the process of training and running Large Language Models...	43	Emerging	apple-silicon-llm-inference	284	Python
846	ShivamRajSharma/Transformer-Architectures-From-Scratch Implementation of transformers based architecture in PyTorch.	43	Emerging	transformer-architecture-education	55	Python
847	AviSoori1x/Tuning-the-Finetuning Tuning the Finetuning: An exploration of achieving success with QLoRA	43	Emerging	lora-qlora-fine-tuning	46	Python
848	jasonvanf/llama-trl LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA	43	Emerging	lora-qlora-fine-tuning	238	Python
849	sinanuozdemir/oreilly-huggingface-tour A Crash Course in Hugging Face	43	Emerging	huggingface-learning-resources	63	Jupyter Notebook
850	gohjiayi/suicidal-text-detection Building a suicidal text detection model and mental health chatbot with deep...	43	Emerging	emotion-detection-transformers	42	Jupyter Notebook
851	cambridgeltl/visual-med-alpaca Visual Med-Alpaca is an open-source, multi-modal foundation model designed...	43	Emerging	clinical-llm-tools	394	Python
852	turtlesoupy/this-word-does-not-exist This Word Does Not Exist	43	Emerging	gpt2-pretraining-fine-tuning	1,021	Python
853	Tongjilibo/build_MiniLLM_from_scratch Build a MiniLLM from 0 to 1 (pretrain+sft+dpo, in progress)	43	Emerging	llm-implementation-tutorials	537	Python
854	analyticalrohit/llms-from-scratch Build a ChatGPT like LLM from scratch in PyTorch, explained step by step.	43	Emerging	llm-implementation-from-scratch	26	Jupyter Notebook
855	laelhalawani/gguf_modeldb A quick and optimized solution to manage llama based gguf quantized models,...	43	Emerging	llm-quantization-methods	12	Python
856	bayesgroup/code_transformers Empirical Study of Transformers for Source Code & A Simple Approach for...	43	Emerging	transformer-architecture-education	66	Python
857	openjlc/riscv64-library Some of the libraries (docs) on the RISCV64 architecture are easy for users...	43	Emerging	local-llm-deployment	69	—
858	princeton-nlp/SimPO [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward	43	Emerging	direct-preference-optimization	946	Python
859	RManLuo/graph-constrained-reasoning Official Implementation of ICML 2025 Paper: "Graph-constrained Reasoning:...	43	Emerging	llm-knowledge-graph-generation	238	Python
860	ddzipp/AutoAudit AutoAudit: the LLM for Cyber Security (a large language model for cybersecurity)	43	Emerging	multilingual-llm-adaptation	353	HTML
861	lxuechen/private-transformers A codebase that makes differentially private training of transformers easy.	43	Emerging	transformer-architecture-tutorials	185	Python
862	datastone-spirit/spirit-lora-trainer Spirit Lora Trainer is a robust toolkit for training Flux1-LoRA models with...	43	Emerging	lora-training-tools	87	Python
863	rohan-paul/LLM-FineTuning-Large-Language-Models LLM (Large Language Model) FineTuning	43	Emerging	llm-fine-tuning	566	Jupyter Notebook
864	iPieter/RobBERT A Dutch RoBERTa-based language model	43	Emerging	bert-model-implementations	207	Jupyter Notebook
865	hoangsonww/Spot-the-Scam-AI-Job-Fraud 🎒 An AI/ML-powered, full-stack job-posting fraud copilot delivering...	43	Emerging	resume-job-matching	11	Python
866	salesforce/CodeTF CodeTF: One-stop Transformer Library for State-of-the-art Code LLM	43	Emerging	power-transformer-design	1,480	Python
867	kyegomez/SparseAttention Pytorch Implementation of the sparse attention from the paper: "Generating...	43	Emerging	attention-mechanism-implementations	94	Python
868	gitabtion/SoftMaskedBert-PyTorch 🙈 An unofficial implementation of SoftMaskedBert based on huggingface/transformers.	43	Emerging	bert-model-implementations	97	Python
869	kevinMEH/keyscan Keyscan: AI-powered API key scanner for GitHub Gists.	43	Emerging	multi-agent-orchestration	37	Python
870	Atome-FE/llama-node Believe in AI democratization. llama for nodejs backed by llama-rs,...	43	Emerging	llm-orchestration-platforms	867	Rust
871	MagedSaeed/generate-sequences A python package made to generate sequences (greedy and beam-search) from...	43	Emerging	creative-text-generation	18	Python
872	eric-ai-lab/MiniGPT-5 Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language...	43	Emerging	gpt2-pretraining-fine-tuning	863	Python
873	Kartik-3004/SegFace [AAAI 25] SegFace: Face Segmentation of Long-tail classes	43	Emerging	medical-image-segmentation-transformers	100	Python
874	varunkumar-dev/TransformersDataAugmentation Code associated with the "Data Augmentation using Pre-trained Transformer...	43	Emerging	essay-scoring-grading	135	Python
875	gitkaz/mlx_gguf_server This is a FastAPI based LLM server. Load multiple LLM models (MLX or...	43	Emerging	llm-docker-deployments	17	Python
876	GAIR-NLP/MegaScience MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning	43	Emerging	lora-qlora-fine-tuning	113	Python
877	datamllab/LongLM [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning	43	Emerging	diffusion-language-models	666	Python
878	magpie-align/magpie [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs...	43	Emerging	llm-domain-datasets	834	Python
879	AliHaiderAhmad001/GPT-from-Scratch-with-Tensorflow Implementation for "Improving Language Understanding by Generative...	43	Emerging	gpt-multilingual-training	19	Python
880	alibaba/GraphTranslator GraphTranslator:Aligning Graph Model to Large Language Model for Open-ended Tasks	43	Emerging	graph-language-models	118	Python
881	zeozeozeo/ellama Friendly interface to chat with an Ollama instance.	43	Emerging	interactive-ai-chat-uis	92	Rust
882	CodeWithKyrian/transformers-php Transformers PHP is a toolkit for PHP developers to add machine learning...	43	Emerging	php-ai-sdks	743	PHP
883	DC-research/TEMPO The official code for "TEMPO: Prompt-based Generative Pre-trained...	43	Emerging	time-series-forecasting-transformers	133	Python
884	CLAIRE-Labo/EvoTune Efficiently discovering algorithms via LLMs with evolutionary search and...	43	Emerging	llm-agent-training-gyms	130	Python
885	deep-diver/llamaduo [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration...	43	Emerging	llm-frameworks-libraries	317	Python
886	SamsungSAILMontreal/nino Code for "Accelerating Training with Neuron Interaction and Nowcasting...	43	Emerging	graph-transformers	28	Python
887	DAGroup-PKU/MHLA MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head...	43	Emerging	compositional-t2i-generation	133	Python
888	ucbrise/graphtrans Representing Long-Range Context for Graph Neural Networks with Global Attention	43	Emerging	graph-neural-networks	136	Python
889	Gleghorn-Lab/Protify Low code molecular property prediction	43	Emerging	protein-transformers-ml	11	Python
890	amirhossein-kz/HiFormer HiFormer: Hierarchical Multi-scale Representations Using Transformers for...	43	Emerging	medical-image-segmentation-transformers	144	Jupyter Notebook
891	hao-ai-lab/JacobiForcing Jacobi Forcing: Fast and Accurate Diffusion-style Decoding	43	Emerging	speculative-decoding-algorithms	143	Python
892	gupta-abhay/pytorch-vit An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale	43	Emerging	vit-image-classification	306	Python
893	aliemo/transfomers-silicon-research Research and Materials on Hardware implementation of Transformer Model	43	Emerging	machine-translation-transformers	299	Jupyter Notebook
894	ml4fp/2025-lbnl ML4FP 2025: notebooks used for the Machine Learning for Fundamental Physics...	43	Emerging	ml-foundations-curricula	21	Jupyter Notebook
895	InternLM/CapRL [ICLR 2026] An official implementation of "CapRL: Stimulating Dense Image...	43	Emerging	multimodal-vision-language	193	Python
896	salcc/QuantumTransformers Quantum Transformers for High Energy Physics Analysis at the Large Hadron Collider	43	Emerging	power-transformer-design	49	Jupyter Notebook
897	nerve-sparks/iris_android IRIS is an android app for interfacing with GGUF / llama.cpp models locally.	43	Emerging	local-llm-deployment	267	Kotlin
898	xlang-ai/Binder [ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"	43	Emerging	math-reasoning-datasets	325	Python
899	modelscope/dash-infer DashInfer is a native LLM inference engine aiming to deliver...	43	Emerging	llm-inference-engines	273	C
900	VikParuchuri/textbook_quality Generate textbook-quality synthetic LLM pretraining data	43	Emerging	synthetic-data-generation	509	Python
