All Transformer Models

6,429 models ranked by quality score · Page 4 of 65

Showing 301–400 of 6,429

#	Model	Score	Tier	Category	Stars	Language
301	modelscope/easydistill a toolkit on knowledge distillation for large language models	53	Established	llm-knowledge-distillation	292	Python
302	potamides/DeTikZify Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ.	53	Established	text-to-image-generation	1,738	Python
303	sovit-123/vision_transformers Vision Transformers for image classification, image segmentation, and object...	53	Established	vision-transformer-implementations	65	Python
304	keith2018/TinyGPT Tiny C++ LLM inference implementation from scratch	53	Established	gpt2-pretraining-fine-tuning	106	C++
305	Nicolepcx/Transformers-in-Action This is the corresponding code for the book Transformers in Action	53	Established	transformer-architecture-tutorials	135	Jupyter Notebook
306	guanwei49/LogLLM LogLLM: Log-based Anomaly Detection Using Large Language Models (system log...	53	Established	prompt-engineering-techniques	181	Python
307	tjake/Jlama Jlama is a modern LLM inference engine for Java	53	Established	local-llm-deployment	1,259	Java
308	BioinfoMachineLearning/DeepInteract A geometric deep learning framework (Geometric Transformers) for predicting...	53	Established	protein-transformers-ml	64	Python
309	GeeeekExplorer/nano-vllm Nano vLLM	53	Established	llm-inference-engines	12,189	Python
310	jax-ml/jax-llm-examples Minimal yet performant LLM examples in pure JAX	53	Established	llm-fine-tuning	244	Python
311	hscspring/hcgf Humanable Chat Generative-model Fine-tuning | LLM fine-tuning	53	Established	rlhf-alignment-training	207	Python
312	MattyB95/Jabberjay 🦜 Synthetic Voice Detection	53	Established	wav2vec2-speech-recognition	5	Python
313	tue-mps/eomt [CVPR 2025 Highlight] Official code and models for Encoder-only Mask...	53	Established	medical-image-segmentation-transformers	548	Jupyter Notebook
314	kyegomez/zeta Build high-performance AI models with modular building blocks	53	Established	transformer-architecture-tutorials	579	Python
315	SKTBrain/KoBERT Korean BERT pre-trained cased (KoBERT)	53	Established	korean-language-models	1,407	Python
316	IbrahimSobh/llms Large Language Models: In this repository Language models are introduced...	52	Established	llm-training-experimentation	394	Jupyter Notebook
317	CASE-Lab-UMD/LLM-Drop The official implementation of the paper "Uncovering the Redundancy in...	52	Established	llm-implementation-tutorials	189	Python
318	microsoft/vidur A large-scale simulation framework for LLM inference	52	Established	llm-inference-engines	547	Python
319	fla-org/flame 🔥 A minimal training framework for scaling FLA models	52	Established	sparse-attention-optimization	355	Python
320	sb-ai-lab/RePlay A Comprehensive Framework for Building End-to-End Recommendation Systems...	52	Established	recommendation-systems-transformers	388	Python
321	zhenye234/LLaSA_training LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis	52	Established	llm-inference-engines	659	Python
322	xrsrke/toolformer Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools	52	Established	llm-robot-planning	144	Jupyter Notebook
323	IBM/TabFormer Code & Data for "Tabular Transformers for Modeling Multivariate Time Series"...	52	Established	gpt-model-fine-tuning	360	Python
324	facebookresearch/LayerSkip Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative...	52	Established	llm-implementation-from-scratch	361	Python
325	sinanuozdemir/oreilly-hands-on-gpt-llm Mastering the Art of Scalable and Efficient AI Model Deployment	52	Established	llm-frameworks-libraries	142	Jupyter Notebook
326	bobazooba/xllm 🦖 X—LLM: Cutting Edge & Easy LLM Finetuning	52	Established	llm-training-experimentation	408	Python
327	VectorInstitute/vector-inference Efficient LLM inference on Slurm clusters.	52	Established	llm-inference-serving	95	Python
328	oripress/AlgoTune AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and...	52	Established	code-model-training	95	Python
329	kyegomez/SingLoRA This repository provides a minimal, single-file implementation of SingLoRA...	52	Established	llm-framework-abstractions	44	Python
330	alibaba/InferSim A Lightweight LLM Inference Performance Simulator	52	Established	llm-inference-engines	65	Python
331	FareedKhan-dev/train-llm-from-scratch A straightforward method for training your LLM, from downloading data to...	52	Established	llm-implementation-from-scratch	531	Jupyter Notebook
332	foundation-model-stack/fms-fsdp 🚀 Efficiently (pre)training foundation models with native PyTorch features,...	52	Established	sparse-attention-optimization	282	Python
333	fluxions-ai/vui 100M parameter lightweight conversational text-to-speech model with breaths,...	52	Established	text-to-speech-tts	641	Python
334	yoshoku/llama_cpp.rb llama_cpp.rb provides Ruby bindings for llama.cpp	52	Established	local-llm-deployment	232	C
335	TsinghuaC3I/MARTI A Framework for LLM-based Multi-Agent Reinforced Training and Inference	52	Established	llm-benchmark-leaderboards	453	Python
336	Leeroo-AI/mergoo A library for easily merging multiple LLM experts, and efficiently train the...	52	Established	llm-training-experimentation	507	Python
337	thu-nics/C2C [ICLR'26] The official code implementation for "Cache-to-Cache: Direct...	52	Established	competitive-agent-games	361	Python
338	thammegowda/nllb-serve Meta's "No Language Left Behind" models served as web app and REST API	52	Established	neural-machine-translation	256	Python
339	yuriwa/crewai-sheets-ui Use google sheets as a gui for crewAI	52	Established	interactive-ai-chat-uis	76	Python
340	OpenVoiceOS/ovos-audio-transformer-plugin-ggwave data over sound plugin	52	Established	text-to-speech-tts	2	Python
341	young-geng/scalax A simple library for scaling up JAX programs	52	Established	llm-fine-tuning	146	Python
342	Denis2054/Transformers-for-NLP-2nd-Edition Transformer models from BERT to GPT-4, environments from Hugging Face to...	51	Established	transformer-frameworks-wrappers	957	Jupyter Notebook
343	kyegomez/LongNet Implementation of plug in and play Attention from "LongNet: Scaling...	51	Established	transformer-architecture-education	714	Python
344	NLPOptimize/flash-tokenizer EFFICIENT AND OPTIMIZED TOKENIZER ENGINE FOR LLM INFERENCE SERVING	51	Established	text-tokenization-libraries	509	C++
345	kossisoroyce/timber Ollama for classical ML models. AOT compiler that turns XGBoost, LightGBM,...	51	Established	distributed-training-frameworks	636	Python
346	grammarly/gector Official implementation of the papers "GECToR – Grammatical Error...	51	Established	bert-model-implementations	955	Python
347	tanyuqian/redco NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A...	51	Established	llm-benchmark-leaderboards	69	Python
348	Nicolepcx/transformers-the-definitive-guide This is the official repository for the book Transformers - The Definitive Guide	51	Established	transformer-frameworks-wrappers	80	Jupyter Notebook
349	monologg/KoELECTRA Pretrained ELECTRA Model for Korean	51	Established	korean-language-models	630	Python
350	ikergarcia1996/Easy-Translate Easy-Translate is a script for translating large text files with a SINGLE...	51	Established	neural-machine-translation	227	Python
351	kyegomez/LIMoE Implementation of the "the first large-scale multimodal mixture of experts...	51	Established	mixup-augmentation-frameworks	36	Python
352	cdqa-suite/cdQA ⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.	51	Established	question-answering-systems	617	Python
353	rasbt/LLM-workshop-2024 A 4-hour coding workshop to understand how LLMs are implemented and used	51	Established	llm-learning-resources	1,074	Jupyter Notebook
354	AI-Hypercomputer/JetStream JetStream is a throughput and memory optimized engine for LLM inference on...	51	Established	llm-inference-engines	415	Python
355	ai-forever/ru-gpts Russian GPT3 models.	51	Established	gpt2-pretraining-fine-tuning	2,093	Python
356	fixie-ai/ultravox A fast multimodal LLM for real-time voice	51	Established	multimodal-vision-language	4,377	Python
357	tylerelyt/LLM-Workshop 🌟 Learn Large Language Model development through hands-on projects and...	51	Established	llm-framework-abstractions	88	Python
358	jadore801120/attention-is-all-you-need-pytorch A PyTorch implementation of the Transformer model in "Attention is All You Need".	51	Established	attention-mechanism-implementations	9,651	Python
359	abhimishra91/transformers-tutorials Github repo with tutorials to fine tune transformers for diff NLP tasks	51	Established	transformer-frameworks-wrappers	859	Jupyter Notebook
360	bshao001/ChatLearner A chatbot implemented in TensorFlow based on the seq2seq model, with certain...	51	Established	chatbot-nlp-frameworks	544	Python
361	alephpi/Texo-web The web application for Texo, a minimalist SOTA LaTeX OCR model which...	51	Established	ocr-document-extraction	46	Vue
362	tensorgi/TPA [NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6)...	51	Established	gpt-model-fine-tuning	450	Python
363	monologg/JointBERT Pytorch implementation of JointBERT: "BERT for Joint Intent Classification...	51	Established	bert-model-implementations	738	Python
364	kennethleungty/Llama-2-Open-Source-LLM-CPU-Inference Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A	51	Established	llm-inference-engines	974	Python
365	cure-lab/LTSF-Linear [AAAI-23 Oral] Official implementation of the paper "Are Transformers...	51	Established	time-series-forecasting	2,430	Python
366	vitoplantamura/OnnxStream Lightweight inference library for ONNX files, written in C++. It can run...	51	Established	llm-inference-engines	2,031	C++
367	jeya-maria-jose/Medical-Transformer Official Pytorch Code for "Medical Transformer: Gated Axial-Attention for...	51	Established	medical-image-segmentation-transformers	857	Python
368	OscarKjell/text Using Transformers from HuggingFace in R	51	Established	huggingface-learning-resources	157	R
369	Rishit-dagli/Fast-Transformer An implementation of Additive Attention	51	Established	transformer-architecture-tutorials	148	Jupyter Notebook
370	kyegomez/PALI3 Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS:...	51	Established	vision-language-models	146	Python
371	GURPREETKAURJETHRA/END-TO-END-GENERATIVE-AI-PROJECTS End to End Generative AI Industry Projects on LLM Models with...	51	Established	prompt-engineering-security	515	—
372	daviddaytw/react-native-transformers Run local LLM from Huggingface in React-Native or Expo using onnxruntime.	51	Established	browser-based-ml-inference	128	TypeScript
373	symfony/ai-platform PHP library for interacting with AI platform provider.	51	Established	php-ai-sdks	51	PHP
374	helpmefindaname/transformer-smaller-training-vocab Temporarily remove unused tokens during training to save RAM and improve speed.	51	Established	transformer-architecture-tutorials	23	Python
375	pszemraj/textsum CLI & Python API to easily summarize text-based files with transformers	51	Established	text-summarization-transformers	132	Python
376	shreyansh26/Annotated-ML-Papers Annotations of the interesting ML papers I read	51	Established	ml-foundations-curricula	275	—
377	lonePatient/Bert-Multi-Label-Text-Classification This repo contains a PyTorch implementation of a pretrained BERT model for...	51	Established	text-classification-transformers	924	Python
378	NVlabs/OmniVinci OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and...	51	Established	multimodal-vision-language	639	Python
379	kmeng01/rome Locating and editing factual associations in GPT (NeurIPS 2022)	51	Established	llm-implementation-from-scratch	737	Python
380	showlab/Show-o [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer...	51	Established	multimodal-vision-language	1,894	Python
381	CVHub520/X-AnyLabeling-Server A Simple, Lightweight, and Extensible Serving Framework for X-AnyLabeling	51	Established	blip-image-captioning	166	Python
382	kyegomez/RT-X Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open...	51	Established	vision-language-models	237	Python
383	tensorops/TransformerX Flexible Python library providing building blocks (layers) for reproducible...	51	Established	transformer-architecture-tutorials	53	Python
384	opendilab/LightRFT LightRFT: Light, Efficient, Omni-modal & Reward-model Driven Reinforcement...	51	Established	rlhf-alignment-training	208	Python
385	PKU-Alignment/safe-rlhf Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from...	51	Established	rlhf-alignment-training	1,590	Python
386	tatsu-lab/alpaca_eval An automatic evaluator for instruction-following language models....	51	Established	evaluation-frameworks-metrics	1,957	Jupyter Notebook
387	chanind/frame-semantic-transformer Frame Semantic Parser based on T5 and FrameNet	51	Established	t5-mt5-fine-tuning	65	Python
388	EleutherAI/knowledge-neurons A library for finding knowledge neurons in pretrained transformer models.	50	Established	transformer-interpretability-mechanistic	159	Python
389	pengzhangzhi/Open-dLLM Open diffusion language model for code generation — releasing pretraining,...	50	Established	diffusion-language-models	549	Python
390	cruiseresearchgroup/SensorLLM [EMNLP 2025] Official implementation of "SensorLLM: Aligning Large Language...	50	Established	multimodal-vision-language	83	Python
391	alesanfra/toons A high-performance TOON (Token Oriented Object Notation) parser and...	50	Established	llm-serialization-formats	11	Rust
392	kyegomez/SwitchTransformers Implementation of Switch Transformers from the paper: "Switch Transformers:...	50	Established	transformer-architecture-tutorials	136	Python
393	MDGrey33/pyvisionai The PyVisionAI Official Repo	50	Established	ml-foundations-curricula	112	Python
394	qcri/LLMeBench Benchmarking Large Language Models	50	Established	domain-specific-benchmarks	105	Python
395	BeRo1985/pasllm PasLLM - LLM inference engine in Object Pascal (synced from my private work...	50	Established	local-llm-deployment	76	Pascal
396	ridgerchu/matmulfreellm Implementation for MatMul-free LM.	50	Established	llm-implementation-tutorials	3,058	Python
397	EfficientMoE/MoE-Infinity PyTorch library for cost-effective, fast and easy serving of MoE models.	50	Established	mixture-of-experts-llms	288	Python
398	HPAI-BSC/TuRTLe TuRTLe: A Unified Evaluation of LLMs for RTL Generation 🐢 (MLCAD 2025)	50	Established	evaluation-frameworks-metrics	40	Python
399	jina-ai/rungpt An open-source, cloud-native serving framework for large multi-modal models (LMMs).	50	Established	llm-inference-engines	165	Python
400	Rishit-dagli/Perceiver Implementation of Perceiver, General Perception with Iterative Attention	50	Established	transformer-architecture-tutorials	87	Python
