All Transformer Models

6,427 models ranked by quality score · Page 5 of 65

Showing 401–500 of 6,427

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
401	microsoft/vidur A large-scale simulation framework for LLM inference	45	Emerging	llm-inference-engines	547	Python
402	facebookresearch/LayerSkip Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative...	45	Emerging	llm-implementation-from-scratch	361	Python
403	yuriwa/crewai-sheets-ui Use google sheets as a gui for crewAI	45	Emerging	interactive-ai-chat-uis	76	Python
404	FasterDecoding/Medusa Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads	45	Emerging	llm-frameworks-libraries	2,717	Jupyter Notebook
405	yoshoku/llama_cpp.rb llama_cpp.rb provides Ruby bindings for llama.cpp	45	Emerging	local-llm-deployment	232	C
406	riyanshibohra/TuneKit Upload your data → Get a fine-tuned SLM. Free.	45	Emerging	llm-fine-tuning	138	Python
407	alephpi/Texo-web The web application for Texo, a minimalist SOTA LaTeX OCR model which...	45	Emerging	ocr-document-extraction	46	Vue
408	rxn4chemistry/rxn-onmt-models Training of OpenNMT-based RXN models	45	Emerging	molecular-generation-transformers	2	Python
409	young-geng/scalax A simple library for scaling up JAX programs	45	Emerging	llm-fine-tuning	146	Python
410	imoneoi/openchat OpenChat: Advancing Open-source Language Models with Imperfect Data	45	Emerging	multi-provider-llm-interfaces	5,476	Python
411	VectorInstitute/vector-inference Efficient LLM inference on Slurm clusters.	45	Emerging	llm-inference-serving	95	Python
412	IbrahimSobh/llms Large Language Models: In this repository Language models are introduced...	45	Emerging	llm-training-experimentation	394	Jupyter Notebook
413	bobazooba/xllm 🦖 X—LLM: Cutting Edge & Easy LLM Finetuning	45	Emerging	llm-training-experimentation	408	Python
414	ashishpatel26/LLM-Finetuning LLM Finetuning with peft	45	Emerging	lora-qlora-fine-tuning	2,827	Jupyter Notebook
415	PhoebusSi/Alpaca-CoT We unified the interfaces of instruction-tuning data (e.g., CoT data),...	45	Emerging	multilingual-llm-adaptation	2,801	Jupyter Notebook
416	Leeroo-AI/mergoo A library for easily merging multiple LLM experts, and efficiently train the...	45	Emerging	llm-training-experimentation	507	Python
417	fla-org/flame 🔥 A minimal training framework for scaling FLA models	45	Emerging	sparse-attention-optimization	355	Python
418	tensorops/TransformerX Flexible Python library providing building blocks (layers) for reproducible...	44	Emerging	transformer-architecture-tutorials	53	Python
419	Denis2054/Transformers-for-NLP-2nd-Edition Transformer models from BERT to GPT-4, environments from Hugging Face to...	44	Emerging	transformer-frameworks-wrappers	957	Jupyter Notebook
420	pytorch/torchchat Run PyTorch LLMs locally on servers, desktop and mobile	44	Emerging	llm-inference-serving	3,625	Python
421	inclusionAI/asystem-awex A high-performance RL training-inference weight synchronization framework,...	44	Emerging	llm-inference-engines	138	Python
422	kennethleungty/Llama-2-Open-Source-LLM-CPU-Inference Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A	44	Emerging	llm-inference-engines	974	Python
423	kyegomez/PALI3 Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS:...	44	Emerging	vision-language-models	146	Python
424	kyegomez/GPT4o Community Open Source Implementation of GPT4o in PyTorch	44	Emerging	gpt2-pretraining-fine-tuning	26	Shell
425	kyegomez/LIMoE Implementation of the "the first large-scale multimodal mixture of experts...	44	Emerging	mixup-augmentation-frameworks	36	Python
426	ai-forever/ru-gpts Russian GPT3 models.	44	Emerging	gpt2-pretraining-fine-tuning	2,093	Python
427	gluonfield/enchanted Enchanted is iOS and macOS app for chatting with private self hosted...	44	Emerging	interactive-ai-chat-uis	5,838	Swift
428	0hq/WebGPT Run GPT model on the browser with WebGPU. An implementation of GPT inference...	44	Emerging	gpt2-pretraining-fine-tuning	3,784	JavaScript
429	PKU-Alignment/safe-rlhf Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from...	44	Emerging	rlhf-alignment-training	1,590	Python
430	FMInference/FlexLLMGen Running large language models on a single GPU for throughput-oriented scenarios.	44	Emerging	llm-compression-optimization	9,380	Python
431	grammarly/gector Official implementation of the papers "GECToR – Grammatical Error...	44	Emerging	bert-model-implementations	955	Python
432	NLPOptimize/flash-tokenizer EFFICIENT AND OPTIMIZED TOKENIZER ENGINE FOR LLM INFERENCE SERVING	44	Emerging	text-tokenization-libraries	509	C++
433	cdqa-suite/cdQA ⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.	44	Emerging	question-answering-systems	617	Python
434	bshao001/ChatLearner A chatbot implemented in TensorFlow based on the seq2seq model, with certain...	44	Emerging	chatbot-nlp-frameworks	544	Python
435	OscarKjell/text Using Transformers from HuggingFace in R	44	Emerging	huggingface-learning-resources	157	R
436	Nicolepcx/transformers-the-definitive-guide This is the official repository for the book Transformers - The Definitive Guide	44	Emerging	transformer-frameworks-wrappers	80	Jupyter Notebook
437	pszemraj/textsum CLI & Python API to easily summarize text-based files with transformers	44	Emerging	text-summarization-transformers	132	Python
438	jeya-maria-jose/Medical-Transformer Official Pytorch Code for "Medical Transformer: Gated Axial-Attention for...	44	Emerging	medical-image-segmentation-transformers	857	Python
439	showlab/Show-o [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer...	44	Emerging	multimodal-vision-language	1,894	Python
440	monologg/KoELECTRA Pretrained ELECTRA Model for Korean	44	Emerging	korean-language-models	630	Python
441	Mann1988/awesome-claude-skills 📊 Explore high-quality Claude skills focused on business analysis and...	44	Emerging	multi-agent-orchestration	20	Python
442	cure-lab/LTSF-Linear [AAAI-23 Oral] Official implementation of the paper "Are Transformers...	44	Emerging	time-series-forecasting	2,430	Python
443	monologg/JointBERT Pytorch implementation of JointBERT: "BERT for Joint Intent Classification...	44	Emerging	bert-model-implementations	738	Python
444	vitoplantamura/OnnxStream Lightweight inference library for ONNX files, written in C++. It can run...	44	Emerging	llm-inference-engines	2,031	C++
445	chanind/frame-semantic-transformer Frame Semantic Parser based on T5 and FrameNet	44	Emerging	t5-mt5-fine-tuning	65	Python
446	tanyuqian/redco NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A...	44	Emerging	llm-benchmark-leaderboards	69	Python
447	lonePatient/Bert-Multi-Label-Text-Classification This repo contains a PyTorch implementation of a pretrained BERT model for...	44	Emerging	text-classification-transformers	924	Python
448	pengzhangzhi/Open-dLLM Open diffusion language model for code generation — releasing pretraining,...	44	Emerging	diffusion-language-models	549	Python
449	shreyansh26/Annotated-ML-Papers Annotations of the interesting ML papers I read	44	Emerging	ml-foundations-curricula	275	—
450	ikergarcia1996/Easy-Translate Easy-Translate is a script for translating large text files with a SINGLE...	44	Emerging	neural-machine-translation	227	Python
451	daviddaytw/react-native-transformers Run local LLM from Huggingface in React-Native or Expo using onnxruntime.	44	Emerging	browser-based-ml-inference	128	TypeScript
452	bytedance/video-SALMONN-2 video-SALMONN 2 is a powerful audio-visual large language model (LLM) that...	44	Emerging	multimodal-vision-language	167	Python
453	Rishit-dagli/Fast-Transformer An implementation of Additive Attention	44	Emerging	transformer-architecture-tutorials	148	Jupyter Notebook
454	olivkoch/nano-trm An implementation of Tiny Recursive Models (TRM)	44	Emerging	transformer-training-optimization	101	Python
455	kmeng01/rome Locating and editing factual associations in GPT (NeurIPS 2022)	44	Emerging	llm-implementation-from-scratch	737	Python
456	GURPREETKAURJETHRA/END-TO-END-GENERATIVE-AI-PROJECTS End to End Generative AI Industry Projects on LLM Models with...	44	Emerging	prompt-engineering-security	515	—
457	tatsu-lab/alpaca_eval An automatic evaluator for instruction-following language models....	44	Emerging	evaluation-frameworks-metrics	1,957	Jupyter Notebook
458	abhimishra91/transformers-tutorials Github repo with tutorials to fine tune transformers for diff NLP tasks	44	Emerging	transformer-frameworks-wrappers	859	Jupyter Notebook
459	tensorgi/TPA [NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6)...	44	Emerging	gpt-model-fine-tuning	450	Python
460	Gen-Verse/dLLM-RL [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for...	44	Emerging	rlhf-alignment-training	459	Python
461	tylerelyt/LLM-Workshop 🌟 Learn Large Language Model development through hands-on projects and...	44	Emerging	llm-framework-abstractions	88	Python
462	kyegomez/LongNet Implementation of plug in and play Attention from "LongNet: Scaling...	44	Emerging	transformer-architecture-education	714	Python
463	rasbt/LLM-workshop-2024 A 4-hour coding workshop to understand how LLMs are implemented and used	44	Emerging	llm-learning-resources	1,074	Jupyter Notebook
464	Rishit-dagli/Perceiver Implementation of Perceiver, General Perception with Iterative Attention	43	Emerging	transformer-architecture-tutorials	87	Python
465	polakowo/gpt2bot Your new Telegram buddy powered by transformers	43	Emerging	conversational-chatbot-applications	442	Jupyter Notebook
466	willyfh/graph-transformer An unofficial implementation of Graph Transformer (Masked Label Prediction:...	43	Emerging	graph-neural-networks	35	Python
467	jina-ai/rungpt An open-source cloud-native of large multi-modal models (LMMs) serving framework.	43	Emerging	llm-inference-engines	165	Python
468	analyticalrohit/llms-from-scratch Build a ChatGPT like LLM from scratch in PyTorch, explained step by step.	43	Emerging	llm-implementation-from-scratch	26	Jupyter Notebook
469	cruiseresearchgroup/SensorLLM [EMNLP 2025] Official implementation of "SensorLLM: Aligning Large Language...	43	Emerging	multimodal-vision-language	83	Python
470	camenduru/text-generation-webui-colab A colab gradio web UI for running Large Language Models	43	Emerging	local-llm-deployment	2,093	Jupyter Notebook
471	salesforce/TransmogrifAI TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building...	43	Emerging	browser-based-ml-inference	2,272	Scala
472	Tzohar/PassLLM World's most accurate password guessing AI tool. A PyTorch implementation of...	43	Emerging	llm-training-experimentation	85	Python
473	bbruceyuan/LLMs-Zero-to-Hero 从无名小卒到大模型（LLM）大英雄~ 欢迎关注后续！！！	43	Emerging	llm-learning-resources	2,065	Jupyter Notebook
474	sinanuozdemir/oreilly-llm-rl-alignment This training offers an intensive exploration into the frontier of...	43	Emerging	rlhf-alignment-training	59	Jupyter Notebook
475	Tencent-Hunyuan/GradLoc Implementation of GradLoc from the Tencent Hunyuan blog "Stabilizing RLVR...	43	Emerging	llm-scaling-architecture	89	Python
476	uber-research/PPLM Plug and Play Language Model implementation. Allows to steer topic and...	43	Emerging	llm-finetuning-frameworks	1,155	Python
477	SamsungSAILMontreal/nino Code for "Accelerating Training with Neuron Interaction and Nowcasting...	43	Emerging	graph-transformers	28	Python
478	ml4fp/2025-lbnl ML4FP 2025: notebooks used for the Machine Learning for Fundamental Physics...	43	Emerging	ml-foundations-curricula	21	Jupyter Notebook
479	MagedSaeed/generate-sequences A python package made to generate sequences (greedy and beam-search) from...	43	Emerging	creative-text-generation	18	Python
480	AliHaiderAhmad001/GPT-from-Scratch-with-Tensorflow Implementation for "Improving Language Understanding by Generative...	43	Emerging	gpt-multilingual-training	19	Python
481	EleutherAI/knowledge-neurons A library for finding knowledge neurons in pretrained transformer models.	43	Emerging	transformer-interpretability-mechanistic	159	Python
482	microsoft/rat-sql A relation-aware semantic parsing model from English to SQL	43	Emerging	multi-agent-orchestration	446	Python
483	adrienpetralia/NILMFormer [KDD 2025] NILMFormer: A Sequence-To-Sequence Non-Stationarity Aware...	43	Emerging	energy-sector-forecasting	34	Python
484	kenhktsui/anyclassifier One Line To Build Zero-Data Classifiers in Minutes	43	Emerging	text-classification	64	Python
485	huggingface/transformers-bloom-inference Fast Inference Solutions for BLOOM	43	Emerging	machine-translation-transformers	566	Python
486	sammcj/ingest Parse files (e.g. code repos) and websites to clipboard or a file for...	43	Emerging	local-llm-deployment	367	Go
487	JIA-Lab-research/MGM-Omni MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech	43	Emerging	local-voice-assistants	265	Python
488	backprop-ai/backprop Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.	43	Emerging	bert-model-implementations	241	Python
489	Gleghorn-Lab/Protify Low code molecular property prediction	43	Emerging	protein-transformers-ml	11	Python
490	alephpi/Texo A minimalist SOTA LaTeX OCR model with only 20M parameters, running in...	43	Emerging	ocr-document-extraction	747	Python
491	gordicaleksa/pytorch-original-transformer My implementation of the original transformer model (Vaswani et al.). I've...	43	Emerging	transformer-architecture-tutorials	1,085	Jupyter Notebook
492	EfficientMoE/MoE-Infinity PyTorch library for cost-effective, fast and easy serving of MoE models.	43	Emerging	mixture-of-experts-llms	288	Python
493	r2d4/rellm Exact structure out of any language model completion.	43	Emerging	llm-training-experimentation	514	Python
494	hoangsonww/Spot-the-Scam-AI-Job-Fraud 🎒 An AI/ML-powered, full-stack job-posting fraud copilot delivering...	43	Emerging	resume-job-matching	11	Python
495	appvision-ai/fast-bert Super easy library for BERT based NLP models	43	Emerging	bert-model-implementations	1,920	Python
496	MDGrey33/pyvisionai The PyVisionAI Official Repo	43	Emerging	ml-foundations-curricula	112	Python
497	LM-Kit/lm-kit-net-samples .NET samples for LM-Kit.NET	43	Emerging	local-llm-deployment	38	C#
498	laelhalawani/gguf_modeldb A quick and optimized solution to manage llama based gguf quantized models,...	43	Emerging	llm-quantization-methods	12	Python
499	KRR-Oxford/HierarchyTransformers Language Models as Hierarchy Encoders	43	Emerging	transformer-architecture-tutorials	40	Python
500	gitkaz/mlx_gguf_server This is a FastAPI based LLM server. Load multiple LLM models (MLX or...	43	Emerging	llm-docker-deployments	17	Python

« Prev 1 2 3 4 5 6 7 … 63 64 65 Next »