All Transformer Models

6,429 models ranked by quality score · Page 18 of 65

Showing 1701–1800 of 6,429

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
1701	ShelbyJenkins/llm_utils llm_utils: Basic LLM tools, best practices, and minimal abstraction.	35	Emerging	rust-llm-infrastructure	48	Rust
1702	senadkurtisi/pytorch-image-captioning Transformer & CNN Image Captioning model in PyTorch.	35	Emerging	image-captioning-transformers	44	Python
1703	jackaduma/Alpaca-LoRA-RLHF-PyTorch A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer...	35	Emerging	rlhf-alignment-training	61	Python
1704	uncbiag/Awesome-Foundation-Models A curated list of foundation models for vision and language tasks	35	Emerging	multimodal-vision-language-models	1,149	—
1705	zjunlp/LightThinker [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression	35	Emerging	neural-data-compression	134	Python
1706	leftmove/cria Run LLMs locally with as little friction as possible.	35	Emerging	local-llm-deployment	121	Python
1707	nikolaydubina/llama2.go LLaMA-2 in native Go	35	Emerging	local-llm-deployment	194	Go
1708	hoof-ai/hoof "Just hoof it!" - A spotlight like interface to Ollama	35	Emerging	local-llm-deployment	63	Rust
1709	PCfVW/hf-fetch-model Fast HuggingFace model downloads for Rust — an embeddable library for...	35	Emerging	browser-based-ml-inference	1	Rust
1710	ronniross/attention-heatmap-visualizer A set of scripts to generate full attention-head heatmaps for transformer-based LLMs	35	Emerging	llm-implementation-tutorials	13	Jupyter Notebook
1711	hitz-zentroa/whisper-lm Add n-gram and large language model (LLM) support to Whisper models.	35	Emerging	llm-frameworks-libraries	41	Jupyter Notebook
1712	OatmealLiu/FineR [ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models	35	Emerging	llm-knowledge-distillation	190	Python
1713	saddam213/LLamaStack ASP.NET Core Web, WebApi & WPF implementations for LLama.cpp & LLamaSharp	35	Emerging	local-llm-deployment	60	C#
1714	BUAADreamer/SPN4CIR [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning...	35	Emerging	clip-image-embeddings	39	Python
1715	or4cl3-ai-1/ethereal-insights Quantum-Enhanced Paranormal Investigation Platform — benchmarks sensor...	35	Emerging	ai-powered-saas-startups	1	TypeScript
1716	AI4LIFE-GROUP/LLM_Explainer Code for paper: Are Large Language Models Post Hoc Explainers?	35	Emerging	llm-interpretability-explainability	34	Jupyter Notebook
1717	omron-sinicx/crystalframer The official code respository for "Rethinking the role of frames for...	35	Emerging	graph-transformers	15	Python
1718	erevusobolus/THERION-SYSTEM 🦁 THERION — Your AI. Your Hardware. Your Rules. Complete local AI assistant...	35	Emerging	conversational-chatbot-applications	5	Shell
1719	wangcongcong123/ttt A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+	35	Emerging	tokenizer-libraries	37	Python
1720	GeeeekExplorer/transformers-patch patches for huggingface transformers to save memory	35	Emerging	llm-implementation-tutorials	35	Python
1721	OpenNLPLab/TransnormerLLM Official implementation of TransNormerLLM: A Faster and Better LLM	35	Emerging	llm-implementation-tutorials	252	Python
1722	OpenBMB/VisCPM [ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat...	35	Emerging	vision-language-instruction-tuning	1,070	Python
1723	yashbonde/rasp Implementing RASP transformer programming language...	35	Emerging	browser-based-ml-inference	60	Python
1724	yzGuu830/efficient-speech-codec [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector...	35	Emerging	power-transformer-design	125	Jupyter Notebook
1725	teelinsan/parallel-decoding Repository of the paper "Accelerating Transformer Inference for Translation...	35	Emerging	transformer-training-optimization	124	Python
1726	ExplainableML/Vision_by_Language [ICLR 2024] Official repository for "Vision-by-Language for Training-Free...	35	Emerging	multimodal-vision-language	84	Python
1727	Jathurshan0330/Cross-Modal-Transformer Official repository of cross-modal transformer for interpretable automatic...	35	Emerging	multimodal-fusion-transformers	75	Jupyter Notebook
1728	touhi99/askagent Simple mac/unix terminal assistant with LLM agents capable of various tasks	35	Emerging	multi-agent-orchestration	2	Python
1729	systems-genomics-lab/deeptaxa A deep learning framework for hierarchical taxonomy classification of 16S...	35	Emerging	text-classification-transformers	9	Python
1730	ziqipang/RandAR [CVPR 2025 (Oral)] Open implementation of "RandAR"	35	Emerging	vision-language-models	207	Python
1731	JayZhang42/SLED SLED: Self Logits Evolution Decoding for Improving Factuality in Large...	35	Emerging	llm-training-experimentation	119	Python
1732	Lanerra/reasoning-bank-slm An experiment that applies Google Research's `ReasoningBank` technique to...	35	Emerging	llm-reasoning-research	99	Python
1733	YassWorks/Tuna Python library that makes fine-tuning transformer-based models easier and faster.	35	Emerging	lora-qlora-fine-tuning	5	Python
1734	sandseb123/local-lora-cookbook Fine-tune a local LLM on your own app's data in 15 minutes. Runs entirely...	35	Emerging	llm-fine-tuning-optimization	13	Python
1735	iamgmujtaba/llama3.2-webUI LLaMa 3.2 Multimodal Web UI is a user-friendly interface for interacting...	35	Emerging	interactive-ai-chat-uis	35	PHP
1736	its-kumar-yash/deep-study-ai-agent DeepStudy AI automates research, refines queries dynamically, and generates...	35	Emerging	ai-powered-business-analytics	1	TypeScript
1737	Azure99/BlossomData A fluent, scalable, and easy-to-use LLM data processing framework.	35	Emerging	llm-inference-engines	28	Python
1738	Cryolite/kanachan A Japanese (Riichi) Mahjong AI Framework	35	Emerging	ml-foundations-curricula	332	Python
1739	LISA-ITMO/LLM-resume-moderator Автоматизирует модерацию резюме на русском языке с помощью LLM. Для...	35	Emerging	llm-training-experimentation	5	Jupyter Notebook
1740	AIFEG/BenchLMM [ECCV 2024] BenchLMM: Benchmarking Cross-style Visual Capability of Large...	35	Emerging	domain-specific-benchmarks	86	Python
1741	vbdi/divprune [CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large...	35	Emerging	multimodal-vision-language	71	Python
1742	devdhananjay14/multim 🔍 Experiment with neural networks for binary classification on multimodal...	35	Emerging	multimodal-fusion-transformers	1	Python
1743	Wang-ML-Lab/llm-continual-learning-survey [CSUR 2025] Continual Learning of Large Language Models: A Comprehensive Survey	35	Emerging	llm-research-curation	530	—
1744	deep-diver/segformer-tf-transformers This repository demonstrates how to use TensorFlow based SegFormer model in...	35	Emerging	medical-image-segmentation-transformers	30	Jupyter Notebook
1745	HUBioDataLab/SELFormer SELFormer: Molecular Representation Learning via SELFIES Language Models	35	Emerging	molecular-generation-transformers	107	Python
1746	bminixhofer/tokenkit A toolkit implementing advanced methods to transfer models and model...	35	Emerging	text-tokenization-libraries	64	Python
1747	ejaz57/localchat 🌐 Build a private web interface for local LLMs, ensuring complete privacy...	35	Emerging	interactive-ai-chat-uis	1	HTML
1748	vicuna-tools/vicuna-installation-guide The "vicuna-installation-guide" provides step-by-step instructions for...	35	Emerging	mistral-ai-tools	282	—
1749	kyegomez/Fusion3D An extremely experimental model that intakes images and generates 3D scenes...	35	Emerging	text-to-image-generation	7	Python
1750	automorphic-ai/trex Enforce structured output from LLMs 100% of the time	35	Emerging	structured-output-enforcement	251	Python
1751	umbertocappellazzo/Llama-AVSR Official Pytorch implementation of "Large Language Models are Strong...	35	Emerging	multimodal-vision-language	57	Python
1752	hitz-zentroa/whisper-lm-transformers Add n-gram and LLM language model support to HF Transformers Whisper models.	35	Emerging	llm-implementation-tutorials	14	Python
1753	thongnt99/learned-sparse-retrieval Unified Learned Sparse Retrieval Framework	35	Emerging	power-transformer-design	68	Python
1754	ValentinOliveira/ai-recruitment-assistant 🤖 Automate recruitment communication with our AI-powered assistant,...	35	Emerging	resume-job-matching	1	Python
1755	NVlabs/NFT Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging...	35	Emerging	rlhf-alignment-training	71	Python
1756	maxxxzdn/erwin Erwin: A Tree-based Hierarchical Transformer for Large-scale Physical...	35	Emerging	transformer-architecture-tutorials	112	Python
1757	calpt/awesome-adapter-resources Collection of Tools and Papers related to Adapters / Parameter-Efficient...	35	Emerging	parameter-efficient-adapters	202	Python
1758	diogok/llama.cpp.zig A build.zig for llama.cpp	35	Emerging	local-llm-deployment	1	Zig
1759	arshadshk/SAINT-pytorch SAINT PyTorch implementation	35	Emerging	transformer-architecture-tutorials	92	Python
1760	vipulraheja/iterater Official implementation of the paper "IteraTeR: Understanding Iterative...	35	Emerging	llm-implementation-from-scratch	80	Python
1761	AdrianBZG/LLM-distributed-finetune Tune efficiently any LLM model from HuggingFace using distributed training...	35	Emerging	llm-benchmark-leaderboards	60	Python
1762	adarshM84/TextLLaMACode Transform your writing with TextLLaMA! ✍️🚀 Simplify grammar, translate...	35	Emerging	interactive-ai-chat-uis	3	JavaScript
1763	dmis-lab/Monet [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers	35	Emerging	mixture-of-experts-llms	76	Python
1764	akjindal53244/Arithmo Small and Efficient Mathematical Reasoning LLMs	35	Emerging	math-reasoning-datasets	73	Python
1765	MURUGESAN88709/mental-health-finetuned-llama 🧠 Fine-tune LLaMA for mental health applications, providing insights and...	35	Emerging	lora-qlora-fine-tuning	1	Python
1766	developer239/llama.cpp-ts llama.cpp 🦙 LLM inference in TypeScript	35	Emerging	local-llm-deployment	3	C++
1767	vlarine/transformers-ru A list of pretrained Transformer models for the Russian language.	35	Emerging	neural-machine-translation	177	Jupyter Notebook
1768	THUDM/Multilingual-GLM The multilingual variant of GLM, a general language model trained with...	35	Emerging	transformer-architecture-education	62	Python
1769	xmindflow/MSA-2Net [BMVC 2024] Official repository of the paper titled "MSA^2 Net: Multi-scale...	35	Emerging	medical-image-segmentation-transformers	70	Python
1770	woodRock/fishy-business Machine Learning for Rapid Evaporative Ionization Mass Spectrometry for...	35	Emerging	academic-thesis-repositories	3	Python
1771	yyDing1/ScaleQuest [ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective...	35	Emerging	task-oriented-dialogue-systems	68	Python
1772	pdfosborne/elsciRL The core repository of the elsciRL framework.	35	Emerging	llm-scaling-architecture	18	Python
1773	gustavecortal/gpt-j-fine-tuning-example Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression	35	Emerging	model-fine-tuning-methods	68	Jupyter Notebook
1774	aj-naik/Text-Summarization Abstractive and Extractive Text summarization using Transformers.	35	Emerging	text-summarization-transformers	86	Jupyter Notebook
1775	Wangbiao2/R1-Track R1-Track: Direct Application of MLLMs to Visual Object Tracking via...	35	Emerging	multimodal-vision-language	66	Python
1776	zhchen18/ToMBench ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.	35	Emerging	domain-specific-benchmarks	66	Python
1777	BatsResearch/planetarium Dataset and benchmark for assessing LLMs in translating natural language...	35	Emerging	llm-robot-planning	65	Python
1778	otto-de/TRON ⚡️ Implementation of TRON: Transformer Recommender using Optimized...	35	Emerging	recommendation-systems-transformers	74	Python
1779	BaohaoLiao/RSD [ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and...	35	Emerging	speculative-decoding-algorithms	56	Python
1780	declare-lab/CICERO The purpose of this repository is to introduce new dialogue-level...	35	Emerging	ml-api-deployment	64	Python
1781	Bruce-Lee-LY/decoding_attention Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using...	35	Emerging	sparse-attention-optimization	46	C++
1782	xf-zhao/LoT Official implementation of LoT paper: "Enhancing Zero-Shot Chain-of-Thought...	35	Emerging	chain-of-thought-reasoning	30	Python
1783	alibaba/easydist Automated Parallelization System and Infrastructure for Multiple Ecosystems	35	Emerging	llm-inference-engines	82	Python
1784	nsidn98/LLaMAR Code for our paper LLaMAR: LM-based Long-Horizon Planner for Multi-Agent Robotics	35	Emerging	llm-robot-planning	30	Jupyter Notebook
1785	Ereboas/MagiCodec A single-layer, streaming codec model providing SOTA audio quality and...	35	Emerging	diffusion-language-models	113	Python
1786	daviden1013/llm-ie A comprehensive toolkit that provides building blocks for LLM-based named...	35	Emerging	llm-framework-abstractions	53	Python
1787	lucasjinreal/Namo-R1 A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from...	35	Emerging	llm-inference-engines	252	Python
1788	nlpodyssey/gotokenizers Go implementation of today's most used tokenizers	35	Emerging	tokenizer-libraries	44	Go
1789	palonso/MAEST Pre-training, fine-tuning, and inference code with the MAEST models for...	35	Emerging	audio-classification-transformers	54	Python
1790	loong64/ollama Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other...	35	Emerging	local-llm-deployment	9	Dockerfile
1791	sail-sg/Attention-Sink [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical...	35	Emerging	diffusion-language-models	159	Python
1792	ExplainableML/WaffleCLIP Official repository for the ICCV 2023 paper: "Waffling around for...	35	Emerging	multimodal-vision-language	61	Python
1793	KolosalAI/kolosal-server Kolosal AI is an OpenSource and Lightweight alternative to Ollama to run...	35	Emerging	local-llm-deployment	13	C++
1794	uakarsh/latr Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel...	35	Emerging	vision-transformer-optimization	56	Python
1795	mybigday/llama.node Node.js binding of llama.cpp	35	Emerging	local-llm-deployment	19	C++
1796	Sakeeb91/text2sql-agent Self-correcting AI agent for natural language to SQL using HuggingFace...	34	Emerging	multi-agent-orchestration	3	Python
1797	DAMO-NLP-SG/multilingual-safety-for-LLMs [ICLR 2024]Data for "Multilingual Jailbreak Challenges in Large Language Models"	34	Emerging	jailbreak-attacks-analysis	101	—
1798	BubbleJoe-BrownU/TransformerHub This is a repository of transformer-like models, including Transformer, GPT,...	34	Emerging	transformer-architecture-tutorials	87	Python
1799	arcee-ai/PruneMe Automated Identification of Redundant Layer Blocks for Pruning in Large...	34	Emerging	llm-compression-optimization	263	Python
1800	PathologyFoundation/plip Pathology Language and Image Pre-Training (PLIP) is the first vision and...	34	Emerging	clip-vision-language	373	Python

« Prev 1 2 3 … 16 17 18 19 20 … 63 64 65 Next »