All Transformer Models

6,429 models ranked by quality score · Page 22 of 65

Showing 2101–2200 of 6,429

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
2101	AIRI-Institute/Probing_framework Framework for probing tasks	32	Emerging	mathematical-reasoning-transformers	31	Python
2102	RishabSA/interp-refusal-tokens We study whether categorical refusal tokens enable controllable and...	32	Emerging	rlhf-alignment-training	7	Python
2103	dirmacs/lancor A Rust client library for llama.cpp's OpenAI-compatible API server	32	Emerging	local-llm-deployment	2	Rust
2104	anthonyfoust/ai-stack-homelab Complete AI automation stack optimized for Mac Mini M4, but can work in...	32	Emerging	local-llm-deployment	7	Shell
2105	taesiri/ArXivQA WIP - Automated Question Answering for ArXiv Papers with Large Language...	32	Emerging	question-answering-systems	377	Python
2106	nsi319/Finetune-Transformers Abstractive text summarization by fine-tuning seq2seq models.	32	Emerging	text-summarization-transformers	39	Python
2107	AspirinCode/AlphaPPImd Exploring the conformational ensembles of protein-protein complexes with...	32	Emerging	protein-transformers-ml	33	Jupyter Notebook
2108	deep-div/Fine-Tuning-LLMs-and-VisionModels Fine-Tuning LLMs (Gemma, LLaMA, Mistral, etc.) A practical guide to...	32	Emerging	lora-qlora-fine-tuning	17	Jupyter Notebook
2109	sixfingerdev/-Sixfinger-API---10-20x-Faster-AI-Chat-API # ⚡ Sixfinger API - 10-20x Faster AI Chat API. İncludes 9 models.	32	Emerging	interactive-ai-chat-uis	6	Python
2110	styfeng/TinyDialogues Code & data for the EMNLP 2024 paper: Is Child-Directed Speech Effective...	31	Emerging	creative-text-generation	12	Python
2111	NTU-SQUAD/transformers-coqa Albert for Conversational Question Answering Challenge	31	Emerging	question-answering-systems	22	Python
2112	titanml/takeoff-community TitanML Takeoff Server is an optimization, compression and deployment...	31	Emerging	llm-inference-engines	114	—
2113	codefuse-ai/GALLa [ACL 2025] Graph Aligned Large Language Models for Improved Source Code Understanding	31	Emerging	graph-language-models	43	Python
2114	pagraf/Seabed-Net Quick start guide for Seabed-Net	31	Emerging	vision-transformer-classification	8	Python
2115	deep-symbolic-mathematics/Multimodal-Symbolic-Regression [ICLR 2024 Spotlight] SNIP on Symbolic Regression: Deep Symbolic Regression...	31	Emerging	mathematical-reasoning-transformers	21	Python
2116	wassemgtk/llm.scala Extensible implementation of a Language Model (LLM) training framework in Scala.	31	Emerging	llm-frameworks-libraries	34	Scala
2117	dropbox/grallama-panel GraLLAMA panel for LLAMA data	31	Emerging	interactive-ai-chat-uis	16	JavaScript
2118	FranxYao/FlanT5-CoT-Specialization Implementation of ICML 23 Paper: Specializing Smaller Language Models...	31	Emerging	chain-of-thought-reasoning	132	Jupyter Notebook
2119	IParraMartin/An-Explanation-Is-All-You-Need The original transformer implementation from scratch. It contains...	31	Emerging	transformer-architecture-education	44	Python
2120	xiaoachen98/Open-LLaVA-NeXT An open-source implementation for training LLaVA-NeXT.	31	Emerging	vision-language-instruction-tuning	436	Python
2121	SCRN-VRC/Language-Translation-with-Fragment-Shaders EN to JP and JP to EN with transformer models	31	Emerging	neural-machine-translation	98	ShaderLab
2122	Chunjiang-Intelligence/Credal-Transformer 论文「Credal Transformer: A Principled Approach for Quantifying and Mitigating...	31	Emerging	transformer-implementation-education	12	Python
2123	RaptorMai/MLLM-CompBench [NeurIPS'25] MLLM-CompBench evaluates the comparative reasoning of MLLMs...	31	Emerging	domain-specific-benchmarks	44	Jupyter Notebook
2124	FudanDISC/ReForm-Eval An benchmark for evaluating the capabilities of large vision-language models (LVLMs)	31	Emerging	safety-robustness-evaluation	46	Python
2125	Yifan-Song793/ETO Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents...	31	Emerging	llm-robot-planning	159	Python
2126	nlp-uoregon/Okapi Okapi: Instruction-tuned Large Language Models in Multiple Languages with...	31	Emerging	rlhf-alignment-training	96	Python
2127	ivanovitchm/PPGEEC2318 Repository for EEC2318, a graduate course on PPgEEC about Machine Learning	31	Emerging	ml-foundations-curricula	31	Jupyter Notebook
2128	TamSiuhin/LLM-UM-Reading A list of large language models for user modeling (LLM-UM) papers, based on...	31	Emerging	llm-research-curation	151	—
2129	tongnie/ImputeFormer [KDD 2024] "ImputeFormer: Low Rankness-Induced Transformers for...	31	Emerging	transformer-interpretability-mechanistic	51	Python
2130	smpanaro/coreml-llm-cli CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.	31	Emerging	llm-quantization-methods	124	Swift
2131	makllama/makllama MaK(Mac+Kubernetes)llama - Running and orchestrating large language models...	31	Emerging	local-llm-deployment	45	Go
2132	Relaxed-System-Lab/HexGen [ICML 2024] Serving LLMs on heterogeneous decentralized clusters.	31	Emerging	llm-inference-engines	34	Python
2133	AGI-Edgerunners/LLM-Optimizers-Papers Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic...	31	Emerging	llm-research-curation	252	—
2134	juzhengz/LoRI [COLM 2025] LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation	31	Emerging	llm-fine-tuning	171	Python
2135	QwenLM/PolyMath [NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath:...	31	Emerging	math-reasoning-datasets	42	Python
2136	Saivineeth147/llm-testlab Comprehensive Testing Tool for Large Language Models	31	Emerging	llm-benchmark-leaderboards	6	Python
2137	miranthajayatilake/nanoQA Question-answering on your own data with Large Language Models (LLMs)	31	Emerging	question-answering-systems	23	Python
2138	ZongXR/8th-National-AI-Training-Competition 第八届全国职工职业技能大赛人工智能训练师赛项	31	Emerging	ml-foundations-curricula	13	Jupyter Notebook
2139	frankluise5220/ComfyUI-Lorahelper A professional automation toolkit for ComfyUI to prepare LoRA training data...	31	Emerging	lora-qlora-fine-tuning	10	Python
2140	DomHudson/bert-in-production A collection of resources on using BERT (https://arxiv.org/abs/1810.04805 )...	31	Emerging	bert-model-implementations	96	—
2141	danieloquelis/natural-language-git Offline LLM-powered Git CLI tool. NLGit interprets your natural language...	31	Emerging	llm-terminal-automation	3	TypeScript
2142	JonSnow1807/Medical-Prescription-OCR OCR system for handwritten medical prescriptions using Donut transformer and...	31	Emerging	ocr-document-extraction	9	Jupyter Notebook
2143	vbario/sleeping-llm A language model that forms persistent memories from conversation and...	31	Emerging	memory-augmented-architectures	52	Python
2144	OpenMOSS/LongLLaDA [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs	31	Emerging	diffusion-language-models	53	Python
2145	singhsidhukuldeep/Text-Summarizer Comparing state of the art models for text summary generation	31	Emerging	text-summarization-transformers	19	Jupyter Notebook
2146	RahulSChand/llama2.c-for-dummies Step by step explanation/tutorial of llama2.c	31	Emerging	local-llm-deployment	225	C
2147	KishanBagaria/dAbot 🤖 CLI tool to automate stuff on DeviantArt.com	31	Emerging	llm-terminal-automation	21	Python
2148	xmed-lab/TAM [ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs	31	Emerging	transformer-interpretability-mechanistic	180	Python
2149	EagleW/Stage-wise-Fine-tuning Code for Stage-wise Fine-tuning for Graph-to-Text Generation	31	Emerging	gpt2-pretraining-fine-tuning	26	Lex
2150	jshuadvd/LongRoPE Implementation of the LongRoPE: Extending LLM Context Window Beyond 2...	31	Emerging	transformer-training-optimization	151	Python
2151	alan-turing-institute/prompto An open source library for asynchronous querying of LLM endpoints	31	Emerging	prompt-engineering-security	36	Python
2152	HLTCHKUST/VG-GPLMs The code repository for EMNLP 2021 paper "Vision Guided Generative...	31	Emerging	vision-language-models	57	Python
2153	Orion-AI-Lab/televit Teleconnection-driven vision transformers for improved long-term forecasting	31	Emerging	vit-image-classification	35	Python
2154	ryoungj/ObsScaling [NeurIPS'24 Spotlight] Observational Scaling Laws	31	Emerging	llm-scaling-architecture	60	Jupyter Notebook
2155	vmarinowski/infini-attention An unofficial pytorch implementation of 'Efficient Infinite Context...	31	Emerging	transformer-architecture-tutorials	55	Python
2156	ant-louis/belgpt2 🇧🇪 BelGPT-2: the 1st GPT model pretrained in French.	31	Emerging	gpt2-pretraining-fine-tuning	34	Python
2157	raymin0223/fast_robust_early_exit Fast and Robust Early-Exiting Framework for Autoregressive Language Models...	31	Emerging	compositional-reasoning-embeddings	65	Python
2158	AlexIoannides/transformers-gen-ai Developing generative language models using transformers.	31	Emerging	gpt-model-fine-tuning	11	Jupyter Notebook
2159	iVishalr/GPT A minimal and efficient Pytorch implementation of OpenAI's GPT (Generative...	31	Emerging	gpt2-pretraining-fine-tuning	18	Jupyter Notebook
2160	mts-ai/OpenAutoNLU An open-source pipeline for training natural language understanding models	31	Emerging	nlp-learning-coursework	39	Python
2161	otvam/pyscalexfmr Optimization and Scaling of Medium-Frequency Transformers	31	Emerging	power-transformer-design	8	Python
2162	yangjianxin1/LongQLoRA LongQLoRA: Extent Context Length of LLMs Efficiently	31	Emerging	llm-fine-tuning	168	Python
2163	Mmorgan-ML/Phase-Slip-Sampler Phase-Slip is a stochastic intervention architecture that operates on the...	31	Emerging	llm-implementation-from-scratch	6	Python
2164	UIC-Liu-Lab/ContinualLM An Extensible Continual Learning Framework Focused on Language Models (LMs)	31	Emerging	prompt-engineering-optimization	293	Python
2165	kyegomez/MambaDecoderBlock MambaDecoderBlock is a novel decoder architecture that replaces traditional...	31	Emerging	3d-vision-transformers	5	Python
2166	ChanMeng666/interactive-story-generator 【Join our constellation of stargazers!⭐️】An interactive AI-powered story...	31	Emerging	prompt-engineering-security	11	Python
2167	shikiw/Modality-Integration-Rate [ICCV 2025] The official code of the paper "Deciphering Cross-Modal...	31	Emerging	vision-language-instruction-tuning	111	Python
2168	curtisgray/wingman Wingman is the fastest and easiest way to run Llama models on your PC or Mac.	31	Emerging	llm-terminal-automation	44	TypeScript
2169	obss/turkish-question-generation Automated question generation and question answering from Turkish texts...	31	Emerging	question-answering-systems	49	Python
2170	ntropy-network/enrichment_models This repository benchmark Ntropy API against different Large Language Models...	31	Emerging	llama-model-implementations	34	Jupyter Notebook
2171	Utshav-paudel/LLM-Zero-to-Hero This repo contains the resources, projects and documentation of mine while...	31	Emerging	llm-implementation-tutorials	34	Jupyter Notebook
2172	dsdanielpark/hf-transllm LLMtranslator translates and generates text in multiple languages.	31	Emerging	llm-translation-tools	45	Jupyter Notebook
2173	Kagamma/llama-pas Free Pascal bindings for llama.cpp	31	Emerging	local-llm-deployment	23	Pascal
2174	qiqiApink/MotionGPT The official PyTorch implementation of the paper "MotionGPT: Finetuned LLMs...	31	Emerging	gpt-multilingual-training	238	Python
2175	vipulraheja/coedit Official implementation of the paper "CoEdIT: Text Editing by Task-Specific...	31	Emerging	llm-knowledge-editing	138	Shell
2176	katanaml/table-query-model Table Query with ML	31	Emerging	ml-benchmarking-frameworks	14	Python
2177	Riko0/messenger_logger_callback messenger-logger-callback — Send ML training logs to Telegram. Standalone...	31	Emerging	conversational-chatbot-applications	2	Python
2178	luiskugel/AI-Writing-Assistant-for-Thunderbird A Thunderbird extension that helps improve your email writing using various...	31	Emerging	ai-powered-business-analytics	4	JavaScript
2179	Phildram1/myantfarm-ai Multi-Agent LLM Orchestration for High-Quality Incident Response - 100%...	31	Emerging	multi-agent-orchestration	8	TeX
2180	LostBeard/SpawnDev.BlazorJS.TransformersJS Use Transformers.js from Blazor WebAssembly to run pretrained models with...	31	Emerging	browser-based-ml-inference	8	C#
2181	Kirill-Kravtsov/drophead-pytorch An implementation of drophead regularization for pytorch transformers	31	Emerging	transformer-architecture-tutorials	19	Python
2182	iboing/CorDA CorDA: Context-Oriented Decomposition Adaptation of Large Language Models...	31	Emerging	llm-knowledge-distillation	55	Python
2183	rohit901/VANE-Bench [NAACL'25] Contains code and documentation for our VANE-Bench paper.	31	Emerging	domain-specific-benchmarks	23	Python
2184	baldoarbol/BodyShapeGPT Fine-tuned LLMs generate accurate 3D human avatars from textual descriptions...	31	Emerging	multimodal-vision-language	37	Python
2185	black-roland/homeassistant-cloud-ru-ai Cloud.ru Foundation Models — cloud-based AI assistants for Home Assistant	31	Emerging	conversational-chatbot-applications	10	Python
2186	pdaicode/awesome-LLMs-finetuning Collection of resources for finetuning Large Language Models (LLMs).	31	Emerging	llm-knowledge-distillation	113	—
2187	naity/finetune-esm Scalable Protein Language Model Finetuning with Distributed Learning and...	31	Emerging	llm-fine-tuning	34	Jupyter Notebook
2188	yinizhilian/ICLR2025-Papers-with-Code 历年ICLR论文和开源项目合集，包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.	31	Emerging	llm-research-curation	562	—
2189	hscspring/llama.np Inference Llama/Llama2/Llama3 Modes in NumPy	31	Emerging	llama-model-implementations	21	Python
2190	samestrin/llm-newsletter-generator llm-newsletter-generator transforms a valid RSS feed into a "Newsletter"...	31	Emerging	youtube-video-summarization	13	Python
2191	Roboflow-Universe/finetune-RF-DETR Modular CLI pipeline for fine‑tuning RF‑DETR object detection models on...	31	Emerging	object-detection-transformers	32	Python
2192	shinomakoi/magi_llm_gui A Qt GUI for large language models	31	Emerging	interactive-ai-chat-uis	45	Python
2193	zzz47zzz/codebase-for-incremental-learning-with-llm [ACL2024] A Codebase for Incremental Learning with Large Language Models;...	31	Emerging	llm-scaling-architecture	60	Python
2194	princeton-pli/AdaptMI [COLM 2025] Adaptive Skill-based In-context Math Instruction for Small...	31	Emerging	math-reasoning-datasets	9	Python
2195	prajjwal1/generalize_lm_nli Code for the paper EMNLP 2021 workshop paper "Generalization in NLI: Ways...	31	Emerging	model-evaluation-diagnostics	34	Jupyter Notebook
2196	dmis-lab/Outlier-Safe-Pre-Training [ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large...	31	Emerging	llm-compression-optimization	35	Python
2197	botisan-ai/sentence-transformers.js Run sentence-transformers (SBERT) compatible models in Node.js or browser.	31	Emerging	browser-based-ml-inference	24	TypeScript
2198	hao-ai-lab/d3LLM d3LLM: Ultra-Fast Diffusion LLM 🚀	31	Emerging	diffusion-language-models	105	Python
2199	amin-tehrani/ollama-colab Serve Ollama LLMs on Google Colab (free plan) using Ngrok	31	Emerging	local-llm-deployment	26	Jupyter Notebook
2200	Zalexanninev15/GetFreeChat Automatic collection of free instances of AI text models (ChatGPT, Claude,...	31	Emerging	multi-provider-llm-interfaces	5	Python

« Prev 1 2 3 … 20 21 22 23 24 … 63 64 65 Next »