All Transformer Models

6,968 models ranked by quality score · Page 23 of 70

Showing 2201–2300 of 6,968


#	Model	Score	Tier	Category	Stars	Language
2201	ManashJKonwar/NLP-Transformers Transformer (BERT, GPT2, etc.) based Training Module for popular NLP tasks	34	Emerging	model-evaluation-diagnostics	9	Python
2202	Gen-Verse/ReasonFlux [NeurIPS 2025 Spotlight] LLM post-training suite — featuring ReasonFlux,...	34	Emerging	code-model-training	524	Python
2203	leliuga/cohere-configurations Co:Here Inference configurations	34	Emerging	llm-quantization-methods	10	Go
2204	Hon-Wong/VoRA [Fully open] [Encoder-free MLLM] Vision as LoRA	34	Emerging	multimodal-vision-language	379	Python
2205	nanowell/Differential-Transformer-PyTorch PyTorch implementation of the Differential-Transformer architecture for...	34	Emerging	transformer-architecture-education	86	Python
2206	X-iZhang/CCD 📷 CCD: Mitigating Hallucinations in Radiology MLLMs via Clinical Contrastive...	34	Emerging	image-generation-mcp	9	Python
2207	CLAIRE-Labo/quantile-reward-policy-optimization Official codebase for "Quantile Reward Policy Optimization: Alignment with...	34	Emerging	rlhf-alignment-training	30	Python
2208	cifkao/context-probing Black-box language model explanation by context length probing	34	Emerging	mathematical-reasoning-transformers	9	Jupyter Notebook
2209	nareshis21/Truelarge-RT Android inference engine running 20B+ parameter LLMs on 4GB-8GB RAM devices....	34	Emerging	llm-inference-engines	9	Kotlin
2210	Hamtech-ai/Persian-Image-Captioning A Persian Image Captioning model based on Vision Encoder Decoder Models of...	34	Emerging	image-captioning-transformers	20	Jupyter Notebook
2211	dougeeai/llama-cpp-python-wheels Pre-built wheels for llama-cpp-python across platforms and CUDA versions	34	Emerging	llm-docker-deployments	40	—
2212	forgi86/sysid-transformers Code to reproduce the results of the paper In-context learning for...	34	Emerging	transformer-architecture-education	19	Jupyter Notebook
2213	starmpcc/CAMEL Clinically Adapted Model Enhanced from LLaMA	34	Emerging	multilingual-llm-adaptation	89	Python
2214	davide-coccomini/MINTIME-Multi-Identity-size-iNvariant-TIMEsformer-for-Video-Deepfake-Detection Code for Video Deepfake Detector from "MINTIME: Multi-Identity...	34	Emerging	ai-content-detection	68	Jupyter Notebook
2215	suyash/mlt Multilingual Neural Machine Translation using Transformers with Conditional...	34	Emerging	neural-machine-translation	18	Jupyter Notebook
2216	PKU-Alignment/beavertails BeaverTails is a collection of datasets designed to facilitate research on...	34	Emerging	rlhf-alignment-training	176	Makefile
2217	AntonioGr7/pratical-llms A collection of hands-on notebooks for LLM practitioners	34	Emerging	llm-learning-resources	51	Jupyter Notebook
2218	CEC-Agent/CEC Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for...	34	Emerging	power-transformer-design	31	Python
2219	fboulnois/llm-leaderboard-csv CSVs of the Huggingface and LMArena LLM leaderboards, along with the code to...	34	Emerging	llm-benchmark-leaderboards	30	Python
2220	jorgemunozl/Finetunning-Llama-Vision-11b Inference and fine-tuning of a VLM (LLama Vision 11b) using the Unsloth,...	34	Emerging	lora-qlora-fine-tuning	9	Python
2221	jakobtroidl/neuron-shape-reasoning PyTorch Implementation of Global Neuron Shape Reasoning with Point Affinity...	34	Emerging	transformer-interpretability-mechanistic	13	Jupyter Notebook
2222	ASSERT-KTH/repairllama RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program...	34	Emerging	llama-model-implementations	39	Jupyter Notebook
2223	ModelTC/QLLM [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate...	34	Emerging	llm-quantization-methods	39	Python
2224	nestordemeure/stop_word Huggingface transformers stopping criteria that halts the generation when a...	34	Emerging	huggingface-learning-resources	9	Python
2225	SqueezeAILab/KVQuant [NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with...	34	Emerging	llm-quantization-methods	406	Python
2226	henrikalbihn/gliner-as-a-service GLiNER model in a FastAPI microservice.	34	Emerging	ml-api-deployment	47	Python
2227	Infini-AI-Lab/Sequoia scalable and robust tree-based speculative decoding algorithm	34	Emerging	speculative-decoding-algorithms	372	Python
2228	sdpkjc/SATQuest 🏞 A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs	34	Emerging	llm-reasoning-research	5	Python
2229	wang2226/Awesome-LLM-Decoding 📜 Paper list on decoding methods for LLMs and LVLMs	34	Emerging	llm-research-curation	70	—
2230	itsqyh/Awesome-LMMs-Mechanistic-Interpretability A curated collection of resources focused on the Mechanistic...	34	Emerging	llm-interpretability-explainability	192	—
2231	NiuTrans/LaMaTE Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine...	34	Emerging	llm-scaling-architecture	28	Python
2232	moritztng/fltr Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.	34	Emerging	local-llm-deployment	387	Rust
2233	DCQN-axiomatics/DCQN-Matrix-Axiomatik-LLM-Protocol A strict, deterministic LLM protocol for loading, reading and activating the...	34	Emerging	advanced-prompt-protocols	4	—
2234	PathologyFoundation/plip Pathology Language and Image Pre-Training (PLIP) is the first vision and...	34	Emerging	clip-vision-language	373	Python
2235	ksm26/Open-Source-Models-with-Hugging-Face "Open Source Models with Hugging Face" course empowers you with the skills...	34	Emerging	hugging-face-tutorials	33	Jupyter Notebook
2236	MNoorFawi/curlora The code repository for the CURLoRA research paper. Stable LLM continual...	34	Emerging	llm-fine-tuning	53	Jupyter Notebook
2237	CASE-Lab-UMD/Router-Tuning-Mixture-of-Depths The open-source Mixture of Depths code and the official implementation of...	34	Emerging	mixture-of-experts-llms	28	Python
2238	DestroyerDarkNess/fastvlm-webgpu Real-time video captioning powered by FastVLM	34	Emerging	vision-language-models	4	JavaScript
2239	zerovl/ZeroVL [ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources	34	Emerging	vision-language-models	46	Python
2240	AkiRusProd/numpy-transformer A numpy implementation of the Transformer model in "Attention is All You Need"	34	Emerging	transformer-architecture-tutorials	58	Python
2241	WayneMao/RoboMatrix The Official Implementation of RoboMatrix	34	Emerging	llm-robot-planning	106	Python
2242	deep-div/PlotLLM Data Visualization with LLM automatically analyzes data and generates...	34	Emerging	llm-data-visualization	7	Jupyter Notebook
2243	antoninodimaggio/Hugging-Captions Generate realistic Instagram captions using transformers 🤗	34	Emerging	blip-image-captioning	101	Python
2244	HaoAreYuDong/MachineLearningLM Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML	34	Emerging	llm-domain-datasets	59	Python
2245	google/curie Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long...	34	Emerging	math-reasoning-datasets	29	Jupyter Notebook
2246	michaelnny/QLoRA-LLM A simple custom QLoRA implementation for fine-tuning a language model (LLM)...	34	Emerging	lora-qlora-fine-tuning	10	Python
2247	Tebmer/Awesome-Knowledge-Distillation-of-LLMs This repository collects papers for "A Survey on Knowledge Distillation of...	34	Emerging	llm-knowledge-distillation	1,264	—
2248	Nikityyy/lille A powerful 130-million-parameter model trained from scratch as part of a...	34	Emerging	llm-frameworks-libraries	70	Python
2249	hesamsheikh/llm-mechanics Coding an LLM and its building blocks from scratch.	34	Emerging	llm-implementation-tutorials	116	Jupyter Notebook
2250	OneInterface/realtime-bakllava llama.cpp with the BakLLaVA model describes what it sees	34	Emerging	local-llm-deployment	379	Python
2251	RLHFlow/Online-RLHF A recipe for online RLHF and online iterative DPO.	34	Emerging	rlhf-alignment-training	543	Python
2252	iKernels/transformers-lightning A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses...	34	Emerging	transformer-architecture-tutorials	47	Python
2253	holarissun/RewardModelingBeyondBradleyTerry official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models...	34	Emerging	rlhf-alignment-training	71	Python
2254	hpdps-group/ElasticMM ElasticMM: Elastic and Efficient MLLM Serving System	34	Emerging	llm-inference-serving	41	Python
2255	rezazad68/transdeeplab TransDeepLab: Convolution-Free Transformer-based DeepLab v3+ for Medical...	34	Emerging	medical-image-segmentation-transformers	88	Python
2256	RAHB-REALTORS-Association/email-autodrafts Email Auto-ReplAI is a Python tool that uses AI to automate drafting...	34	Emerging	ai-powered-business-analytics	9	Python
2257	Pengxin-Guo/FedSA-LoRA Selective Aggregation for Low-Rank Adaptation in Federated Learning [ICLR 2025]	34	Emerging	llm-fine-tuning	60	Python
2258	jonrbates/turing A PyTorch library for simulating Turing machines with neural networks, based...	33	Emerging	transformer-architecture-tutorials	2	Python
2259	Uralstech/vid-orca Deploy LLaMA-2 Chat on Google Cloud.	33	Emerging	local-llm-deployment	4	Python
2260	srsawant34/efficient_instruction_learning Code base for the paper "Instruction Tuned Models are Quick Learners".	33	Emerging	lora-qlora-fine-tuning	5	Python
2261	Riccorl/llama-trainer Llama Trainer Utility	33	Emerging	llama-model-implementations	9	Python
2262	hollobit/GenAI_LLM_timeline ChatGPT, GenerativeAI and LLMs Timeline	33	Emerging	prompt-engineering-security	956	—
2263	Anjum48/commonlitreadabilityprize 4th Place solution for the Kaggle CommonLit Readability Prize	33	Emerging	essay-scoring-grading	38	Jupyter Notebook
2264	declare-lab/TEAM Our EMNLP 2022 paper on MCQA	33	Emerging	question-answering-systems	23	Python
2265	MLD3/steerability An open-source evaluation framework for measuring LLM steerability.	33	Emerging	llm-bias-evaluation	4	Jupyter Notebook
2266	Srijan-D/LangChain-v0.2-HuggingFace-Llama3 This project integrates LangChain v0.2.6, HuggingFace Serverless Inference...	33	Emerging	prompt-engineering-security	5	Python
2267	elephantmipt/compressors A small library with distillation, quantization and pruning pipelines	33	Emerging	llm-quantization-methods	26	Python
2268	graphcore-research/jax-scalify JAX Scalify: end-to-end scaled arithmetics	33	Emerging	llm-fine-tuning	18	Python
2269	chrisjob1021/transformer-examples A collection of educational toy implementations and examples of key...	33	Emerging	transformer-architecture-education	3	Jupyter Notebook
2270	UBC-MDS/fixml LLM Tool for effective test evaluation of ML projects with curated...	33	Emerging	llm-comparison-evaluation	4	Python
2271	smitkiri/news-qa Reading comprehension based question-answering model for news articles.	33	Emerging	question-answering-systems	11	Jupyter Notebook
2272	IIT-DM/BattleofLLMs Benchmarks of LLMs with Conversational QA datasets.	33	Emerging	llm-evaluation-benchmarking	6	Python
2273	HariomJangra/project-lumen A 128M parameter language model built from scratch for learning how large...	33	Emerging	llm-frameworks-libraries	8	Jupyter Notebook
2274	loretoparisi/bert_text_classifier Text Classification with BERT	33	Emerging	text-classification-transformers	8	Jupyter Notebook
2275	akanyaani/miniLLAMA A simplified LLAMA implementation for training and inference tasks.	33	Emerging	llama-model-implementations	36	Python
2276	jseeio/gpt2-tfjs GPT2 with Tensorflow.js	33	Emerging	gpt2-pretraining-fine-tuning	4	JavaScript
2277	YuanGongND/ltu Code, Dataset, and Pretrained Models for Audio and Speech Large Language...	33	Emerging	llm-scaling-architecture	472	Python
2278	haesleinhuepf/vlm-pictionary Play pictionary with Vision Language Models!	33	Emerging	multimodal-vision-language	6	Jupyter Notebook
2279	Esmail-ibraheem/Tinyllamas-pytorch Tinyllamas🦙 is an Extensible advanced language model framework, inspired by...	33	Emerging	llama-model-implementations	6	Python
2280	Nondzu/LlamaTor LlamaTor: Decentralized AI model sharing via BitTorrent for efficient,...	33	Emerging	llama-model-implementations	58	Python
2281	telekom/transformer-tools Transformers Training Tools	33	Emerging	transformer-architecture-tutorials	6	Python
2282	Ajax0564/VyomAI VyomAI: state-of-the-art NLP LLM Vision MultiModel transformers ...	33	Emerging	llm-implementation-tutorials	5	Python
2283	songxiaoshuai/progco Official Implementation of "ProgCo: Program Helps Self-Correction of Large...	33	Emerging	llm-interpretability-explainability	5	Python
2284	DoubleVII/lithft Pretrain, finetune any LLMs from huggingface on your own data.	33	Emerging	llm-fine-tuning	4	Python
2285	wangcongcong123/transection Transection: Transformers for English to Chinese Translation	33	Emerging	neural-machine-translation	6	Python
2286	monk1337/NanoPeft The simplest repository & Neat implementation of different Lora methods for...	33	Emerging	lora-qlora-fine-tuning	7	Jupyter Notebook
2287	pat-jj/KG-FIT [NeurIPS'24] Knowledge Graph Fine-Tuning using LLMs	33	Emerging	llm-knowledge-graph-generation	130	Python
2288	microsoft/MMLU-CF A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]	33	Emerging	llm-interpretability-explainability	123	—
2289	jianzhnie/LLMToolkit LLMToolkit is a toolkit for NLP(Natural Language Processing) and LLM(Large...	33	Emerging	llm-fine-tuning	6	Python
2290	daskol/llama.py Python bindings to llama.cpp	33	Emerging	local-llm-deployment	27	C
2291	sail-sg/dice Official implementation of Bootstrapping Language Models via DPO Implicit Rewards	33	Emerging	direct-preference-optimization	47	Python
2292	detsutut/ama-bot A modern and lightweight NLP interface for Question-Answering systems and...	33	Emerging	question-answering-systems	4	HTML
2293	yaojin17/Unlearning_LLM [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large...	33	Emerging	rlhf-alignment-training	66	Python
2294	notAI-tech/Anuvaad State of the art open-source translation for Indic languages.	33	Emerging	indic-language-translation	5	Python
2295	rkinas/reasoning_models_how_to This repository serves as a collection of research notes and resources on...	33	Emerging	llm-reasoning-research	132	Python
2296	krishnapriya-18/COVID-19-Tweet-Classification-using-Roberta-and-Bert-Simple-Transformers Rank 1 / 216	33	Emerging	text-classification-transformers	28	Jupyter Notebook
2297	duyhominhnguyen/Exgra-Med [NeurIPS 2025] ExGra-Med: Medical Multi-Modal LLM with Extended Context Alignment	33	Emerging	clinical-llm-tools	41	Python
2298	hasanisaeed/C-Transformer Implementation of the core Transformer architecture in pure C	33	Emerging	transformer-architecture-tutorials	8	C
2299	SORRY-Bench/sorry-bench Benchmark evaluation code for "SORRY-Bench: Systematically Evaluating Large...	33	Emerging	domain-specific-benchmarks	77	Jupyter Notebook
2300	WooooDyy/BAPO Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for...	33	Emerging	rlhf-alignment-training	91	Python
