All Transformer Models

6,429 models ranked by quality score · Page 24 of 65

Showing 2301–2400 of 6,429

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
2301	CognitiveAISystems/RATE [ICLR 2026] Official implementation of Recurrent Action Transformer with...	30	Emerging	paper-implementation-collections	18	Python
2302	DAMO-NLP-SG/LLM-Multilingual-Knowledge-Boundaries [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across...	30	Emerging	math-reasoning-datasets	18	Jupyter Notebook
2303	sandyresearch/chipmunk 🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E...	30	Emerging	transformer-training-optimization	101	Cuda
2304	gpustack/gguf-packer-go Deliver LLMs of GGUF format via Dockerfile.	30	Emerging	llm-quantization-methods	15	Go
2305	Harish25/StudyScreeningLanguageModel Core LLM for M.A.R.S. (Model Assisted Review System). Utilizes fine-tuned...	30	Emerging	multilingual-llm-adaptation	1	Jupyter Notebook
2306	whucs21Mzy/Model-Phase-Transitions Navigating Model Phase Transitions to Enable Extreme Lossless Compression: A...	30	Emerging	llm-compression-optimization	76	—
2307	ahazeemi/dPrune 🌿 dPrune: A Framework for Data Pruning	30	Emerging	llm-pruning-compression	3	Python
2308	partarstu/transformers-in-java Experimental project for AI and NLP based on Transformer Architecture	30	Emerging	transformer-frameworks-wrappers	16	Java
2309	OSUPCVLab/MobileUNETR Official Implementation of MobileUNETR: A Lightweight End-To-End Hybrid...	30	Emerging	medical-image-segmentation-transformers	61	Python
2310	gentaiscool/miners MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual...	30	Emerging	semantic-search-retrieval	14	Python
2311	mubingshen/MLC-SLM-Baseline The project is associated with the recently-launched INTERSPEECH 2025...	30	Emerging	llm-scaling-architecture	50	Python
2312	ulab-uiuc/Time-R1 Time-R1: Framework and resources for endowing LLMs with comprehensive...	30	Emerging	llm-reasoning-research	66	Python
2313	DebeshJha/TransNetR Official implementation of TransNetR: Transformer-based Residual Network for...	30	Emerging	medical-image-segmentation-transformers	24	Python
2314	gersteinlab/Struc-Bench [NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating...	30	Emerging	math-reasoning-datasets	55	Python
2315	bfilar/URLTran PyTorch/HuggingFace Implementation of URLTran: Improving Phishing URL...	30	Emerging	transformer-architecture-tutorials	37	Python
2316	ShinoharaHare/LLM-Training A distributed training framework for large language models powered by Lightning.	30	Emerging	llm-inference-engines	24	Python
2317	chziakas/redeval A library for red-teaming LLM applications with LLMs.	30	Emerging	evaluation-frameworks-metrics	29	Python
2318	kyegomez/MC-ViT Implementation of the model: "(MC-ViT)" from the paper: "Memory...	30	Emerging	vit-image-classification	27	Python
2319	trekhleb/homemade-gpt-js A minimal TensorFlow.js re-implementation of Karpathy's minGPT (Generative...	30	Emerging	gpt2-pretraining-fine-tuning	88	TypeScript
2320	NVlabs/HMAR [CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation	30	Emerging	semantic-segmentation-techniques	61	Python
2321	LMLK-seal/HuggingGGUF Hugging Face Model downloader and GGUF Converter.	30	Emerging	llm-quantization-methods	13	Python
2322	hukenovs/slovo Slovo: Russian Sign Language Dataset and Models	30	Emerging	3d-vision-transformers	83	Python
2323	theosorus/GPT2-Hasktorch GPT2 implementation in Haskell with the Hasktorch library, inspired by...	30	Emerging	gpt-implementation-tutorials	36	Haskell
2324	westlake-repl/NRPStransformer A Transformer-Based Predictor for Nonribosomal Peptide Synthetases (NRPS)...	30	Emerging	peptide-property-prediction	9	Python
2325	hsisaberi/single-trait-electra A complete ELECTRA-based framework for Big Five personality trait...	30	Emerging	text-clustering-topic-modeling	18	Python
2326	xmindflow/SSCT [ICCV 2023] Self-supervised Semantic Segmentation: Consistency over Transformation	29	Experimental	medical-image-segmentation-transformers	26	Jupyter Notebook
2327	seedatnabeel/CLLM Curated LLM (ICML 2024)	29	Experimental	llm-domain-datasets	14	Jupyter Notebook
2328	crux82/CLiC-it_2023_tutorial This repository hosts materials from the CLiC-IT 2023 tutorial	29	Experimental	llm-learning-resources	30	Jupyter Notebook
2329	THU-KEG/WaterBench [ACL2024-Main] Data and Code for WaterBench: Towards Holistic Evaluation of...	29	Experimental	llm-hallucination-mitigation	30	Python
2330	declare-lab/MM-Align [EMNLP 2022] This repository contains the official implementation of the...	29	Experimental	vision-language-models	33	Python
2331	PromptMixerDev/prompt-mixer-ollama-connector Ollama Connector	29	Experimental	interactive-ai-chat-uis	3	JavaScript
2332	saeeddhqan/tiny-transformer Tiny transformer models implemented in pytorch.	29	Experimental	transformer-architecture-tutorials	9	Python
2333	qizhou000/UniEdit [NeurIPS 2025 B & D] UniEdit: A Unified Knowledge Editing Benchmark for...	29	Experimental	rlhf-alignment-training	2	Python
2334	OSU-STARLAB/Simul-LLM [ACL 2024] An easily extensible framework for simultaneous, text-to-text...	29	Experimental	llm-scaling-architecture	18	Python
2335	francoislanc/midistral LLM finetuned for generating symbolic music	29	Experimental	llm-fine-tuning	42	Python
2336	ASSERT-KTH/agentic-evals-lab Framework for training and evaluating LLMs with reinforcement learning in...	29	Experimental	multi-agent-orchestration	4	Python
2337	wafflecomposite/langchain-ask-pdf-local An AI-app that allows you to upload a PDF and ask questions about it. It...	29	Experimental	streamlit-llm-interfaces	93	Python
2338	Aloereed/llama.cpp-server-ohos Llama.cpp server for OpenHarmony	29	Experimental	local-llm-deployment	9	C++
2339	antonyvigouret/Pay-Attention-to-MLPs My implementation of the gMLP model from the paper "Pay Attention to MLPs".	29	Experimental	transformer-architecture-tutorials	25	Python
2340	sashazykov/json-repair-rb A simple Ruby gem designed to repair broken JSON strings	29	Experimental	local-llm-deployment	10	Ruby
2341	Selozhd/FNet-tensorflow Tensorflow Implementation of "FNet: Mixing Tokens with Fourier Transforms."	29	Experimental	transformer-architecture-tutorials	22	Python
2342	amazon-science/llm-code-preference Training and Benchmarking LLMs for Code Preference.	29	Experimental	code-model-training	38	Python
2343	GiannakopoulosIlias/vision-transformer-network-for-mr-electrical-properties-tomography A 3D Vision Transformer-based neural network for reconstructing electrical...	29	Experimental	vision-transformer-implementations	9	Python
2344	dev-sufyaan/Nexlify Unified API platform for free access to enterprise-grade AI models from...	29	Experimental	local-llm-deployment	13	Python
2345	krnel-ai/krnel-graph Lightweight representation engineering dataflow operations for agent developers.	29	Experimental	graph-transformers	22	Python
2346	lrusso/llama3pure Three inference engines for Llama 3: pure C for desktop systems, pure...	29	Experimental	local-llm-deployment	21	HTML
2347	mehdihosseinimoghadam/AVA-Llama-3 Fine-Tuned Llama 3 Persian Large Language Model LLM / Persian Llama 3	29	Experimental	llm-fine-tuning	36	Jupyter Notebook
2348	Ludobico/KakaoChatData 카카오톡 대화 데이터셋	29	Experimental	messaging-platform-chatbots	53	Python
2349	zwhe99/X-SIR [ACL 2024] Can Watermarks Survive Translation? On the Cross-lingual...	29	Experimental	llm-training-experimentation	42	Python
2350	wschella/llm-reliability Code for the paper "Larger and more instructable language models become less...	29	Experimental	llm-training-experimentation	31	Jupyter Notebook
2351	abenechehab/dicl [ICLR 2025] Official implementation of DICL (Disentangled In-Context...	29	Experimental	rlhf-alignment-training	25	Jupyter Notebook
2352	corl-team/lime Official implementation of the paper "You Do Not Fully Utilize Transformer's...	29	Experimental	llm-frameworks-libraries	32	Python
2353	GeorgeMichailidis/multi-task-mixed-freq Code repository for "Multi-Task Encoder-Dual-Decoder Modeling Framework on...	29	Experimental	time-series-forecasting-transformers	12	Python
2354	martin-wey/peft-llm-code Replication package of the paper "Exploring Parameter-Efficient Fine-Tuning...	29	Experimental	llm-scaling-architecture	25	Python
2355	frankaging/ReCOGS ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of...	29	Experimental	transformer-architecture-tutorials	10	Jupyter Notebook
2356	Gary3410/TaPA [arXiv 2023] Embodied Task Planning with Large Language Models	29	Experimental	vision-language-instruction-tuning	193	Python
2357	Silvestre17/BDA_AmazonReviews_DatabricksPySparkAnalysis_MasterProject 🛍️ Big Data project analyzing Amazon tech reviews using Databricks, PySpark,...	29	Experimental	review-sentiment-classification	2	Jupyter Notebook
2358	pluja/maestro Turn natual language into commands. Your CLI tasks, now as easy as a...	29	Experimental	llm-terminal-automation	63	Go
2359	sigeisler/reinforce-attacks-llms REINFORCE Adversarial Attacks on Large Language Models: An Adaptive,...	29	Experimental	jailbreak-attacks-analysis	23	Python
2360	teddykoker/grokking PyTorch implementation of "Grokking: Generalization Beyond Overfitting on...	29	Experimental	transformer-architecture-tutorials	39	Python
2361	surrey-nlp/PLOD-AbbreviationDetection This repository contains the PLOD Dataset for Abbreviation Detection...	29	Experimental	nlp-learning-coursework	12	Jupyter Notebook
2362	kyegomez/M2PT Implementation of M2PT in PyTorch from the paper: "Multimodal Pathway:...	29	Experimental	parameter-efficient-adapters	14	Python
2363	antofuller/configaformers A python library for highly configurable transformers - easing model...	29	Experimental	transformer-architecture-tutorials	48	Python
2364	DreamerGPT/DreamerGPT 🌱 梦想家(DreamerGPT)：中文大语言模型指令精调	29	Experimental	multilingual-llm-adaptation	51	Python
2365	WangRongsheng/Chinese-LLaMA-Alpaca-Usage 📔 对Chinese-LLaMA-Alpaca进行使用说明和核心代码注解	29	Experimental	multilingual-llm-adaptation	51	Jupyter Notebook
2366	Praveengovianalytics/falcon-evaluate Falcon Evaluate is an open-source Python library aims to revolutionise the...	29	Experimental	evaluation-frameworks-metrics	14	Python
2367	sampathkethineedi/bert-topic-sentiment Topic Based Sentiment Detection using BERT	29	Experimental	review-sentiment-classification	9	Python
2368	Am1n3e/active-learning-transformer A hands-on tutorial on how to use Active Learning with Transformer models.	29	Experimental	machine-translation-transformers	15	Jupyter Notebook
2369	mcbal/spin-model-transformers Physics-inspired transformer modules based on mean-field dynamics of...	29	Experimental	transformer-architecture-tutorials	46	Python
2370	PRIME-RL/Entropy-Mechanism-of-RL The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.	29	Experimental	llm-reasoning-research	421	Python
2371	kaist-cvml/I-HallA-v1.0 [AAAI 2025] Official Implementation of I-HallA v1.0	29	Experimental	llm-hallucination-mitigation	13	Python
2372	avilum/llama-saas A client/server for LLaMA (Large Language Model Meta AI) that can run ANYWHERE.	29	Experimental	llm-terminal-automation	61	Go
2373	camenduru/alpaca-lora-colab Alpaca Lora	29	Experimental	llm-quantization-methods	25	Jupyter Notebook
2374	fabienfrfr/tptt 😊 TPTT: Transforming Pretrained Transformers into Titans	29	Experimental	transformer-architecture-education	60	Python
2375	MaxiDonkey/DelphiGroqCloud The GroqCloud API wrapper for Delphi provides access to models from Meta,...	29	Experimental	llm-terminal-automation	20	Pascal
2376	AristotelisPap/Question-Answering-with-BERT-and-Knowledge-Distillation Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD)...	29	Experimental	question-answering-systems	26	Jupyter Notebook
2377	zjunlp/DynamicKnowledgeCircuits [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits...	29	Experimental	math-reasoning-datasets	47	Jupyter Notebook
2378	shahriargolchin/time-travel-in-llms The official repository for the paper entitled "Time Travel in LLMs: Tracing...	29	Experimental	llm-domain-datasets	12	Python
2379	teilomillet/retrain a Python library that uses Reinforcement Learning (RL) to train LLMs.	29	Experimental	llm-knowledge-distillation	42	Python
2380	davidjosipovic/news-trend-analysis Automated NLP pipeline for news analysis with sentiment detection, topic...	29	Experimental	review-sentiment-classification	2	Python
2381	cakshat/AlloyBERT Introducing AlloyBERT: a transformer encoder-based model for predicting...	29	Experimental	bert-model-implementations	12	Python
2382	ai-forever/model-zoo NLP model zoo for Russian	29	Experimental	model-evaluation-diagnostics	50	—
2383	actypedef/ARCQuant Code for the paper "ARCQuant: Boosting NVFP4 Quantization with Augmented...	29	Experimental	llm-quantization-techniques	18	Cuda
2384	LoserCheems/WonderfulMatrices Wonderful Matrices to Build Small Language Models	29	Experimental	transformer-architecture-education	44	Python
2385	abcsys/libem Compound AI toolchain for fast and accurate entity matching, powered by LLMs.	29	Experimental	multilingual-llm-adaptation	26	Python
2386	jinzhuoran/RWKU RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language...	29	Experimental	llm-knowledge-editing	91	Python
2387	zrr1999/emotion-recognition 多模态情绪识别方法研究（Multimodal Emotion Recognition）	29	Experimental	emotion-detection-transformers	25	Python
2388	alphasecio/groq A Streamlit chatbot with memory for running open-source text models on Groq.	29	Experimental	streamlit-llm-interfaces	2	Python
2389	sitammeur/qwen2.5-web Qwen2.5 Instruct, large language model, operates within web browsers via 🤗...	29	Experimental	browser-based-ml-inference	2	JavaScript
2390	xlang-ai/text2reward [ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for...	29	Experimental	code-model-training	201	Jupyter Notebook
2391	ananttripathi/Resume-Analyzer-MLOps Resume Analyzer is an AI-powered MLOps platform that optimizes your resume...	29	Experimental	resume-job-matching	6	Python
2392	DataArcTech/ChartMoE [ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for...	29	Experimental	mathematical-reasoning-transformers	94	Jupyter Notebook
2393	astorfi/LLM-Alignment-Project A comprehensive template for aligning large language models (LLMs) using...	29	Experimental	rlhf-alignment-training	39	Python
2394	SALT-NLP/Adaptive-Compositional-Modules Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive...	29	Experimental	compositional-reasoning-embeddings	39	Python
2395	GU-DataLab/stance-detection-KE-MLM Official resource of the paper "Knowledge Enhanced Masked Language Model for...	29	Experimental	hate-speech-detection	39	Python
2396	jaketae/vit-breast-cancer Transfer learning pretrained vision transformers for breast histopathology	29	Experimental	medical-image-diagnosis-transformers	14	Python
2397	affjljoo3581/Inverse-DALL-E-for-Optical-Character-Recognition Inverse DALL-E for Optical Character Recognition	29	Experimental	text-to-image-generation	38	Python
2398	MileBench/MileBench This repo contains evaluation code for the paper "MileBench: Benchmarking...	29	Experimental	domain-specific-benchmarks	36	Python
2399	gentaiscool/few-shot-lm The source code of "Language Models are Few-shot Multilingual Learners" (MRL...	29	Experimental	instruction-tuning-datasets	53	Python
2400	jordandeklerk/SwinViT Modified Swin Transformer model in PyTorch on CIFAR-10 for image classification	29	Experimental	vit-image-classification	10	Python

« Prev 1 2 3 … 22 23 24 25 26 … 63 64 65 Next »