All Transformer Models

6,429 models ranked by quality score · Page 14 of 65

Showing 1301–1400 of 6,429

#	Model	Score	Tier	Category	Stars	Language
1301	jackaduma/Vicuna-LoRA-RLHF-PyTorch A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer...	39	Emerging	rlhf-alignment-training	221	Python
1302	harveybc/predictor Predictor that uses a configurable plugin-based predictive supervised...	39	Emerging	financial-return-prediction	5	Python
1303	janelu9/EasyLLM Running Large Language Model easily.	39	Emerging	llm-training-experimentation	13	Python
1304	ruimalheiro/training-custom-llama Llama-style transformer in PyTorch with multi-node / multi-GPU training....	39	Emerging	lora-qlora-fine-tuning	21	Python
1305	Archimedes1618/Madlab Madlab is an advanced AI development studio designed to streamline the...	39	Emerging	local-llm-deployment	11	TypeScript
1306	leaderj1001/CLIP CLIP: Connecting Text and Image (Learning Transferable Visual Models From...	39	Emerging	clip-vision-language	83	Python
1307	slSeanWU/Compose_and_Embellish Official PyTorch implementation of ICASSP 2023 paper "Compose & Embellish:...	39	Emerging	music-generation-transformers	33	Python
1308	complex-reasoning/RPG [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)	39	Emerging	rlhf-alignment-training	65	Python
1309	padeler/PE-former 2D Human Pose estimation using transformers. Implementation in Pytorch	39	Emerging	3d-vision-transformers	34	Python
1310	Aaronhuang-778/BiLLM [ICML 2024] BiLLM: Pushing the Limit of Post-Training Quantization for LLMs	39	Emerging	llm-quantization-techniques	228	Python
1311	UCSC-VLAA/m1 [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical...	39	Emerging	llm-scaling-architecture	48	Jupyter Notebook
1312	lvyufeng/cybertron-ai mindspore implementation of transformers	39	Emerging	transformer-frameworks-wrappers	68	Python
1313	WayneJin0918/SRUM Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified...	39	Emerging	rlhf-alignment-training	96	Python
1314	praeclarum/transformers-js Browser-compatible JS library for running language models	39	Emerging	browser-based-ml-inference	233	JavaScript
1315	AyushExel/trolo An SDK for Transformers + YOLO and other SSD family models	39	Emerging	3d-vision-transformers	64	Jupyter Notebook
1316	zinengtang/TVLT PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)	39	Emerging	multimodal-fusion-transformers	126	Jupyter Notebook
1317	Michael-A-Kuykendall/shimmytok Pure Rust tokenizer for GGUF models - llama.cpp compatible	39	Emerging	llm-quantization-methods	14	Rust
1318	DeepChainBio/deepchain-apps A library for deploying App on deepchain.bio	39	Emerging	transformer-frameworks-wrappers	31	Python
1319	akx/ollama-dl Download models from the Ollama library, without Ollama	39	Emerging	ollama-go-clients	127	Python
1320	YJiangcm/Lion [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models	39	Emerging	llm-knowledge-distillation	212	Python
1321	DAMO-NLP-SG/CLEX [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models	39	Emerging	diffusion-language-models	78	Python
1322	young-geng/m3ae_public Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation	39	Emerging	transformer-frameworks-wrappers	107	Python
1323	withcaer/curtana Simplified zero-cost wrapper over llama.cpp powered by the llama-cpp-2 crate.	39	Emerging	local-llm-deployment	2	Rust
1324	ariG23498/gemma3-object-detection Fine tune Gemma 3 on an object detection task	39	Emerging	lora-qlora-fine-tuning	100	Python
1325	muhtalhakhan/Hacktoberfest2025 Hacktoberfest 2025 🧑🏻‍💻 OPEN FIRST Pull Request 🎉	39	Emerging	ai-powered-saas-startups	8	HTML
1326	anchen1011/FireAct FireAct: Toward Language Agent Fine-tuning	39	Emerging	llm-fine-tuning	292	Python
1327	amazon-science/crossmodal-contrastive-learning CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video...	39	Emerging	vision-language-models	64	Python
1328	asprenger/ray_vllm_inference A simple service that integrates vLLM with Ray Serve for fast and scalable...	39	Emerging	llm-inference-serving	78	Python
1329	ariannamethod/doe DoE Janus Architecture: Democracy of Experts	39	Emerging	llm-quantization-methods	4	C
1330	AlekseyKorshuk/huggingartists Lyrics generation with GPT2-based Transformer	39	Emerging	music-generation-transformers	108	Jupyter Notebook
1331	ChenRocks/UNITER Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt...	39	Emerging	3d-vision-transformers	800	Python
1332	LLMBook-zh/LLMBook-zh.github.io "Large Language Models" (book). Authors: Zhao Xin, Li Junyi, Zhou Kun, Tang Tianyi, Wen Jirong	39	Emerging	llm-learning-resources	4,371	Python
1333	zjunlp/Deco [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation	39	Emerging	llm-hallucination-mitigation	137	Python
1334	UKPLab/5pils Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!"...	39	Emerging	llm-interpretability-explainability	45	Python
1335	hpretila/llama.net .NET wrapper for LLaMA.cpp for LLaMA language model inference on CPU. 🦙	39	Emerging	local-llm-deployment	58	C#
1336	gopikrsmscs/stock-price-prediction-transformer Tesla Stock Price Prediction Using Transformer	39	Emerging	financial-return-prediction	31	Python
1337	riccardomusmeci/mlx-llm Large Language Models (LLMs) applications and tools running on Apple Silicon...	39	Emerging	llm-inference-engines	459	Python
1338	amoffat/HeimdaLLM Constrain LLM output	39	Emerging	llm-frameworks-libraries	113	Python
1339	THUDM/LongAlign [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs	39	Emerging	llm-knowledge-editing	259	Python
1340	golololologol/LLM-Distillery A pipeline for LLM knowledge distillation	39	Emerging	llm-knowledge-distillation	112	Python
1341	TatevKaren/BabyGPT-Build_GPT_From_Scratch BabyGPT: Build Your Own GPT Large Language Model from Scratch Pre-Training...	39	Emerging	gpt2-pretraining-fine-tuning	116	Python
1342	slp-rl/slamkit SlamKit is an open source tool kit for efficient training of SpeechLMs. It...	39	Emerging	llm-benchmark-leaderboards	229	Python
1343	liuyukid/transformers-ner Pytorch-Named-Entity-Recognition-with-transformers	39	Emerging	named-entity-recognition	210	Python
1344	xenova/sponsorblock-ml Automatically detect in-video YouTube sponsorships, self/unpaid promotions,...	39	Emerging	youtube-video-summarization	159	Python
1345	jerryshell/resumind AI-powered resume analysis system that provides tailored feedback and an ATS score for each job position	38	Emerging	resume-job-matching	5	JavaScript
1346	AIoT-MLSys-Lab/Efficient-LLMs-Survey [TMLR 2024] Efficient Large Language Models: A Survey	38	Emerging	llm-research-curation	1,256	—
1347	iflytek/VLE VLE: Vision-Language Encoder (a vision-language multimodal pre-trained model)	38	Emerging	multimodal-vision-language	194	Python
1348	osainz59/Ask2Transformers A Framework for Textual Entailment based Zero Shot text classification	38	Emerging	text-classification-transformers	153	Python
1349	xingyizhou/GTR Global Tracking Transformers, CVPR 2022	38	Emerging	3d-vision-transformers	379	Python
1350	hasanirtiza/PedesFormer-Transformer-Networks-For-Pedestrian-Detection Transformer Networks for Pedestrian Detection	38	Emerging	3d-vision-transformers	43	Python
1351	shrut2702/upasak UI-based Fine-Tuning for Large Language Models (LLMs)	38	Emerging	lora-qlora-fine-tuning	20	Python
1352	BarCodeReader/SelfReformer [TMM-2023] Official implementation of "Towards Complete and Detail-Preserved...	38	Emerging	object-detection-transformers	73	Python
1353	daniel-furman/sft-demos Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and...	38	Emerging	rlhf-alignment-training	77	Jupyter Notebook
1354	DirtyHarryLYL/Transformer-in-Vision Recent Transformer-based CV and related works.	38	Emerging	vision-transformer-optimization	1,339	—
1355	baaivision/EVE EVE Series: Encoder-Free Vision-Language Models from BAAI	38	Emerging	multimodal-vision-language	368	Python
1356	OFA-Sys/ExpertLLaMA An opensource ChatBot built with ExpertPrompting which achieves 96% of...	38	Emerging	messaging-platform-chatbots	302	Python
1357	OFA-Sys/OFASys OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models	38	Emerging	multimodal-fusion-transformers	151	Python
1358	Kaleidophon/nlp-uncertainty-zoo Model zoo for different kinds of uncertainty quantification methods used in...	38	Emerging	power-transformer-design	55	Python
1359	ma2za/telegram-llm-bot Telegram LLM bot backed by OpenAI, Whisper, Beam, LLaMA, Weaviate, MinIO and MongoDB	38	Emerging	messaging-platform-chatbots	112	Python
1360	Nkluge-correa/Tucano Natively pre-trained open-source Portuguese language models.	38	Emerging	multilingual-llm-adaptation	79	Jupyter Notebook
1361	Yxxxb/VoCo-LLaMA [CVPR'2025] VoCo-LLaMA: This repo is the official implementation of...	38	Emerging	vision-language-instruction-tuning	203	Python
1362	itsnamgyu/block-transformer Block Transformer: Global-to-Local Language Modeling for Fast Inference...	38	Emerging	kv-cache-optimization	163	Python
1363	icon-lab/SLATER Official implementation of the paper: Unsupervised MRI Reconstruction via...	38	Emerging	3d-vision-transformers	41	Python
1364	lukashermann/hulc Hierarchical Universal Language Conditioned Policies	38	Emerging	trajectory-prediction-ml	77	Python
1365	openpsi-project/ReaLHF Super-Efficient RLHF Training of LLMs with Parameter Reallocation	38	Emerging	rlhf-alignment-training	333	Python
1366	WisconsinAIVision/ViP-LLaVA [CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary...	38	Emerging	vision-language-instruction-tuning	336	Python
1367	tomekkorbak/pretraining-with-human-feedback Code accompanying the paper Pretraining Language Models with Human Preferences	38	Emerging	rlhf-alignment-training	180	Python
1368	Whiax/BERT-Transformer-Pytorch Basic implementation of BERT and Transformer in Pytorch in one short python...	38	Emerging	transformer-architecture-education	45	Python
1369	sshh12/multi_token Embed arbitrary modalities (images, audio, documents, etc) into large...	38	Emerging	multimodal-vision-language	191	Python
1370	AmpereComputingAI/llama.cpp Ampere optimized llama.cpp	38	Emerging	llm-inference-engines	33	Python
1371	shufangxun/LLaVA-MoD [ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation	38	Emerging	llm-knowledge-distillation	223	Python
1372	egaoharu-kensei/flash-attention-triton Cross-platform FlashAttention-2 Triton implementation for Turing+ GPUs with...	38	Emerging	sparse-attention-optimization	21	Python
1373	arrmansa/Basic-UI-for-GPT-J-6B-with-low-vram A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for...	38	Emerging	gpt2-pretraining-fine-tuning	113	Jupyter Notebook
1374	sergiomorapardo/AdvancedTopicsAnalytics Materials and notebooks for the course "Tópicos Avanzados en Analítica...	38	Emerging	ml-foundations-curricula	12	Jupyter Notebook
1375	takara-ai/go-attention A full attention mechanism and transformer in pure go.	38	Emerging	attention-mechanism-implementations	451	Go
1376	HillZhang1999/ICD Code & Data for our Paper "Alleviating Hallucinations of Large Language...	38	Emerging	llm-hallucination-mitigation	69	Python
1377	jakubburkiewicz/node-red-contrib-ollama A Node-RED module that wraps the ollama.js library, offering its...	38	Emerging	interactive-ai-chat-uis	28	HTML
1378	wuwangzhang1216/prometheus Fully automatic censorship removal for language models. LoRA abliteration +...	38	Emerging	lora-qlora-fine-tuning	33	Python
1379	Hugging-Face-Supporter/tftokenizers Use Huggingface Transformer and Tokenizers as Tensorflow Reusable SavedModels	38	Emerging	tokenizer-libraries	10	Python
1380	epfml/llm-optimizer-benchmark Benchmarking Optimizers for LLM Pretraining	38	Emerging	domain-specific-benchmarks	56	Python
1381	Longyichen/Alpaca-family-library Summarize all open source Large Language Models and low-cost replication...	38	Emerging	multilingual-llm-adaptation	136	—
1382	BhabhaAI/dataformer Solving data for LLMs - Create quality synthetic datasets!	38	Emerging	synthetic-data-generation	151	Python
1383	dbmdz/berts DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models	38	Emerging	bert-model-implementations	159	—
1384	di37/finetuning-quantize-evaluate Fine-Tune, Quantize, Evaluate: The Complete Guide — LLMs, VLMs, and Embedding Models	38	Emerging	llm-fine-tuning	13	Typst
1385	minosvasilias/godot-dodo Finetuning large language models for GDScript generation.	38	Emerging	lora-qlora-fine-tuning	567	Python
1386	moeru-ai/inventory 🧠🃏 Your universal model catalog, everything, everywhere, all at once.	38	Emerging	ml-foundations-curricula	5	Go
1387	intersun/LightningDOT source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT	38	Emerging	parameter-efficient-adapters	72	Python
1388	zarzouram/image_captioning_with_transformers Pytorch implementation of image captioning using transformer-based model.	38	Emerging	image-captioning-transformers	68	Jupyter Notebook
1389	hao-ai-lab/Consistency_LLM [ICML 2024] CLLMs: Consistency Large Language Models	38	Emerging	llm-interpretability-explainability	413	Python
1390	NohTow/PPL-MCTS Repository for the code of the "PPL-MCTS: Constrained Textual Generation...	38	Emerging	creative-text-generation	66	Python
1391	Infini-AI-Lab/vortex_torch Vortex: A Flexible and Efficient Sparse Attention Framework	38	Emerging	sparse-attention-optimization	49	Python
1392	ParCIS/Chimera Chimera: bidirectional pipeline parallelism for efficiently training...	38	Emerging	transformer-training-optimization	70	Python
1393	kyaiooiayk/Awesome-LLM-Large-Language-Models-Notes What can I do with a LLM model?	38	Emerging	llm-training-experimentation	157	Jupyter Notebook
1394	Curated-Awesome-Lists/awesome-llms-fine-tuning Explore a comprehensive collection of resources, tutorials, papers, tools,...	38	Emerging	llm-training-experimentation	505	—
1395	kolinko/effort An implementation of bucketMul LLM inference	38	Emerging	apple-silicon-llm-inference	227	Swift
1396	robert-mcdermott/LLM-Image-Classification Image Classification Testing with LLMs	38	Emerging	text-classification	72	Python
1397	InhwanBae/LMTrajectory Official Code for "Can Language Beat Numerical Regression? Language-Based...	38	Emerging	game-playing-agents	159	Python
1398	matlab-deep-learning/transformer-networks-for-time-series-prediction Deep Learning in Quantitative Finance: Transformer Networks for Time Series...	38	Emerging	time-series-forecasting-transformers	61	MATLAB
1399	upb-lea/mag-net-hub MagNet Toolkit - Certified Models of the MagNet Challenge	38	Emerging	power-transformer-design	18	Python
1400	chenhan97/TimeLlama The official repo of TimeLlama, an instruction-finetuned Llama2 series that...	38	Emerging	llm-frameworks-libraries	43	Python
