All Transformer Models

6,429 models ranked by quality score · Page 17 of 65

Showing 1601–1700 of 6,429

#	Model	Score	Tier	Category	Stars	Language
1601	cgtuebingen/ua3dscancomp Latent Uncertainty-Aware Multi-View SDF Scan Completion	36	Emerging	3d-vision-transformers	2	Python
1602	jackaduma/ChatGLM-LoRA-RLHF-PyTorch A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer...	36	Emerging	rlhf-alignment-training	140	Python
1603	HenryHZY/Awesome-Multimodal-LLM Research Trends in LLM-guided Multimodal Learning.	36	Emerging	multimodal-vision-language-models	356	—
1604	SeekingDream/DyCodeEval Official repository of the ICML2025 paper “Dynamic Benchmarking of Reasoning...	36	Emerging	math-reasoning-datasets	255	Python
1605	chenmozhijin/BSRoformer.cpp GGML-based C++ inference for BS Roformer/Mel-Band-Roformer vocal separation...	36	Emerging	llm-inference-engines	8	C++
1606	Relaxed-System-Lab/Flash-Sparse-Attention 🚀🚀 Efficient implementations of Native Sparse Attention	36	Emerging	sparse-attention-optimization	983	Python
1607	gbaptista/ollama-ai A Ruby gem for interacting with Ollama's API that allows you to run open...	36	Emerging	interactive-ai-chat-uis	255	Ruby
1608	TIGER-AI-Lab/LongICLBench Code and Data for "Long-context LLMs Struggle with Long In-context Learning"...	36	Emerging	math-reasoning-datasets	112	Python
1609	surrey-nlp/NLP-2026 Labs for COM3029/COMM061 at University of Surrey	36	Emerging	nlp-learning-coursework	1	Jupyter Notebook
1610	guxm2021/ALT_SpeechBrain [ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription	36	Emerging	wav2vec2-speech-recognition	49	Python
1611	c0sogi/llama-api An OpenAI-like LLaMA inference API	36	Emerging	local-llm-deployment	113	Python
1612	bahree/helloLondon Historical Language Model for London - A specialized LLM trained on...	36	Emerging	transformer-implementation-education	29	Python
1613	gusye1234/llm-as-function Embed your LLM into a python function	36	Emerging	llm-function-calling	22	Python
1614	rese1f/aurora [ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a...	36	Emerging	image-captioning-transformers	139	Python
1615	jonrbates/turing A PyTorch library for simulating Turing machines with neural networks, based...	36	Emerging	transformer-architecture-tutorials	2	Python
1616	jqtangust/Robust-R1 🔥🔥🔥[AAAI 2026 Oral] Official Implementation of Robust-R1: Degradation-Aware...	36	Emerging	llm-reasoning-research	520	Python
1617	avocardio/Zicklein Finetuning instruct-LLaMA on German datasets.	36	Emerging	lora-qlora-fine-tuning	33	Python
1618	gotzmann/booster Booster - open accelerator for LLM models. Better inference and debugging...	36	Emerging	llm-inference-engines	167	C++
1619	HOLYKEYZ/model-unfetter The production engine for directional ablation. Unalign / remove models...	36	Emerging	llm-compression-optimization	19	Python
1620	PaddlePaddle/PALM a Fast, Flexible, Extensible and Easy-to-use NLP Large-scale Pretraining and...	36	Emerging	llm-training-experimentation	185	Python
1621	m0dulo/InferSpore 🌱 A fully independent Large Language Model (LLM) inference engine, built...	36	Emerging	llm-inference-engines	32	Cuda
1622	Ratnesh-181998/python-ai-ml-libraries A comprehensive Python AI/ML repository covering end-to-end workflows using...	36	Emerging	ml-foundations-curricula	2	Jupyter Notebook
1623	AutonomicPerfectionist/PipeInfer PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation	36	Emerging	llm-cuda-optimization	32	C++
1624	TIGER-AI-Lab/MAmmoTH Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid...	36	Emerging	math-reasoning-datasets	383	Jupyter Notebook
1625	declare-lab/LLM-PuzzleTest This repository is maintained to release dataset and models for multimodal...	36	Emerging	math-reasoning-datasets	113	Python
1626	haiodo/oaitt An OpenAI compatible transcriber using transformers and whisperx.	36	Emerging	whisper-speech-transcription	6	Python
1627	jeffreysijuntan/lloco The official repo for "LLoCo: Learning Long Contexts Offline"	36	Emerging	llm-compression-optimization	118	Python
1628	HiThink-Research/BizFinBench A Business-Driven Real-World Financial Benchmark for Evaluating LLMs	36	Emerging	domain-specific-benchmarks	211	Python
1629	MME-Benchmarks/Video-MME ✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark...	36	Emerging	multimodal-vision-language	732	—
1630	robinhad/kruk Ukrainian instruction-tuned language models and datasets	36	Emerging	multilingual-llm-adaptation	96	Jupyter Notebook
1631	BUAADreamer/Chinese-LLaVA-Med Chinese medical multimodal large model (Large Chinese Language-and-Vision Assistant for BioMedicine)	36	Emerging	vision-language-instruction-tuning	103	Python
1632	SingleZombie/LLSA Official implementation of Log-linear Sparse Attention (LLSA).	36	Emerging	attention-mechanism-implementations	62	Python
1633	JinjieNi/MegaDLMs GPU-optimized framework for training diffusion language models at any scale....	36	Emerging	diffusion-language-models	327	Python
1634	0x7o/text2keywords Trained T5 and T5-large model for creating keywords from text	36	Emerging	t5-mt5-fine-tuning	73	Jupyter Notebook
1635	yihongXU/TransCenter This is the official implementation of TransCenter (TPAMI). The code and...	36	Emerging	3d-vision-transformers	118	—
1636	SkyworkAI/MoE-plus-plus [ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with...	36	Emerging	mixture-of-experts-llms	264	Python
1637	UCSC-REAL/DS2 [ICLR 2025] Official implementation of paper "Improving Data Efficiency via...	36	Emerging	graph-language-models	101	Python
1638	cankocagil/SwinDetr Integration of Swin Transformer to DETR for Robust Object Detection (DEMO)	36	Emerging	object-detection-transformers	30	Jupyter Notebook
1639	yongchao98/R1-Code-Interpreter R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and...	36	Emerging	llm-reasoning-research	31	Python
1640	ray-project/ray-llm RayLLM - LLMs on Ray (Archived). Read README for more info.	36	Emerging	llm-inference-serving	1,267	—
1641	ethicalabs-ai/kurtis Kurtis is a fine-tuning, inference and evaluation tool built for SLMs (Small...	36	Emerging	lora-qlora-fine-tuning	6	Python
1642	kyegomez/PALI Democratization of "PaLI: A Jointly-Scaled Multilingual Language-Image Model"	36	Emerging	vision-language-models	94	Python
1643	etaoxing/multigame-dt Implementation of Multi-Game Decision Transformers in PyTorch	36	Emerging	power-transformer-design	49	Python
1644	trzy/llava-cpp-server LLaVA server (llama.cpp).	36	Emerging	local-llm-deployment	183	C++
1645	xuyang-liu16/GlobalCom2 [AAAI 2026] Global Compression Commander: Plug-and-Play Inference...	36	Emerging	llm-compression-optimization	39	Python
1646	grigio/llm-eval-simple llm-eval-simple is a simple LLM evaluation framework with intermediate...	36	Emerging	evaluation-frameworks-metrics	59	Python
1647	nanxiang11/CodeLab_LLM 🌟 A tutorial on the principles and practice of large language models, starting from LLaMA2	36	Emerging	llm-learning-resources	76	Python
1648	cleopatra-itn/fair_multimodal_sentiment Code and Splits for the paper "A Fair and Comprehensive Comparison of...	36	Emerging	review-sentiment-classification	10	Python
1649	awneesht/KVShuttle Benchmark & decision framework for KV cache transfer compression in...	36	Emerging	llm-quantization-methods	5	Python
1650	retarfi/language-pretraining Pre-training Language Models for Japanese	36	Emerging	bert-model-implementations	50	Python
1651	ChuloAI/BrainChulo Harnessing the Memory Power of the Camelids	36	Emerging	multilingual-llm-adaptation	147	Python
1652	Ethyros-AI/ModelCypher ModelCypher - Decipher the high dimensional geometry of LLMs. An open source...	36	Emerging	llm-finetuning-frameworks	19	Python
1653	abhilash1910/LongPegasus LongPegasus package is used for inducing longformer self attention over base...	36	Emerging	text-summarization-transformers	4	Jupyter Notebook
1654	jagilley/fact-checker Fact-checking LLM outputs with self-ask	36	Emerging	fact-checking-systems	307	Jupyter Notebook
1655	abhisheknair10/llama3.cu Lightweight Llama 3 8B Inference Engine in CUDA C	36	Emerging	local-llm-deployment	54	Cuda
1656	BatsResearch/trove A Flexible Toolkit for Dense Retrieval	36	Emerging	power-transformer-design	44	Python
1657	julienkay/com.doji.transformers A Unity package to run pretrained transformer models with Unity Sentis	36	Emerging	transformer-frameworks-wrappers	25	C#
1658	deep-symbolic-mathematics/Multimodal-Math-Pretraining [ICLR 2024 Spotlight] This is the official code for the paper "SNIP:...	36	Emerging	mathematical-reasoning-transformers	58	Python
1659	TIGER-AI-Lab/VisualWebInstruct The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction...	36	Emerging	instruction-tuning-datasets	38	Python
1660	HenryNdubuaku/nanodl Build GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more in JAX.	36	Emerging	transformer-frameworks-wrappers	299	Python
1661	dobriban/Principles-of-AI-LLMs Materials for the course Principles of AI: LLMs at UPenn (Stat 9911, Spring...	36	Emerging	llm-training-experimentation	44	—
1662	purvanshjoshi/IndiVoice-DeepASR Deep Learning framework for Indian-accented Speech-to-Text using Whisper and...	36	Emerging	whisper-speech-transcription	2	Python
1663	bnosac/golgotha Contextualised Embeddings and Language Modelling using BERT and Friends using R	36	Emerging	bert-model-implementations	47	R
1664	sh0416/llama-classification Text classification with Foundation Language Model LLaMA	36	Emerging	llama-model-implementations	113	Python
1665	monologg/HanBert-Transformers HanBert on 🤗 Huggingface Transformers 🤗	36	Emerging	korean-language-models	87	Python
1666	shahrukhx01/siamese-nn-semantic-text-similarity A repository containing comprehensive Neural Networks based PyTorch...	36	Emerging	semantic-textual-similarity	53	Python
1667	jdaln/dgx-spark-inference-stack Serve the home! Inference stack for your Nvidia DGX Spark aka the Grace...	36	Emerging	llm-inference-engines	26	JavaScript
1668	voidism/Lookback-Lens Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual...	36	Emerging	llm-hallucination-mitigation	147	Python
1669	mayank164/loveFreeTools 🛠️ Provide free tools like temporary emails, link shortening, and more, all...	36	Emerging	study-aid-generators	2	JavaScript
1670	ai-action/cypress-ai-demo Cypress AI Demo	36	Emerging	interactive-ai-chat-uis	2	TypeScript
1671	IrohXu/Awesome-Multimodal-LLM-Autonomous-Driving [WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving	36	Emerging	multimodal-vision-language-models	309	—
1672	real-stanford/reflect [CoRL 2023] REFLECT: Summarizing Robot Experiences for Failure Explanation...	36	Emerging	llm-robot-planning	103	Jupyter Notebook
1673	HKUDS/SepLLM [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One...	36	Emerging	diffusion-language-models	567	Python
1674	FengheTan9/LLM4Seg [MICCAI 2025] Official code for "Pre-Trained LLM is a Semantic-Aware and...	36	Emerging	instruction-tuning-datasets	51	Python
1675	wehos/awesome-graph-transformer Papers about graph transformers.	35	Emerging	graph-transformers	919	—
1676	nuhmanpk/quick-llama Run Ollama models on Google Colab	35	Emerging	local-llm-deployment	4	Python
1677	researchim-ai/models-at-home training models at home	35	Emerging	llm-fine-tuning	34	Python
1678	FareedKhan-dev/gpt4o-from-scratch Implementation of a GPT-4o like Multimodal from Scratch using Python	35	Emerging	gpt2-pretraining-fine-tuning	78	Jupyter Notebook
1679	arrmansa/Basic-UI-for-GPT-Neo-with-low-vram A basic UI for running GPT-Neo 2.7B on low VRAM (3 GB VRAM minimum)	35	Emerging	gpt2-pretraining-fine-tuning	36	Jupyter Notebook
1680	TideDra/VL-RLHF A RLHF Infrastructure for Vision-Language Models	35	Emerging	rlhf-alignment-training	198	Python
1681	a-tokyo/ai-zero-shot-classifier 🧠 leverage advanced AI embeddings to perform multilingual zero-shot text...	35	Emerging	text-classification-transformers	12	TypeScript
1682	ziqipang/LM4VisualEncoding [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are...	35	Emerging	multimodal-vision-language	246	Python
1683	rust-dd/iTransformer An iTransformer implementation in Rust	35	Emerging	browser-based-ml-inference	19	Rust
1684	Ankur3107/nlp_notebooks Tensorflow, Pytorch, Huggingface Transformer, Fastai, etc. tutorial Colab Notebooks.	35	Emerging	huggingface-learning-resources	78	Jupyter Notebook
1685	Airmomo/transformers-docs-zh [Continuously updated] Fully Chinese-language Transformers study notes and demo examples, with Jupyter Notebook support; content mainly from 🤗 Hugging...	35	Emerging	huggingface-learning-resources	71	—
1686	thushv89/packt_nlp_tensorflow_2 This will contain the code for the 2nd edition of NLP with TensorFlow (Edition 2)	35	Emerging	nlp-learning-coursework	45	Jupyter Notebook
1687	EvanZhouDev/llm.pdf Run LLMs inside a PDF file.	35	Emerging	pdf-qa-systems	755	Python
1688	UCSC-REAL/TokenCleaning [ICML 2025] Official implementation of paper "Token Cleaning: Fine-Grained...	35	Emerging	instruction-tuning-datasets	51	Python
1689	modal-labs/stopwatch A tool for benchmarking LLMs on Modal	35	Emerging	llm-evaluation-benchmarking	50	Python
1690	Jagatmohan46/tiny-recursive-model 🚀 Implement the Tiny Recursive Model (TRM) for improved performance in...	35	Emerging	transformer-training-optimization	1	Python
1691	wxjiao/ParroT The ParroT framework to enhance and regulate the Translation Abilities...	35	Emerging	multilingual-llm-adaptation	176	Python
1692	gsarti/t5-flax-gcp Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP	35	Emerging	transformer-frameworks-wrappers	58	Python
1693	InternRobotics/PointLLM [ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large...	35	Emerging	vision-language-instruction-tuning	983	Python
1694	misonsky/HiFT memory-efficient fine-tuning; support 24G GPU memory fine-tuning 7B	35	Emerging	lora-qlora-fine-tuning	21	Python
1695	nickduran/align2-linguistic-alignment ALIGN 2.0: Modern Python package for multi-level linguistic alignment...	35	Emerging	rlhf-alignment-training	4	Python
1696	Tanveer81/ReVisionLLM This is the official implementation of ReVisionLLM: Recursive...	35	Emerging	multimodal-vision-language	43	Python
1697	amazon-science/recode Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"	35	Emerging	math-reasoning-datasets	58	Python
1698	OnlyTerp/turboquant First open-source implementation of Google TurboQuant (ICLR 2026) --...	35	Emerging	kv-cache-optimization	36	Python
1699	Gurumurthy30/Stackformer Modular PyTorch transformer library for building, training, and...	35	Emerging	transformer-architecture-tutorials	7	Python
1700	macabdul9/AnyGen A Unified and Minimalist Pipeline for Generating Outputs with LLMs...	35	Emerging	prompt-engineering-security	7	Python
