All Transformer Models

6,429 models ranked by quality score · Page 12 of 65

Showing 1101–1200 of 6,429

#	Model	Score	Tier	Category	Stars	Language
1101	Esmail-ibraheem/Axon AI research lab🔬: implementations of AI papers and theoretical research:...	41	Emerging	ml-foundations-curricula	18	Python
1102	ariannamethod/arianna.c Arianna is a Digital Persona. Embodied cognition as is.	41	Emerging	interactive-ai-chat-uis	6	C
1103	Multi-Agent-LLMs/mallm Framework: Multi-Agent LLMs For Conversational Task-Solving (MALLM)	41	Emerging	multi-agent-debate-systems	52	Python
1104	declare-lab/instruct-eval This repository contains code to quantitatively evaluate instruction-tuned...	41	Emerging	instruction-tuning-datasets	552	Python
1105	willxxy/ECG-Bench A Unified Framework for Benchmarking Generative Electrocardiogram-Language...	41	Emerging	llm-domain-datasets	42	Python
1106	bigscience-workshop/xmtf Crosslingual Generalization through Multitask Finetuning	41	Emerging	llm-fine-tuning	537	Jupyter Notebook
1107	cloudguruab/modsysML Human reinforcement learning (RLHF) framework for AI models. Evaluate and...	41	Emerging	llm-evaluation-benchmarking	36	Python
1108	yang-ai-lab/SleepLM SleepLM: Natural-Language Intelligence for Human Sleep	41	Emerging	llm-scaling-architecture	29	Jupyter Notebook
1109	ariannamethod/ariannamethod.ai Arianna Method Programming Language	41	Emerging	ml-foundations-curricula	12	C
1110	mlvlab/Flipped-VQA Large Language Models are Temporal and Causal Reasoners for Video Question...	41	Emerging	multimodal-vision-language	78	Python
1111	XunhaoLai/native-sparse-attention-triton Efficient triton implementation of Native Sparse Attention.	41	Emerging	sparse-attention-optimization	269	Python
1112	HHousen/DocSum A tool to automatically summarize documents abstractively using the BART or...	41	Emerging	text-summarization-transformers	69	Python
1113	ictnlp/LLaVA-Mini LLaVA-Mini is a unified large multimodal model (LMM) that can support the...	41	Emerging	vision-language-instruction-tuning	562	Python
1114	pjlab-sys4nlp/llama-moe ⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual...	41	Emerging	mathematical-reasoning-transformers	1,002	Python
1115	zai-org/GLM-Edge GLM Series Edge Models	41	Emerging	llm-frameworks-libraries	160	Python
1116	liuqidong07/MOELoRA-peft [SIGIR'24] The official implementation code of MOELoRA.	41	Emerging	llm-fine-tuning	189	Python
1117	Beomi/InfiniTransformer Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No...	41	Emerging	transformer-architecture-tutorials	375	Python
1118	punica-ai/punica Serving multiple LoRA finetuned LLM as one	41	Emerging	llm-fine-tuning	1,145	Python
1119	SqueezeAILab/SqueezeLLM [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization	41	Emerging	llm-quantization-methods	713	Python
1120	VITA-MLLM/Freeze-Omni ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with...	41	Emerging	multimodal-vision-language	369	Python
1121	AdrianBZG/llama-multimodal-vqa Multimodal Instruction Tuning for Llama 3	41	Emerging	vision-language-instruction-tuning	51	Python
1122	fahadshamshad/awesome-transformers-in-medical-imaging A collection of resources on applications of Transformers in Medical Imaging.	41	Emerging	medical-image-diagnosis-transformers	1,286	—
1123	Breeze648/Transformer-from-Scratch This repository is positioned as AI paper reproduction / implementing the Transformer from scratch. ...	41	Emerging	transformer-architecture-education	33	Python
1124	HarderThenHarder/transformers_tasks ⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification,...	41	Emerging	model-evaluation-diagnostics	2,412	Jupyter Notebook
1125	monologg/KoCharELECTRA Character-level Korean ELECTRA Model (syllable-level Korean ELECTRA)	40	Emerging	korean-language-models	54	Python
1126	kyegomez/SimplifiedTransformers SimplifiedTransformer simplifies transformer block without affecting...	40	Emerging	transformer-architecture-education	15	Python
1127	lin-tan/clm For our ICSE23 paper "Impact of Code Language Models on Automated Program...	40	Emerging	vulnerability-detection-llm	63	Python
1128	NVIDIA/Star-Attention Efficient LLM Inference over Long Sequences	40	Emerging	sparse-attention-optimization	392	Python
1129	thevasudevgupta/bigbird Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers	40	Emerging	korean-language-models	49	Jupyter Notebook
1130	zyds/transformers-code A hands-on, step-by-step Huggingface Transformers course; videos are updated in sync on Bilibili and YouTube	40	Emerging	huggingface-learning-resources	3,853	Jupyter Notebook
1131	ziplab/LIT [AAAI 2022] This is the official PyTorch implementation of "Less is More:...	40	Emerging	transformer-architecture-tutorials	97	Python
1132	luuyin/OWL Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity...	40	Emerging	llm-compression-optimization	81	Python
1133	TrustedLLM/LLMDet LLMDet is a text detection tool that can identify which generated sources...	40	Emerging	ai-generated-text-detection	84	Python
1134	JulesBelveze/bert-squeeze 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡	40	Emerging	bert-model-implementations	85	Python
1135	git-cloner/llama-lora-fine-tuning llama fine-tuning with lora	40	Emerging	lora-qlora-fine-tuning	140	Python
1136	Hsu1023/DuQuant [NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation...	40	Emerging	llm-quantization-techniques	180	Python
1137	amitkedia007/Financial-Fraud-Detection-Using-LLMs The aim of this dissertation is to assess the effectiveness of LLMs such as ...	40	Emerging	ai-stock-analysis	86	Jupyter Notebook
1138	ECNU-ICALK/EduChat An open-source educational chat model from ICALK, East China Normal...	40	Emerging	multilingual-llm-adaptation	913	Jupyter Notebook
1139	HHousen/speaker-change-detection Speaker change detection using SincNet and an LSTM/Transformer	40	Emerging	audio-classification-transformers	57	Jupyter Notebook
1140	boheumd/MA-LMM (2024CVPR) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term...	40	Emerging	multimodal-vision-language	347	Python
1141	molbal/llm-text-completion-finetune Guide on text completion large language model fine-tuning, including example...	40	Emerging	llm-fine-tuning	87	Python
1142	PediaMedAI/AggPose [IJCAI 2022] Official PyTorch implementation of AggPose: Deep Aggregation...	40	Emerging	3d-vision-transformers	30	Python
1143	RedHatResearch/conext24-NetConfEval Benchmark for evaluating LLMs in network configuration problems.	40	Emerging	domain-specific-benchmarks	34	Python
1144	ChristophReich1996/Swin-Transformer-V2 PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up...	40	Emerging	vision-transformer-optimization	205	Python
1145	architkaila/Fine-Tuning-LLMs-for-Medical-Entity-Extraction Exploring the potential of fine-tuning Large Language Models (LLMs) like...	40	Emerging	llm-fine-tuning	89	Python
1146	rishikksh20/CrossViT-pytorch Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer...	40	Emerging	vit-image-classification	208	Python
1147	ukairia777/pytorch-nlp-tutorial A Deep Learning NLP repository that uses PyTorch to cover everything from text preprocessing to RAG, agents, and LLM fine-tuning.	40	Emerging	llm-learning-resources	89	Jupyter Notebook
1148	jha-lab/acceltran [TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers	40	Emerging	power-transformer-design	58	Python
1149	rasbt/blog-finetuning-llama-adapters Supplementary material for "Understanding Parameter-Efficient Finetuning of...	40	Emerging	llm-fine-tuning	48	Jupyter Notebook
1150	johnmai-dev/NotebookMLX 📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)	40	Emerging	llm-training-experimentation	339	Jupyter Notebook
1151	xNul/code-llama-for-vscode Use Code Llama with Visual Studio Code and the Continue extension. A local...	40	Emerging	code-completion-copilots	569	Python
1152	OpenSparseLLMs/LLaMA-MoE-v2 🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of...	40	Emerging	llm-implementation-from-scratch	93	Python
1153	0x7o/RETRO-transformer Easy-to-use Retrieval-Enhanced Transformer implementation	40	Emerging	transformer-architecture-tutorials	10	Python
1154	cdli-gh/Semi-Supervised-NMT-for-Sumerian-English Exploring the Limits of Low-Resource Neural Machine Translation	40	Emerging	neural-machine-translation	34	Jupyter Notebook
1155	kingabzpro/using-llama3-locally Running llama3 using Ollama-Python, Curl, LangChain, Chroma, and User interface.	40	Emerging	generative-ai-learning	59	Jupyter Notebook
1156	infocusp/llm_seminar_series Material for the series of seminars on Large Language Models	40	Emerging	llm-learning-resources	34	Jupyter Notebook
1157	zackshen/gguf a GGUF file parser	40	Emerging	llm-quantization-methods	17	Rust
1158	cooelf/AwesomeMRC IJCAI 2021 Tutorial & code for Retrospective Reader for Machine Reading...	40	Emerging	question-answering-systems	362	Python
1159	rednote-hilab/dots.llm1 The official repository of the dots.llm1 base and instruct models proposed...	40	Emerging	llm-learning-resources	490	—
1160	google-deepmind/gemma_penzai A JAX Research Toolkit for Visualizing, Manipulating, and Understanding...	40	Emerging	lora-qlora-fine-tuning	90	Jupyter Notebook
1161	saqib1707/gpt2-from-scratch PyTorch Implementation of GPT-2	40	Emerging	gpt2-pretraining-fine-tuning	31	Python
1162	huggingface/datablations Scaling Data-Constrained Language Models	40	Emerging	llm-scaling-architecture	342	Jupyter Notebook
1163	VITA-Group/Q-GaLore Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank...	40	Emerging	llm-quantization-techniques	203	Python
1164	vicgalle/zero-shot-reward-models ZYN: Zero-Shot Reward Models with Yes-No Questions	40	Emerging	llm-recommendation-systems	35	Python
1165	trrahul/llama2.cs Inference Llama 2 in one file of pure C#	40	Emerging	local-llm-deployment	102	C#
1166	souzatharsis/tamingLLMs Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software	40	Emerging	llm-training-experimentation	340	Jupyter Notebook
1167	hkust-nlp/deita Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]	40	Emerging	instruction-tuning-datasets	591	Python
1168	aniketmaurya/llm-inference Large Language Model (LLM) Inference API and Chatbot	40	Emerging	llm-inference-engines	127	Python
1169	ai4co/parco [NeurIPS 2025] PARCO: Parallel AutoRegressive Combinatorial Optimization	40	Emerging	mathematical-reasoning-transformers	44	Python
1170	Traffic-Alpha/iLLM-TSC This repository contains the code for the paper "iLLM-TSC: Integration...	40	Emerging	competitive-agent-games	70	Python
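1171	HqWu-HITCS/Awesome-Chinese-LLM A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.	40	Emerging	multilingual-llm-adaptation	22,371	—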
1172	teticio/llama-squad Train Llama 2 & 3 on the SQuAD v2 task as an example of how to specialize a...	40	Emerging	llm-chatbot-interfaces	53	Python
1173	Fsoft-AIC/Grasp-Anything Dataset and Code for ICRA 2024 paper "Grasp-Anything: Large-scale Grasp...	40	Emerging	multimodal-vision-language	219	Python
1174	Omid-Nejati/BEFUnet A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation	40	Emerging	medical-image-segmentation-transformers	73	Python
1175	SamsungSAILMontreal/ghn3 Code for "Can We Scale Transformers to Predict Parameters of Diverse...	40	Emerging	graph-transformers	39	Shell
1176	Bindwell/PLAPT Codebase and CLI for PLAPT: A state-of-the-art protein-ligand binding...	40	Emerging	protein-transformers-ml	114	Mathematica
1177	OpenBMB/InfiniteBench Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K...	40	Emerging	domain-specific-benchmarks	378	Python
1178	aJupyter/ThinkLLM ThinkLLM: 🚀 Lightweight, efficient implementations of large language model algorithms	40	Emerging	llm-frameworks-libraries	114	Jupyter Notebook
1179	l294265421/alpaca-rlhf Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback)...	40	Emerging	rlhf-alignment-training	117	Python
1180	IAAR-Shanghai/Grimoire Grimoire is All You Need for Enhancing Large Language Models	40	Emerging	multilingual-llm-adaptation	117	Python
1181	prismformore/Multi-Task-Transformer Code of ICLR2023 paper "TaskPrompter: Spatial-Channel Multi-Task Prompting...	40	Emerging	vision-transformer-optimization	327	Python
1182	bigcode-project/selfcodealign [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation	40	Emerging	llm-knowledge-editing	323	Python
1183	vmicheli/delta-iris Efficient World Models with Context-Aware Tokenization. ICML 2024	40	Emerging	mathematical-reasoning-transformers	119	Python
1184	harishdeivanayagam/rowfill Open-source spreadsheets platform for deep research and document processing	40	Emerging	interactive-ai-chat-uis	368	TypeScript
1185	takashiishida/paper2slides Transform any arXiv papers into slides using LLMs	40	Emerging	generative-ai-platforms	75	Python
1186	hongyehu/Machine_Learning_Quantum_State_Tomography An unofficial pytorch implementation of using generative models to do...	40	Emerging	machine-translation-transformers	37	Python
1187	DmitryNekrasov/ai-code-completion-idea-plugin Implementation of IntelliJ IDEA code completion plugin using a local LLM.	40	Emerging	code-completion-copilots	18	Kotlin
1188	cgbur/llama2.zig Inference Llama 2 in one file of pure Zig	40	Emerging	local-llm-deployment	211	Zig
1189	asigalov61/Allegro-Music-Transformer Full-attention multi-instrumental music transformer featuring asymmetrical...	40	Emerging	ai-music-generation	48	Python
1190	alexrozanski/LlamaChat Chat with your favourite LLaMA models in a native macOS app	40	Emerging	interactive-ai-chat-uis	1,514	Swift
1191	yifanzhang-pro/HLA Official Project Page for HLA: Higher-order Linear Attention...	40	Emerging	llm-knowledge-distillation	45	HTML
1192	samestrin/llm-pdf-ocr-api A Python-based REST API for PDF OCR using AI models with PyTorch and...	40	Emerging	ocr-document-extraction	34	Python
1193	hans00/react-native-transformers-example Example of transformers.js on React Native	40	Emerging	browser-based-ml-inference	75	TypeScript
1194	elapse-annals/laravel-plus Based on Laravel transformation and expansion, more convenient for practical...	40	Emerging	php-ai-sdks	50	PHP
1195	Sachithx/EntroPE This includes the codebase for EntroPE (Entropy-Guided Dynamic Patch Encoder...	40	Emerging	time-series-forecasting-transformers	41	Python
1196	tlkh/t2t-tuner Convenient Text-to-Text Training for Transformers	40	Emerging	creative-text-generation	19	Jupyter Notebook
1197	Traffic-Alpha/LLM-Assisted-Light This repository contains the code for the paper "LLM-Assisted Light:...	40	Emerging	multimodal-vision-language-models	99	Python
1198	clabrugere/scratch-llm Implements a LLM similar to Meta's Llama 2 from the ground up in PyTorch,...	40	Emerging	llm-implementation-from-scratch	38	Python
1199	amithkoujalgi/ollama-pdf-bot A bot that accepts PDF docs and lets you ask questions on it.	40	Emerging	streamlit-llm-interfaces	212	Python
1200	WENGSYX/LMTuner LMTuner: Make the LLM Better for Everyone	40	Emerging	llm-finetuning-frameworks	38	Python
