All Transformer Models

6,429 models ranked by quality score · Page 6 of 65

Showing 501–600 of 6,429

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
501	inclusionAI/asystem-awex A high-performance RL training-inference weight synchronization framework,...	48	Emerging	llm-inference-engines	138	Python
502	olivkoch/nano-trm An implementation of Tiny Recursive Models (TRM)	48	Emerging	transformer-training-optimization	101	Python
503	NVIDIA-AI-IOT/nanoowl A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT.	48	Emerging	transformer-training-optimization	409	Python
504	jianghoucheng/AlphaEdit AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models,...	48	Emerging	llm-knowledge-editing	423	Python
505	AlekseyKorshuk/optimum-transformers Accelerated NLP pipelines for fast inference on CPU and GPU. Built with...	48	Emerging	transformer-training-optimization	126	Python
506	ALucek/ppt2desc Convert PowerPoint files into semantically rich text using vision language models	48	Emerging	ai-presentation-generation	113	Python
507	SakanaAI/doc-to-lora Hypernetworks that update LLMs to remember factual information	48	Emerging	agent-memory-infrastructure	545	Python
508	rojagtap/transformer-abstractive-summarization Abstractive Text Summarization using Transformer	48	Emerging	text-summarization-transformers	168	Jupyter Notebook
509	X-D-Lab/LangChain-ChatGLM-Webui 基于LangChain和ChatGLM-6B等系列LLM的针对本地知识库的自动问答	48	Emerging	multilingual-llm-adaptation	3,307	Python
510	VHellendoorn/Code-LMs Guide to using pre-trained large language models of source code	48	Emerging	llm-finetuning-frameworks	1,842	Python
511	Beomi/KoAlpaca KoAlpaca: 한국어 명령어를 이해하는 오픈소스 언어모델 (KoAlpaca: An open-source language model...	48	Emerging	multilingual-llm-adaptation	1,578	Jupyter Notebook
512	zyushun/Adam-mini Code for Adam-mini: Use Fewer Learning Rates To Gain More...	48	Emerging	llm-compression-optimization	453	Python
513	socialfoundations/folktexts Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on...	48	Emerging	llm-training-experimentation	25	Jupyter Notebook
514	worldbank/REaLTabFormer A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer...	48	Emerging	creative-text-generation	244	Jupyter Notebook
515	jmont-dev/ollama-hpp Modern, Header-only C++ bindings for the Ollama API.	48	Emerging	local-llm-deployment	213	C++
516	livingbio/fuzzy-json Fuzzy-JSON is a compact Python package with no dependencies, designed to...	48	Emerging	llm-quantization-methods	43	Python
517	ymcui/Chinese-LLaMA-Alpaca 中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)	48	Emerging	multilingual-llm-adaptation	18,970	Python
518	FoundationVision/Infinity [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for...	48	Emerging	text-to-image-generation	1,553	Python
519	Thinklab-SJTU/Crossformer Official implementation of our ICLR 2023 paper "Crossformer: Transformer...	48	Emerging	time-series-forecasting-transformers	669	Python
520	datawhalechina/llms-from-scratch-cn 仅需Python基础，从0构建大语言模型；从0逐步构建GLM4\Llama3\RWKV6，深入理解大模型原理	48	Emerging	llm-implementation-from-scratch	4,010	Jupyter Notebook
521	malteos/llm-datasets A collection of datasets for language model pretraining including scripts...	48	Emerging	llm-domain-datasets	64	Python
522	kyegomez/attn_res A clean, single-file PyTorch implementation of Attention Residuals (Kimi...	48	Emerging	transformer-architecture-tutorials	8	Python
523	fboulnois/llama-cpp-docker Run llama.cpp in a GPU accelerated Docker container	48	Emerging	local-llm-deployment	63	Dockerfile
524	graphdeeplearning/graphtransformer Graph Transformer Architecture. Source code for "A Generalization of...	48	Emerging	graph-transformers	1,019	Python
525	kyegomez/HLT Implementation of the transformer from the paper: "Real-World Humanoid...	48	Emerging	transformer-architecture-tutorials	62	Python
526	AndrewZhe/lawyer-llama 中文法律LLaMA (LLaMA for Chinese legel domain)	48	Emerging	multilingual-llm-adaptation	984	Python
527	slwang-ustc/nano-vllm-v1 Nano vLLM with vLLM v1's request scheduling strategy and chunked prefill	48	Emerging	llm-inference-engines	61	Python
528	curiousily/Deploy-BERT-for-Sentiment-Analysis-with-FastAPI Deploy BERT for Sentiment Analysis as REST API using FastAPI, Transformers...	48	Emerging	review-sentiment-classification	209	Python
529	locuslab/wanda A simple and effective LLM pruning approach.	47	Emerging	llm-compression-optimization	854	Python
530	cztomsik/ava All-in-one desktop app for running LLMs locally.	47	Emerging	llm-terminal-automation	465	TypeScript
531	DaoD/INTERS This is the repository for our paper "INTERS: Unlocking the Power of Large...	47	Emerging	instruction-tuning-datasets	207	Python
532	lorenzorovida/FHE-BERT-Tiny Source code for the paper "Transformer-based Language Models and Homomorphic...	47	Emerging	gpt-model-fine-tuning	32	Jupyter Notebook
533	ictnlp/LLaMA-Omni LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction...	47	Emerging	multimodal-vision-language	3,128	Python
534	geeks-of-data/knowledge-gpt Extract knowledge from all information sources using gpt and other language...	47	Emerging	llm-implementation-from-scratch	291	Python
535	x-tabdeveloping/turftopic Robust and fast topic models with sentence-transformers.	47	Emerging	text-clustering-topic-modeling	94	Python
536	xusenlinzy/api-for-open-llm Openai style api for open large language models, using LLMs just as chatgpt!...	47	Emerging	multilingual-llm-adaptation	2,468	Python
537	back2matching/turboquant First open-source TurboQuant KV cache compression for LLM inference. Drop-in...	47	Emerging	llm-quantization-methods	5	Python
538	soulteary/docker-llama2-chat Play LLaMA2 (official / 中文版 / INT4 / llama2.cpp) Together! ONLY 3 STEPS! (...	47	Emerging	local-llm-deployment	538	Python
539	dali92002/DocEnTR DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022	47	Emerging	3d-vision-transformers	186	Jupyter Notebook
540	deepseek-ai/Janus Janus-Series: Unified Multimodal Understanding and Generation Models	47	Emerging	multimodal-vision-language	17,708	Python
541	HHousen/TransformerSum Models to perform neural summarization (extractive and abstractive) using...	47	Emerging	text-summarization-tools	439	Python
542	NVlabs/Eagle Eagle: Frontier Vision-Language Models with Data-Centric Strategies	47	Emerging	vision-language-instruction-tuning	931	Python
543	haoliuhl/ringattention Large Context Attention	47	Emerging	transformer-architecture-tutorials	770	Python
544	hiyouga/ChatGLM-Efficient-Tuning Fine-tuning ChatGLM-6B with PEFT \| 基于 PEFT 的高效 ChatGLM 微调	47	Emerging	rlhf-alignment-training	3,732	Python
545	The-FinAI/PIXIU This repository introduces PIXIU, an open-source resource featuring the...	47	Emerging	multilingual-llm-adaptation	835	Jupyter Notebook
546	mim-solutions/bert_for_longer_texts BERT classification model for processing texts longer than 512 tokens. Text...	47	Emerging	text-classification-transformers	146	Python
547	Cardinal-Operations/ORLM ORLM: Training Large Language Models for Optimization Modeling	47	Emerging	llm-scaling-architecture	237	Python
548	dusty-nv/NanoLLM Optimized local inference for LLMs with HuggingFace-like APIs for...	47	Emerging	nlp-fundamentals-tutorials	359	Python
549	The-Swarm-Corporation/MedGuard MedGuard is a robust, production-grade Python library that ensures HIPAA...	47	Emerging	therapeutic-chatbot-applications	15	Python
550	cedrickchee/awesome-transformer-nlp A curated list of NLP resources focused on Transformer networks, attention...	47	Emerging	transformer-architecture-tutorials	1,131	—
551	kayoyin/transformer-slt Sign Language Translation with Transformers (COLING'2020, ECCV'20 SLRTP Workshop)	47	Emerging	sign-language-recognition	160	ASL
552	sagorbrur/bangla-bert Bangla-Bert is a pretrained bert model for Bengali language	47	Emerging	bert-model-implementations	83	Jupyter Notebook
553	kyegomez/Lets-Verify-Step-by-Step "Improving Mathematical Reasoning with Process Supervision" by OPENAI	47	Emerging	gpt2-pretraining-fine-tuning	114	Python
554	ycq091044/BIOT BIOT - A framework for pretraining biosignals at scale. Large EEG pre-trained models.	47	Emerging	academic-thesis-repositories	182	Python
555	ymcui/Chinese-LLaMA-Alpaca-2 中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs...	47	Emerging	multilingual-llm-adaptation	7,163	Python
556	xianglin226/Benchmarking-Single-Cell-Perturbation Single-Cell (Perturbation) Model Library	47	Emerging	protein-design-llms	93	Python
557	prrao87/tweet-stance-prediction Applying NLP transfer learning techniques to predict Tweet stance toward a topic	47	Emerging	disaster-tweet-classification	107	Jupyter Notebook
558	awslabs/mlm-scoring Python library & examples for Masked Language Model Scoring (ACL 2020)	47	Emerging	end-to-end-asr-frameworks	348	Python
559	Uminosachi/open-llm-webui This repository contains a web application designed to execute relatively...	47	Emerging	interactive-ai-chat-uis	47	Python
560	conceptofmind/LaMDA-rlhf-pytorch Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding...	47	Emerging	rlhf-alignment-training	470	Python
561	Event-AHU/Medical_Image_Analysis Foundation models based medical image analysis	47	Emerging	clinical-llm-tools	213	Python
562	chuanyangjin/MMToM-QA [🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind...	47	Emerging	vision-language-models	154	Python
563	obss/trapper State-of-the-art NLP through transformer models in a modular design and...	47	Emerging	transformer-frameworks-wrappers	47	Python
564	leaderj1001/BottleneckTransformers Bottleneck Transformers for Visual Recognition	47	Emerging	vision-transformer-implementations	279	Python
565	Zefan-Cai/KVCache-Factory Unified KV Cache Compression Methods for Auto-Regressive Models	47	Emerging	kv-cache-optimization	1,309	Python
566	noahho/CAAFE Semi-automatic feature engineering process using Language Models and your...	47	Emerging	feature-selection-frameworks	182	Python
567	dorarad/gansformer Generative Adversarial Transformers	47	Emerging	multimodal-fusion-transformers	1,346	Python
568	raymin0223/mixture_of_recursions Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive...	47	Emerging	mixture-of-experts-llms	548	Python
569	alpa-projects/alpa Training and serving large-scale neural networks with auto parallelization.	47	Emerging	llm-cuda-optimization	3,188	Python
570	jackaduma/Recurrent-LLM The open-source LLM implementation of paper: RecurrentGPT: Interactive...	47	Emerging	multilingual-llm-adaptation	203	Python
571	predibase/lorax Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs	47	Emerging	lora-qlora-fine-tuning	3,735	Python
572	haotian-liu/LLaVA [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V...	47	Emerging	vision-language-instruction-tuning	24,554	Python
573	horseee/LLM-Pruner [NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language...	47	Emerging	llm-pruning-compression	1,109	Python
574	jla524/fromthetensor From the Tensor to Stable Diffusion, a rough outline for a 10 week course.	47	Emerging	ml-foundations-curricula	1,076	—
575	jeya-maria-jose/TransWeather Pytorch Code for the paper TransWeather - CVPR 2022	47	Emerging	time-series-forecasting-transformers	220	Python
576	jobergum/browser-ml-inference Edge Inference in Browser with Transformer NLP model	47	Emerging	browser-based-ml-inference	316	Jupyter Notebook
577	vectorch-ai/ScaleLLM A high-performance inference system for large language models, designed for...	47	Emerging	llm-inference-engines	491	C++
578	hybridgroup/yzma Go with your own intelligence - Go applications that directly integrate...	47	Emerging	local-llm-deployment	350	Go
579	ARM-software/keyword-transformer Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769	47	Emerging	transformer-architecture-education	138	Jupyter Notebook
580	ssbuild/chatglm_finetuning chatglm 6b finetuning and alpaca finetuning	47	Emerging	llm-finetuning-frameworks	1,537	Python
581	jiwidi/Behavior-Sequence-Transformer-Pytorch This is a pytorch implementation for the BST model from Alibaba...	47	Emerging	transformer-architecture-tutorials	176	Jupyter Notebook
582	EleutherAI/gpt-neo An implementation of model parallel GPT-2 and GPT-3-style models using the...	47	Emerging	gpt2-pretraining-fine-tuning	8,286	Python
583	VinAIResearch/PhoBERT PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)	47	Emerging	korean-language-models	775	—
584	The-AI-Summer/self-attention-cv Implementation of various self-attention mechanisms focused on computer...	47	Emerging	transformer-architecture-tutorials	1,215	Python
585	vinjn/llm-metahuman An open solution for AI-powered photorealistic digital humans.	47	Emerging	llm-terminal-automation	138	Python
586	monologg/KoBERT-Transformers KoBERT on 🤗 Huggingface Transformers 🤗 (with Bug Fixed)	47	Emerging	korean-language-models	212	Python
587	codewithdark-git/Building-LLMs-from-scratch This repository guides you through the process of building a GPT-style Large...	47	Emerging	llm-implementation-from-scratch	51	Jupyter Notebook
588	MiniMax-AI/MiniMax-M1 MiniMax-M1, the world's first open-weight, large-scale hybrid-attention...	46	Emerging	llm-frameworks-libraries	3,115	Python
589	NVlabs/RLP [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a...	46	Emerging	rlhf-alignment-training	241	—
590	ariannamethod/molequla molequla.ai. live ecology of GPT organisms	46	Emerging	prompt-engineering-security	50	C
591	DUTIR-BioNLP/Taiyi-LLM Taiyi 2, Biomedical LLM, A Bilingual (Chinese and English) Fine-Tuned Large...	46	Emerging	llm-frameworks-libraries	163	Python
592	livepeer/ai-runner Inference runtime for running different batch and real-time AI pipelines.	46	Emerging	llm-inference-engines	25	Python
593	MegEngine/InferLLM a lightweight LLM model inference framework	46	Emerging	llm-inference-engines	747	C++
594	vfeofanov/mantis Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time...	46	Emerging	time-series-forecasting-transformers	105	Jupyter Notebook
595	hila-chefer/Transformer-MM-Explainability [ICCV 2021- Oral] Official PyTorch implementation for Generic...	46	Emerging	transformer-interpretability-mechanistic	903	Jupyter Notebook
596	IDEA-CCNL/Fengshenbang-LM Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系，成为中文AIGC和认知智能的基础设施。	46	Emerging	multilingual-llm-adaptation	4,149	Python
597	zjunlp/KnowLM An Open-sourced Knowledgable Large Language Model Framework.	46	Emerging	llm-benchmark-leaderboards	1,376	Python
598	georgian-io/LLM-Finetuning-Toolkit Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.	46	Emerging	llm-fine-tuning	870	Python
599	microsoft/GODEL Large-scale pretrained models for goal-directed dialog	46	Emerging	conversational-chatbot-applications	889	Python
600	datawhalechina/base-llm 从 NLP 到 LLM 的算法全栈教程，在线阅读地址：https://datawhalechina.github.io/base-llm/	46	Emerging	llm-training-experimentation	421	Jupyter Notebook

« Prev 1 2 3 4 5 6 7 8 … 63 64 65 Next »