All Transformer Models

6,429 models ranked by quality score · Page 16 of 65

Showing 1501–1600 of 6,429

« Prev Next »

#	Model	Score	Tier	Category	Stars	Language
1501	AlexanderVNikitin/kernel-language-entropy Code for Fine-grained Uncertainty Quantification for LLMs from Semantic...	37	Emerging	llm-reasoning-research	36	Python
1502	rbitr/llm.f90 LLM inference in Fortran	37	Emerging	llm-inference-engines	64	Fortran
1503	zjohn77/lightning-mlflow-hf Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow	37	Emerging	llm-fine-tuning	65	Python
1504	xiangking/prompt_uie_torch 基于PaddleNLP开源的抽取式UIE进行医学命名实体识别（torch实现）	37	Emerging	transformer-frameworks-wrappers	44	Python
1505	ksm26/Finetuning-Large-Language-Models Unlock the potential of finetuning Large Language Models (LLMs). Learn from...	37	Emerging	llm-fine-tuning	68	Jupyter Notebook
1506	HomebrewML/HomebrewNLP-torch A case study of efficient training of large language models using commodity hardware.	37	Emerging	gpt-multilingual-training	68	Python
1507	litus-ai/classy classy is a simple-to-use library for building high-performance Machine...	37	Emerging	text-classification-transformers	87	Python
1508	mantasu/cs224n Solutions for CS224n (2022)	37	Emerging	nlp-learning-coursework	72	Python
1509	lliai/D2MoE D^2-MoE: Delta Decompression for MoE-based LLMs Compression	37	Emerging	mixture-of-experts-llms	74	Python
1510	alexeykarnachev/full_stack_transformer Pytorch library for end-to-end transformer models training, inference and serving	37	Emerging	transformer-architecture-tutorials	70	Python
1511	openshieldai/openshield OpenShield is a new generation security layer for AI models	37	Emerging	local-llm-deployment	84	Go
1512	mohyunho/NAS_transformer Evolutionary Neural Architecture Search on Transformers for RUL Prediction	37	Emerging	transformer-architecture-tutorials	50	Python
1513	GithubX-F/DynaMO-RL Dynamic Rollout Allocation and Advantage Modulation for Policy Optimization...	37	Emerging	rlhf-alignment-training	86	Python
1514	GT-RIPL/robo-vln Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics...	37	Emerging	multimodal-fusion-transformers	88	Python
1515	taishi-i/nagisa_bert A BERT model for nagisa	37	Emerging	bert-model-implementations	5	Jupyter Notebook
1516	poteminr/instruct-ner Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models...	37	Emerging	lora-qlora-fine-tuning	89	Python
1517	canjiali/PARADE code and data to faciliate BERT/ELECTRA for document ranking. Details refer...	37	Emerging	semantic-search-retrieval	96	Python
1518	user1342/Tomato LLM steganography with minimum-entropy coupling - Hiding encrypted messages...	37	Emerging	llm-frameworks-libraries	94	Python
1519	all-things-vits/code-samples Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and...	37	Emerging	vit-image-classification	197	Jupyter Notebook
1520	RLado/STB-VMM STB-VMM: Swin Transformer Based Video Motion Magnification (official repository)	37	Emerging	vision-transformer-implementations	50	Python
1521	jhcho99/CoFormer [CVPR'22] Official PyTorch Implementation of "Collaborative Transformers for...	37	Emerging	3d-vision-transformers	50	Python
1522	mu-cai/matryoshka-mm Matryoshka Multimodal Models	37	Emerging	vision-language-instruction-tuning	122	Python
1523	rkansal47/MPGAN The message passing GAN https://arxiv.org/abs/2106.11535 and generative...	37	Emerging	multimodal-fusion-transformers	13	Python
1524	Shanghai-Digital-Brain-Laboratory/BDM-DB1 A large-scale multi-modal pre-trained model	37	Emerging	multimodal-fusion-transformers	134	Python
1525	princeton-nlp/LLMBar [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following	37	Emerging	evaluation-frameworks-metrics	137	Python
1526	zd11024/NaviLLM [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for...	37	Emerging	multimodal-vision-language	229	Python
1527	microsoft/AdaMix This is the implementation of the paper AdaMix: Mixture-of-Adaptations for...	37	Emerging	knowledge-distillation-compression	138	Python
1528	eqimp/hogwild_llm Official PyTorch implementation for Hogwild! Inference: Parallel LLM...	37	Emerging	llm-cuda-optimization	140	Python
1529	zjunlp/Mol-Instructions [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset...	37	Emerging	rlhf-alignment-training	294	Python
1530	CTCycle/ADSMOD-Adsorption-Modeling Streamline adsorption modeling by automatically fitting theoretical...	37	Emerging	molecular-generation-transformers	3	Python
1531	bodeby/torchstack 🫧 probability-level model ensembling for transformers	37	Emerging	transformer-architecture-tutorials	3	Python
1532	DebarshiChanda/Amazon-ML-Challenge2021 Scripts and Approach for Amazon ML Challenge	37	Emerging	semantic-textual-similarity	91	Jupyter Notebook
1533	desaixie/zeroverse Official code for NeurIPS 2024 paper LRM-Zero: Training Large Reconstruction...	37	Emerging	3d-vision-transformers	153	Python
1534	HKUNLP/icl-ceil [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.	37	Emerging	rlhf-alignment-training	103	Python
1535	K-H-Ismail/torchortho [ICLR 2026] Polynomial, trigonometric, and tropical activations	37	Emerging	transformer-architecture-tutorials	16	Jupyter Notebook
1536	joslefaure/HERMES [ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes...	37	Emerging	multimodal-vision-language	38	Python
1537	ImKeTT/AdaVAE [Preprint] AdaVAE: Exploring Adaptive GPT-2s in VAEs for Language Modeling...	37	Emerging	variational-autoencoders-nlp	37	Python
1538	babycommando/machinascript-for-robots Build LLM-powered robots in your garage with MachinaScript For Robots!	37	Emerging	llm-terminal-automation	195	Python
1539	locuslab/massive-activations Code accompanying the paper "Massive Activations in Large Language Models"	37	Emerging	llm-scaling-architecture	197	Python
1540	eduard23144/locoformer 🤖 Explore LocoFormer, a Transformer-XL model that enhances robot locomotion...	37	Emerging	transformer-architecture-tutorials	4	Python
1541	ariya/gamal Research tool leveraging LLM for answers	37	Emerging	llm-terminal-automation	58	JavaScript
1542	lukechilds/humanscript A truly natural scripting language	37	Emerging	llm-terminal-automation	236	Shell
1543	SALT-NLP/LLaVAR Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for...	37	Emerging	multimodal-vision-language	269	Python
1544	promptslab/LLMtuner FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)	37	Emerging	llm-fine-tuning	247	Python
1545	horus-ai-labs/DistillFlow Library for model distillation	37	Emerging	llm-knowledge-distillation	165	Python
1546	juyongjiang/CodeUp CodeUp: A Multilingual Code Generation Llama-X Model with...	37	Emerging	code-model-training	127	Python
1547	extreme-bert/extreme-bert ExtremeBERT is a toolkit that accelerates the pretraining of customized...	37	Emerging	bert-model-frameworks	268	Python
1548	sandesha21/Stock-Market-News-Sentiment-Analysis-and-Summarization NLP pipeline for classifying sentiment in financial news and generating...	37	Emerging	financial-sentiment-analysis	3	Jupyter Notebook
1549	OSU-NLP-Group/AmpleGCG AmpleGCG: Learning a Universal and Transferable Generator of Adversarial...	37	Emerging	adversarial-nlp-robustness	85	Python
1550	yangjianxin1/Firefly Firefly:...	37	Emerging	multilingual-llm-adaptation	6,644	Python
1551	viddexa/moderators One package to moderate them all	37	Emerging	hate-speech-detection	5	Python
1552	FuxiaoLiu/LRV-Instruction [ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust...	37	Emerging	llm-evaluation-frameworks	297	Python
1553	volverjs/ai Hugging Face Transformers.js wrapper for on-device AI with web-workers	37	Emerging	browser-based-ml-inference	8	TypeScript
1554	iiis-ai/cumulative-reasoning [TMLR] Cumulative Reasoning With Large Language Models...	37	Emerging	llm-reasoning-research	308	Python
1555	CVI-SZU/Linly Chinese-LLaMA 1&2、Chinese-Falcon 基础模型；ChatFlow中文对话模型；中文OpenLLaMA模型；NLP预训练/指令微调数据集	37	Emerging	multilingual-llm-adaptation	3,056	Python
1556	iMoonLab/LLM4Hypergraph The source code of ICLR 2025 "Beyond Graphs: Can Large Language Models...	37	Emerging	graph-language-models	38	Python
1557	tommyip/mamba2-minimal Minimal Mamba-2 implementation in PyTorch	37	Emerging	diffusion-language-models	243	Python
1558	ziegler-ingo/cleavage_benchmark [BIBM 2025] Code and resources for the paper "Enhancing Multi-Epitope...	37	Emerging	protein-transformers-ml	6	Python
1559	hyintell/awesome-refreshing-llms EMNLP'23 survey: a curation of awesome papers and resources on refreshing...	37	Emerging	llm-learning-resources	136	—
1560	SimeonHristov99/DL_25-26 Practice sessions for the course "Introduction to deep learning" in the...	37	Emerging	ml-foundations-curricula	4	Jupyter Notebook
1561	huggingface/large_language_model_training_playbook An open collection of implementation tips, tricks and resources for training...	37	Emerging	llm-frameworks-libraries	497	Python
1562	GyanPrakashkushwaha/DataScience EVERYTHING YOU NEED FOR DATA SCIENCE.	37	Emerging	ml-foundations-curricula	6	Jupyter Notebook
1563	softengg-manoj/dreamer4 🌟 Implement Dreamer 4 for training agents within scalable world models,...	37	Emerging	mathematical-reasoning-transformers	4	Python
1564	NVlabs/RocketKV [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage...	36	Emerging	llm-quantization-methods	34	Python
1565	ziplab/HVT [ICCV 2021] Official implementation of "Scalable Vision Transformers with...	36	Emerging	vision-transformer-implementations	33	Python
1566	oValach/RailSafeNet Repository of the paper: RailSafeNet: Visual Scene Understanding for Tram Safety	36	Emerging	medical-image-segmentation-transformers	6	Python
1567	FSoft-AI4Code/CodeCapybara Open-source Self-Instruction Tuning Code LLM	36	Emerging	multilingual-llm-adaptation	172	Python
1568	jaketae/alibi PyTorch implementation of Train Short, Test Long: Attention with Linear...	36	Emerging	transformer-architecture-tutorials	33	Python
1569	AaronFeng753/Ollama-Model-Dumper Export and Backup Ollama models into GGUF and ModelFile	36	Emerging	llm-quantization-methods	92	Python
1570	asigalov61/Perceiver-Music-Transformer SOTA Google's Perceiver-AR Music Transformer Implementation and Model	36	Emerging	ai-music-generation	104	Python
1571	Alsace08/Chain-of-Embedding [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding...	36	Emerging	llm-reasoning-research	95	Python
1572	kaistAI/Janus [NeurIPS 2024] Train LLMs with diverse system messages reflecting...	36	Emerging	rlhf-alignment-training	53	Python
1573	hao-ai-lab/DistCA Efficient Long-context Language Model Training by Core Attention Disaggregation	36	Emerging	diffusion-language-models	93	Python
1574	kyegomez/DifferentialTransformer An open source community implementation of the model from "DIFFERENTIAL...	36	Emerging	power-transformer-design	39	Python
1575	DFKI-NLP/thermostat Collection of NLP model explanations and accompanying analysis tools	36	Emerging	transformer-interpretability-mechanistic	144	Jsonnet
1576	pleisto/yuren-baichuan-7b 基于baichuan-7b的开源多模态大语言模型	36	Emerging	multilingual-llm-adaptation	72	Python
1577	warner-benjamin/commented-transformers Highly commented implementations of Transformers in PyTorch	36	Emerging	transformer-architecture-tutorials	138	Python
1578	sinanuozdemir/oreilly-bert-nlp This repository contains code for the O'Reilly Live Online Training for BERT	36	Emerging	model-evaluation-diagnostics	32	Jupyter Notebook
1579	lifeadventurer/sentify Leveraging Sentiment Analysis on News for Stock Market Insights	36	Emerging	financial-sentiment-analysis	6	Python
1580	AIFrameResearch/SPO Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL...	36	Emerging	rlhf-alignment-training	45	Python
1581	general-preference/general-preference-model [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for...	36	Emerging	direct-preference-optimization	39	Python
1582	harryjdavies/HeartGPT Interpretable Pre-Trained Transformers for Heart Time-Series Data	36	Emerging	academic-thesis-repositories	50	Python
1583	qingsongedu/Awesome-TimeSeries-SpatioTemporal-LM-LLM A professional list on Large (Language) Models and Foundation Models (LLM,...	36	Emerging	multimodal-vision-language-models	1,203	—
1584	weiserlab/TinyLLM Bringing Language Models to the Most Resource Constrained Devices	36	Emerging	llm-frameworks-libraries	50	Python
1585	styfeng/DataAug4NLP Collection of papers and resources for data augmentation for NLP.	36	Emerging	essay-scoring-grading	831	—
1586	zhilizju/Awesome-instruction-tuning A curated list of awesome instruction tuning datasets, models, papers and...	36	Emerging	instruction-tuning-datasets	347	Python
1587	DAMO-NLP-SG/LLM-Zoo LLM Zoo collects information of various open- and close-sourced LLMs	36	Emerging	multilingual-llm-adaptation	271	—
1588	aryan-jadon/Regression-Loss-Functions-in-Time-Series-Forecasting-Tensorflow This repository contains the implementation of paper Temporal Fusion...	36	Emerging	time-series-forecasting-transformers	85	Python
1589	dravenk/ollama-zig Ollama Zig library	36	Emerging	local-llm-deployment	35	Zig
1590	epfl-dlab/llm-latent-language Repo accompanying our paper "Do Llamas Work in English? On the Latent...	36	Emerging	llm-frameworks-libraries	80	Jupyter Notebook
1591	mrdbourke/mac-ml-speed-test A few quick scripts focused on testing TensorFlow/PyTorch/Llama 2 on macOS.	36	Emerging	apple-silicon-llm-inference	202	Jupyter Notebook
1592	chanind/linear-relational Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs)...	36	Emerging	llm-training-experimentation	10	Python
1593	csiro-robotics/HOTFormerLoc [IEEE/CVF CVPR 2025] Hierarchical Octree Transformer for Versatile Lidar...	36	Emerging	3d-vision-transformers	26	Python
1594	mala-lab/SEMPO [NeurIPS 2025] Official implementation of "SEMPO: Lightweight Foundation...	36	Emerging	time-series-forecasting-transformers	18	Python
1595	ahans30/goldfish-loss [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs	36	Emerging	mixup-augmentation-frameworks	97	Python
1596	mytechnotalent/RE-GPT Inspired by Andrej Karpathy’s "Let’s Build GPT", this project guides you...	36	Emerging	gpt2-pretraining-fine-tuning	27	Jupyter Notebook
1597	tlc4418/llm_optimization A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.	36	Emerging	rlhf-alignment-training	47	Python
1598	lechmazur/writing This benchmark tests how well LLMs incorporate a set of 10 mandatory story...	36	Emerging	llm-benchmark-leaderboards	353	Batchfile
1599	yinboc/trans-inr Transformers as Meta-Learners for Implicit Neural Representations, in ECCV 2022	36	Emerging	mixup-augmentation-frameworks	160	Python
1600	chaitjo/gated-graph-transformers Transformers are Graph Neural Networks!	36	Emerging	graph-transformers	54	Python

« Prev 1 2 3 … 14 15 16 17 18 … 63 64 65 Next »