All Transformer Models
6,429 models ranked by quality score · Page 10 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 901 |
jianghoucheng/AnyEdit
AnyEdit: Edit Any Knowledge Encoded in Language Models, ICML 2025 |
|
Emerging |
| 902 |
huangwl18/language-planner
Official Code for "Language Models as Zero-Shot Planners: Extracting... |
|
Emerging |
| 903 |
Intelligent-CAT-Lab/PLTranslationEmpirical
Artifact repository for the paper "Lost in Translation: A Study of Bugs... |
|
Emerging |
| 904 |
elicit/machine-learning-list
A curriculum for learning about foundation models, from scratch to the frontier |
|
Emerging |
| 905 |
WangRongsheng/ChatGenTitle
🌟 ChatGenTitle:使用百万arXiv论文信息在LLaMA模型上进行微调的论文题目生成模型 |
|
Emerging |
| 906 |
MozerWang/AMPO
[ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents |
|
Emerging |
| 907 |
A-baoYang/alpaca-7b-chinese
Finetune LLaMA-7B with Chinese instruction datasets |
|
Emerging |
| 908 |
therealoliver/Deepdive-llama3-from-scratch
Achieve the llama3 inference step-by-step, grasp the core concepts, master... |
|
Emerging |
| 909 |
johnmai-dev/ChatMLX
🤖✨ChatMLX is a modern, open-source, high-performance chat application for... |
|
Emerging |
| 910 |
oxpig/CaLM
Protein language model trained on coding DNA |
|
Emerging |
| 911 |
Gunale0926/SORSA
SORSA: Singular Values and Orthonormal Regularized Singular Vectors... |
|
Emerging |
| 912 |
microsoft/interwhen
A framework for verifiable reasoning with language models. |
|
Emerging |
| 913 |
tosiyuki/LLaVA-JP
LLaVA-JP is a Japanese VLM trained by LLaVA method |
|
Emerging |
| 914 |
amirfeder/CausaLM
CausaLM: Causal Model Explanation Through Counterfactual Language Models |
|
Emerging |
| 915 |
gaussalgo/adaptor
ACL 2022: Adaptor: a library to easily adapt a language model to your own... |
|
Emerging |
| 916 |
EleutherAI/DALLE-mtf
Open-AI's DALL-E for large scale training in mesh-tensorflow. |
|
Emerging |
| 917 |
TextGeneratorio/text-generator.io
Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io |
|
Emerging |
| 918 |
sayakpaul/robustness-vit
Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022). |
|
Emerging |
| 919 |
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation |
|
Emerging |
| 920 |
grctest/FastAPI-BitNet
Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker. |
|
Emerging |
| 921 |
ArdaGnsrn/ollama-php
This is a PHP library for Ollama. Ollama is an open-source project that... |
|
Emerging |
| 922 |
biodatlab/thonburian-whisper
Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo... |
|
Emerging |
| 923 |
ZO-Bench/ZO-LLM
[ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization... |
|
Emerging |
| 924 |
AntixK/PyTorch-Model-Compare
Compare neural networks by their feature similarity |
|
Emerging |
| 925 |
Dartvauder/NeuroSandboxWebUI
(Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image,... |
|
Emerging |
| 926 |
Yachay-AI/byt5-geotagging
Confidence and Byt5 - based geotagging model predicting coordinates from text alone. |
|
Emerging |
| 927 |
CVxTz/music_genre_classification
music genre classification : LSTM vs Transformer |
|
Emerging |
| 928 |
bilibili/Index-1.9B
A lightweight multilingual LLM |
|
Emerging |
| 929 |
ivanfioravanti/wine_variety_classification
Examples on how to use various LLM providers with a Wine Classification problem |
|
Emerging |
| 930 |
nova-land/gbnf-compiler
Plug n Play GBNF Compiler for llama.cpp |
|
Emerging |
| 931 |
CAMeL-Lab/CAMeLBERT
Code and models for "The Interplay of Variant, Size, and Task Type in Arabic... |
|
Emerging |
| 932 |
HamedBabaei/LLMs4OM
LLMs4OM: Matching Ontologies with Large Language Models |
|
Emerging |
| 933 |
hyperonym/basaran
Basaran is an open-source alternative to the OpenAI text completion API. It... |
|
Emerging |
| 934 |
sinanuozdemir/oreilly-ai-pipelines
Designing and Deploying LLM Pipelines |
|
Emerging |
| 935 |
softmax1/Flash-Attention-Softmax-N
CUDA and Triton implementations of Flash Attention with SoftmaxN. |
|
Emerging |
| 936 |
waikato-llm/llm-dataset-converter
For converting LLM datasets from one format into another. |
|
Emerging |
| 937 |
aimclub/FEDOT.LLM
LLM-based prototype for nexgen AutoML |
|
Emerging |
| 938 |
ZinYY/Online_RLHF
A PyTorch implementation of the paper "Provably Efficient Online RLHF with... |
|
Emerging |
| 939 |
bhavnicksm/vanilla-transformer-jax
JAX/Flax implimentation of 'Attention Is All You Need' by Vaswani et al.... |
|
Emerging |
| 940 |
josStorer/selfhostedAI
A collection of one-click self-hosted AI |
|
Emerging |
| 941 |
HyperCluster-Tech/manimator
Transform research papers and mathematical concepts into stunning visual... |
|
Emerging |
| 942 |
xyjigsaw/LLM-Pretrain-SFT
Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed) |
|
Emerging |
| 943 |
kyegomez/qformer
Implementation of Qformer from BLIP2 in Zeta Lego blocks. |
|
Emerging |
| 944 |
AmpereComputingAI/ampere_model_library
AML's goal is to make benchmarking of various AI architectures on Ampere... |
|
Emerging |
| 945 |
EncrEor/rlm-claude
Recursive Language Models for Claude Code - Infinite memory solution... |
|
Emerging |
| 946 |
shivendrra/SmallLanguageModel
a LLM cookbook, for building your own from scratch, all the way from... |
|
Emerging |
| 947 |
efeslab/fiddler
[ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration |
|
Emerging |
| 948 |
thunlp/InfLLM
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for... |
|
Emerging |
| 949 |
NVIDIA/Cosmos-Tokenizer
A suite of image and video neural tokenizers |
|
Emerging |
| 950 |
palewire/first-llm-classifier
Learn how journalists use large-language models to organize and analyze... |
|
Emerging |
| 951 |
ByteDance-Seed/FlexPrefill
Code for paper: [ICLR2025 Oral] FlexPrefill: A Context-Aware Sparse... |
|
Emerging |
| 952 |
monologg/GoEmotions-Korean
Korean version of GoEmotions Dataset 😍😢😱 |
|
Emerging |
| 953 |
zjunlp/KnowledgeCircuits
[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers |
|
Emerging |
| 954 |
uclaml/SPPO
The official implementation of Self-Play Preference Optimization (SPPO) |
|
Emerging |
| 955 |
LLukas22/llm-rs-python
Unofficial python bindings for the rust llm library. 🐍❤️🦀 |
|
Emerging |
| 956 |
RobertCsordas/ndr
The official repository for our paper "The Neural Data Router: Adaptive... |
|
Emerging |
| 957 |
RobertCsordas/transformer_generalization
The official repository for our paper "The Devil is in the Detail: Simple... |
|
Emerging |
| 958 |
shushanxingzhe/transformers_ner
Add CRF or LSTM+CRF for huggingface transformers bert to perform better on... |
|
Emerging |
| 959 |
AviSoori1x/seemore
From scratch implementation of a vision language model in pure PyTorch |
|
Emerging |
| 960 |
calcuis/gguf-core
a simple way to interact llama with gguf |
|
Emerging |
| 961 |
garyb9/twitter-llm-bot
Fully automatic asynchronous AI operated Twitter bot using Large Language... |
|
Emerging |
| 962 |
sedthh/BeatLearning
Open Source Generative AI Models for Automatic Rhythm Game Beatmap... |
|
Emerging |
| 963 |
nlp-uoregon/mlmm-evaluation
Multilingual Large Language Models Evaluation Benchmark |
|
Emerging |
| 964 |
canyuchen/ClinicalBench
Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in... |
|
Emerging |
| 965 |
golsun/DialogRPT
EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data" |
|
Emerging |
| 966 |
ai-forever/mgpt
Multilingual Generative Pretrained Model |
|
Emerging |
| 967 |
asigalov61/SuperPiano
Absolutely amazing SOTA Google Colab (Jupyter) Notebooks for... |
|
Emerging |
| 968 |
monologg/DistilKoBERT
Distillation of KoBERT from SKTBrain (Lightweight KoBERT) |
|
Emerging |
| 969 |
jaisidhsingh/pytorch-mixtures
One-stop solutions for Mixture of Expert modules in PyTorch. |
|
Emerging |
| 970 |
lamalab-org/MatText
Text-based modeling of materials. |
|
Emerging |
| 971 |
princeton-nlp/LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via... |
|
Emerging |
| 972 |
Dicklesworthstone/llm_introspective_compression_and_metacognition
A novel approach for transformer model introspection that enables saving,... |
|
Emerging |
| 973 |
AbdelStark/attnres
Rust implementation of Attention Residuals from MoonshotAI/Kimi |
|
Emerging |
| 974 |
ChanithaAbey/AI-Agent-for-Stock-Prediction
An AI Agent for stock data analysis, news rerieval, and prediction; powered... |
|
Emerging |
| 975 |
sail-sg/understand-r1-zero
Understanding R1-Zero-Like Training: A Critical Perspective |
|
Emerging |
| 976 |
ayaka14732/llama-2-jax
JAX implementation of the Llama 2 model |
|
Emerging |
| 977 |
harleyszhang/lite_llama
A light llama-like llm inference framework based on the triton kernel. |
|
Emerging |
| 978 |
illiterate/BertClassifier
基于PyTorch的BERT中文文本分类模型(BERT Chinese text classification model implemented by PyTorch) |
|
Emerging |
| 979 |
the-crypt-keeper/can-ai-code
Self-evaluating interview for AI coders |
|
Emerging |
| 980 |
westlake-repl/IDvs.MoRec
End-to-end Training for Multimodal Recommendation Systems |
|
Emerging |
| 981 |
lenguajenatural-ai/autotransformers
A Python package for automatically training and comparing language models. |
|
Emerging |
| 982 |
jingedawang/TutorialLLM
LLM Tutorial for Everyone. |
|
Emerging |
| 983 |
hellotransformers/Natural_Language_Processing_with_Transformers
Natural Language Processing with Transformers 中译本,最权威Transformers教程 |
|
Emerging |
| 984 |
gotzmann/llama.go
llama.go is like llama.cpp in pure Golang! |
|
Emerging |
| 985 |
mojivalipour/symbolicgpt
Symbolic regression is the task of identifying a mathematical expression... |
|
Emerging |
| 986 |
njchoma/transformer_image_caption
Image Captioning based on Bottom-Up and Top-Down Attention model |
|
Emerging |
| 987 |
jankais3r/LLaMA_MPS
Run LLaMA (and Stanford-Alpaca) inference on Apple Silicon GPUs. |
|
Emerging |
| 988 |
leehanchung/lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA |
|
Emerging |
| 989 |
menon92/BangalASR
Transformer based Bangla Speech Recognition | Encoder Decoder Architecture |
|
Emerging |
| 990 |
ssbuild/deep_training
deep learning |
|
Emerging |
| 991 |
gitctrlx/llama.go
Llama from scratch in Go. |
|
Emerging |
| 992 |
sinanuozdemir/foundations-of-gen-ai
Transformer Architectures for Generative AI |
|
Emerging |
| 993 |
ruanchaves/napolab
The Natural Portuguese Language Benchmark (Napolab). Stay up to date with... |
|
Emerging |
| 994 |
harleyszhang/llm_note
LLM notes, including model inference, transformer model structure, and llm... |
|
Emerging |
| 995 |
argosopentech/MetalTranslate
Customizable machine translation in C++ |
|
Emerging |
| 996 |
mbzuai-oryx/Awesome-LLM-Post-training
Awesome Reasoning LLM Tutorial/Survey/Guide |
|
Emerging |
| 997 |
dohlee/chromoformer
The official code implementation for Chromoformer in PyTorch. (Lee et al.,... |
|
Emerging |
| 998 |
zetavg/LLaMA-LoRA-Tuner
UI tool for fine-tuning and testing your own LoRA models base on LLaMA,... |
|
Emerging |
| 999 |
NVlabs/GroupViT
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges... |
|
Emerging |
| 1000 |
waltonfuture/Diabetica
[SCI-FM@ICLR 2025] Specialized LLMs capable of handling various diabetes tasks |
|
Emerging |