All Transformer Models
6,429 models ranked by quality score · Page 6 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 501 |
inclusionAI/asystem-awex
A high-performance RL training-inference weight synchronization framework,... |
|
Emerging |
| 502 |
olivkoch/nano-trm
An implementation of Tiny Recursive Models (TRM) |
|
Emerging |
| 503 |
NVIDIA-AI-IOT/nanoowl
A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT. |
|
Emerging |
| 504 |
jianghoucheng/AlphaEdit
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models,... |
|
Emerging |
| 505 |
AlekseyKorshuk/optimum-transformers
Accelerated NLP pipelines for fast inference on CPU and GPU. Built with... |
|
Emerging |
| 506 |
ALucek/ppt2desc
Convert PowerPoint files into semantically rich text using vision language models |
|
Emerging |
| 507 |
SakanaAI/doc-to-lora
Hypernetworks that update LLMs to remember factual information |
|
Emerging |
| 508 |
rojagtap/transformer-abstractive-summarization
Abstractive Text Summarization using Transformer |
|
Emerging |
| 509 |
X-D-Lab/LangChain-ChatGLM-Webui
基于LangChain和ChatGLM-6B等系列LLM的针对本地知识库的自动问答 |
|
Emerging |
| 510 |
VHellendoorn/Code-LMs
Guide to using pre-trained large language models of source code |
|
Emerging |
| 511 |
Beomi/KoAlpaca
KoAlpaca: 한국어 명령어를 이해하는 오픈소스 언어모델 (KoAlpaca: An open-source language model... |
|
Emerging |
| 512 |
zyushun/Adam-mini
Code for Adam-mini: Use Fewer Learning Rates To Gain More... |
|
Emerging |
| 513 |
socialfoundations/folktexts
Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on... |
|
Emerging |
| 514 |
worldbank/REaLTabFormer
A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer... |
|
Emerging |
| 515 |
jmont-dev/ollama-hpp
Modern, Header-only C++ bindings for the Ollama API. |
|
Emerging |
| 516 |
livingbio/fuzzy-json
Fuzzy-JSON is a compact Python package with no dependencies, designed to... |
|
Emerging |
| 517 |
ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs) |
|
Emerging |
| 518 |
FoundationVision/Infinity
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for... |
|
Emerging |
| 519 |
Thinklab-SJTU/Crossformer
Official implementation of our ICLR 2023 paper "Crossformer: Transformer... |
|
Emerging |
| 520 |
datawhalechina/llms-from-scratch-cn
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理 |
|
Emerging |
| 521 |
malteos/llm-datasets
A collection of datasets for language model pretraining including scripts... |
|
Emerging |
| 522 |
kyegomez/attn_res
A clean, single-file PyTorch implementation of Attention Residuals (Kimi... |
|
Emerging |
| 523 |
fboulnois/llama-cpp-docker
Run llama.cpp in a GPU accelerated Docker container |
|
Emerging |
| 524 |
graphdeeplearning/graphtransformer
Graph Transformer Architecture. Source code for "A Generalization of... |
|
Emerging |
| 525 |
kyegomez/HLT
Implementation of the transformer from the paper: "Real-World Humanoid... |
|
Emerging |
| 526 |
AndrewZhe/lawyer-llama
中文法律LLaMA (LLaMA for Chinese legel domain) |
|
Emerging |
| 527 |
slwang-ustc/nano-vllm-v1
Nano vLLM with vLLM v1's request scheduling strategy and chunked prefill |
|
Emerging |
| 528 |
curiousily/Deploy-BERT-for-Sentiment-Analysis-with-FastAPI
Deploy BERT for Sentiment Analysis as REST API using FastAPI, Transformers... |
|
Emerging |
| 529 |
locuslab/wanda
A simple and effective LLM pruning approach. |
|
Emerging |
| 530 |
cztomsik/ava
All-in-one desktop app for running LLMs locally. |
|
Emerging |
| 531 |
DaoD/INTERS
This is the repository for our paper "INTERS: Unlocking the Power of Large... |
|
Emerging |
| 532 |
lorenzorovida/FHE-BERT-Tiny
Source code for the paper "Transformer-based Language Models and Homomorphic... |
|
Emerging |
| 533 |
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction... |
|
Emerging |
| 534 |
geeks-of-data/knowledge-gpt
Extract knowledge from all information sources using gpt and other language... |
|
Emerging |
| 535 |
x-tabdeveloping/turftopic
Robust and fast topic models with sentence-transformers. |
|
Emerging |
| 536 |
xusenlinzy/api-for-open-llm
Openai style api for open large language models, using LLMs just as chatgpt!... |
|
Emerging |
| 537 |
back2matching/turboquant
First open-source TurboQuant KV cache compression for LLM inference. Drop-in... |
|
Emerging |
| 538 |
soulteary/docker-llama2-chat
Play LLaMA2 (official / 中文版 / INT4 / llama2.cpp) Together! ONLY 3 STEPS! (... |
|
Emerging |
| 539 |
dali92002/DocEnTR
DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022 |
|
Emerging |
| 540 |
deepseek-ai/Janus
Janus-Series: Unified Multimodal Understanding and Generation Models |
|
Emerging |
| 541 |
HHousen/TransformerSum
Models to perform neural summarization (extractive and abstractive) using... |
|
Emerging |
| 542 |
NVlabs/Eagle
Eagle: Frontier Vision-Language Models with Data-Centric Strategies |
|
Emerging |
| 543 |
haoliuhl/ringattention
Large Context Attention |
|
Emerging |
| 544 |
hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调 |
|
Emerging |
| 545 |
The-FinAI/PIXIU
This repository introduces PIXIU, an open-source resource featuring the... |
|
Emerging |
| 546 |
mim-solutions/bert_for_longer_texts
BERT classification model for processing texts longer than 512 tokens. Text... |
|
Emerging |
| 547 |
Cardinal-Operations/ORLM
ORLM: Training Large Language Models for Optimization Modeling |
|
Emerging |
| 548 |
dusty-nv/NanoLLM
Optimized local inference for LLMs with HuggingFace-like APIs for... |
|
Emerging |
| 549 |
The-Swarm-Corporation/MedGuard
MedGuard is a robust, production-grade Python library that ensures HIPAA... |
|
Emerging |
| 550 |
cedrickchee/awesome-transformer-nlp
A curated list of NLP resources focused on Transformer networks, attention... |
|
Emerging |
| 551 |
kayoyin/transformer-slt
Sign Language Translation with Transformers (COLING'2020, ECCV'20 SLRTP Workshop) |
|
Emerging |
| 552 |
sagorbrur/bangla-bert
Bangla-Bert is a pretrained bert model for Bengali language |
|
Emerging |
| 553 |
kyegomez/Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OPENAI |
|
Emerging |
| 554 |
ycq091044/BIOT
BIOT - A framework for pretraining biosignals at scale. Large EEG pre-trained models. |
|
Emerging |
| 555 |
ymcui/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs... |
|
Emerging |
| 556 |
xianglin226/Benchmarking-Single-Cell-Perturbation
Single-Cell (Perturbation) Model Library |
|
Emerging |
| 557 |
prrao87/tweet-stance-prediction
Applying NLP transfer learning techniques to predict Tweet stance toward a topic |
|
Emerging |
| 558 |
awslabs/mlm-scoring
Python library & examples for Masked Language Model Scoring (ACL 2020) |
|
Emerging |
| 559 |
Uminosachi/open-llm-webui
This repository contains a web application designed to execute relatively... |
|
Emerging |
| 560 |
conceptofmind/LaMDA-rlhf-pytorch
Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding... |
|
Emerging |
| 561 |
Event-AHU/Medical_Image_Analysis
Foundation models based medical image analysis |
|
Emerging |
| 562 |
chuanyangjin/MMToM-QA
[🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind... |
|
Emerging |
| 563 |
obss/trapper
State-of-the-art NLP through transformer models in a modular design and... |
|
Emerging |
| 564 |
leaderj1001/BottleneckTransformers
Bottleneck Transformers for Visual Recognition |
|
Emerging |
| 565 |
Zefan-Cai/KVCache-Factory
Unified KV Cache Compression Methods for Auto-Regressive Models |
|
Emerging |
| 566 |
noahho/CAAFE
Semi-automatic feature engineering process using Language Models and your... |
|
Emerging |
| 567 |
dorarad/gansformer
Generative Adversarial Transformers |
|
Emerging |
| 568 |
raymin0223/mixture_of_recursions
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive... |
|
Emerging |
| 569 |
alpa-projects/alpa
Training and serving large-scale neural networks with auto parallelization. |
|
Emerging |
| 570 |
jackaduma/Recurrent-LLM
The open-source LLM implementation of paper: RecurrentGPT: Interactive... |
|
Emerging |
| 571 |
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs |
|
Emerging |
| 572 |
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V... |
|
Emerging |
| 573 |
horseee/LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language... |
|
Emerging |
| 574 |
jla524/fromthetensor
From the Tensor to Stable Diffusion, a rough outline for a 10 week course. |
|
Emerging |
| 575 |
jeya-maria-jose/TransWeather
Pytorch Code for the paper TransWeather - CVPR 2022 |
|
Emerging |
| 576 |
jobergum/browser-ml-inference
Edge Inference in Browser with Transformer NLP model |
|
Emerging |
| 577 |
vectorch-ai/ScaleLLM
A high-performance inference system for large language models, designed for... |
|
Emerging |
| 578 |
hybridgroup/yzma
Go with your own intelligence - Go applications that directly integrate... |
|
Emerging |
| 579 |
ARM-software/keyword-transformer
Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769 |
|
Emerging |
| 580 |
ssbuild/chatglm_finetuning
chatglm 6b finetuning and alpaca finetuning |
|
Emerging |
| 581 |
jiwidi/Behavior-Sequence-Transformer-Pytorch
This is a pytorch implementation for the BST model from Alibaba... |
|
Emerging |
| 582 |
EleutherAI/gpt-neo
An implementation of model parallel GPT-2 and GPT-3-style models using the... |
|
Emerging |
| 583 |
VinAIResearch/PhoBERT
PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings) |
|
Emerging |
| 584 |
The-AI-Summer/self-attention-cv
Implementation of various self-attention mechanisms focused on computer... |
|
Emerging |
| 585 |
vinjn/llm-metahuman
An open solution for AI-powered photorealistic digital humans. |
|
Emerging |
| 586 |
monologg/KoBERT-Transformers
KoBERT on 🤗 Huggingface Transformers 🤗 (with Bug Fixed) |
|
Emerging |
| 587 |
codewithdark-git/Building-LLMs-from-scratch
This repository guides you through the process of building a GPT-style Large... |
|
Emerging |
| 588 |
MiniMax-AI/MiniMax-M1
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention... |
|
Emerging |
| 589 |
NVlabs/RLP
[ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a... |
|
Emerging |
| 590 |
ariannamethod/molequla
molequla.ai. live ecology of GPT organisms |
|
Emerging |
| 591 |
DUTIR-BioNLP/Taiyi-LLM
Taiyi 2, Biomedical LLM, A Bilingual (Chinese and English) Fine-Tuned Large... |
|
Emerging |
| 592 |
livepeer/ai-runner
Inference runtime for running different batch and real-time AI pipelines. |
|
Emerging |
| 593 |
MegEngine/InferLLM
a lightweight LLM model inference framework |
|
Emerging |
| 594 |
vfeofanov/mantis
Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time... |
|
Emerging |
| 595 |
hila-chefer/Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic... |
|
Emerging |
| 596 |
IDEA-CCNL/Fengshenbang-LM
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。 |
|
Emerging |
| 597 |
zjunlp/KnowLM
An Open-sourced Knowledgable Large Language Model Framework. |
|
Emerging |
| 598 |
georgian-io/LLM-Finetuning-Toolkit
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs. |
|
Emerging |
| 599 |
microsoft/GODEL
Large-scale pretrained models for goal-directed dialog |
|
Emerging |
| 600 |
datawhalechina/base-llm
从 NLP 到 LLM 的算法全栈教程,在线阅读地址:https://datawhalechina.github.io/base-llm/ |
|
Emerging |