All Transformer Models
6,429 models ranked by quality score · Page 12 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 1101 |
Esmail-ibraheem/Axon
AI research lab🔬: implementations of AI papers and theoretical research:... |
|
Emerging |
| 1102 |
ariannamethod/arianna.c
Arianna is a Digital Persona. Embodied cognition as is. |
|
Emerging |
| 1103 |
Multi-Agent-LLMs/mallm
Framework: Multi-Agent LLMs For Conversational Task-Solving (MALLM) |
|
Emerging |
| 1104 |
declare-lab/instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned... |
|
Emerging |
| 1105 |
willxxy/ECG-Bench
A Unified Framework for Benchmarking Generative Electrocardiogram-Language... |
|
Emerging |
| 1106 |
bigscience-workshop/xmtf
Crosslingual Generalization through Multitask Finetuning |
|
Emerging |
| 1107 |
cloudguruab/modsysML
Human reinforcement learning (RLHF) framework for AI models. Evaluate and... |
|
Emerging |
| 1108 |
yang-ai-lab/SleepLM
SleepLM: Natural-Language Intelligence for Human Sleep |
|
Emerging |
| 1109 |
ariannamethod/ariannamethod.ai
Arianna Method Programming Language |
|
Emerging |
| 1110 |
mlvlab/Flipped-VQA
Large Language Models are Temporal and Causal Reasoners for Video Question... |
|
Emerging |
| 1111 |
XunhaoLai/native-sparse-attention-triton
Efficient triton implementation of Native Sparse Attention. |
|
Emerging |
| 1112 |
HHousen/DocSum
A tool to automatically summarize documents abstractively using the BART or... |
|
Emerging |
| 1113 |
ictnlp/LLaVA-Mini
LLaVA-Mini is a unified large multimodal model (LMM) that can support the... |
|
Emerging |
| 1114 |
pjlab-sys4nlp/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual... |
|
Emerging |
| 1115 |
zai-org/GLM-Edge
GLM Series Edge Models |
|
Emerging |
| 1116 |
liuqidong07/MOELoRA-peft
[SIGIR'24] The official implementation code of MOELoRA. |
|
Emerging |
| 1117 |
Beomi/InfiniTransformer
Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No... |
|
Emerging |
| 1118 |
punica-ai/punica
Serving multiple LoRA finetuned LLM as one |
|
Emerging |
| 1119 |
SqueezeAILab/SqueezeLLM
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization |
|
Emerging |
| 1120 |
VITA-MLLM/Freeze-Omni
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with... |
|
Emerging |
| 1121 |
AdrianBZG/llama-multimodal-vqa
Multimodal Instruction Tuning for Llama 3 |
|
Emerging |
| 1122 |
fahadshamshad/awesome-transformers-in-medical-imaging
A collection of resources on applications of Transformers in Medical Imaging. |
|
Emerging |
| 1123 |
Breeze648/Transformer-from-Scratch
本仓库定位为 AI论文复现 / 从零实现 Transformer。 ... |
|
Emerging |
| 1124 |
HarderThenHarder/transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification,... |
|
Emerging |
| 1125 |
monologg/KoCharELECTRA
Character-level Korean ELECTRA Model (음절 단위 한국어 ELECTRA) |
|
Emerging |
| 1126 |
kyegomez/SimplifiedTransformers
SimplifiedTransformer simplifies transformer block without affecting... |
|
Emerging |
| 1127 |
lin-tan/clm
For our ICSE23 paper "Impact of Code Language Models on Automated Program... |
|
Emerging |
| 1128 |
NVIDIA/Star-Attention
Efficient LLM Inference over Long Sequences |
|
Emerging |
| 1129 |
thevasudevgupta/bigbird
Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers |
|
Emerging |
| 1130 |
zyds/transformers-code
手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube |
|
Emerging |
| 1131 |
ziplab/LIT
[AAAI 2022] This is the official PyTorch implementation of "Less is More:... |
|
Emerging |
| 1132 |
luuyin/OWL
Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity... |
|
Emerging |
| 1133 |
TrustedLLM/LLMDet
LLMDet is a text detection tool that can identify which generated sources... |
|
Emerging |
| 1134 |
JulesBelveze/bert-squeeze
🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ |
|
Emerging |
| 1135 |
git-cloner/llama-lora-fine-tuning
llama fine-tuning with lora |
|
Emerging |
| 1136 |
Hsu1023/DuQuant
[NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation... |
|
Emerging |
| 1137 |
amitkedia007/Financial-Fraud-Detection-Using-LLMs
The aim of this dissertation is to assess the effectiveness of LLMs such as ... |
|
Emerging |
| 1138 |
ECNU-ICALK/EduChat
An open-source educational chat model from ICALK, East China Normal... |
|
Emerging |
| 1139 |
HHousen/speaker-change-detection
Speaker change detection using SincNet and an LSTM/Transformer |
|
Emerging |
| 1140 |
boheumd/MA-LMM
(2024CVPR) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term... |
|
Emerging |
| 1141 |
molbal/llm-text-completion-finetune
Guide on text completion large language model fine-tuning, including example... |
|
Emerging |
| 1142 |
PediaMedAI/AggPose
[IJCAI 2022] Official PyTorch implementation of AggPose: Deep Aggregation... |
|
Emerging |
| 1143 |
RedHatResearch/conext24-NetConfEval
Benchmark for evaluating LLMs in network configuration problems. |
|
Emerging |
| 1144 |
ChristophReich1996/Swin-Transformer-V2
PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up... |
|
Emerging |
| 1145 |
architkaila/Fine-Tuning-LLMs-for-Medical-Entity-Extraction
Exploring the potential of fine-tuning Large Language Models (LLMs) like... |
|
Emerging |
| 1146 |
rishikksh20/CrossViT-pytorch
Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer... |
|
Emerging |
| 1147 |
ukairia777/pytorch-nlp-tutorial
pytorch를 사용하여 텍스트 전처리부터 RAG, 에이전트, LLM 파인튜닝을 정리한 Deep Learning NLP 저장소입니다. |
|
Emerging |
| 1148 |
jha-lab/acceltran
[TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers |
|
Emerging |
| 1149 |
rasbt/blog-finetuning-llama-adapters
Supplementary material for "Understanding Parameter-Efficient Finetuning of... |
|
Emerging |
| 1150 |
johnmai-dev/NotebookMLX
📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama) |
|
Emerging |
| 1151 |
xNul/code-llama-for-vscode
Use Code Llama with Visual Studio Code and the Continue extension. A local... |
|
Emerging |
| 1152 |
OpenSparseLLMs/LLaMA-MoE-v2
🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of... |
|
Emerging |
| 1153 |
0x7o/RETRO-transformer
Easy-to-use Retrieval-Enhanced Transformer implementation |
|
Emerging |
| 1154 |
cdli-gh/Semi-Supervised-NMT-for-Sumerian-English
Exploring the Limits of Low-Resource Neural Machine Translation |
|
Emerging |
| 1155 |
kingabzpro/using-llama3-locally
Running llama3 using Ollama-Python, Curl, LangChain, Chroma, and User interface. |
|
Emerging |
| 1156 |
infocusp/llm_seminar_series
Material for the series of seminars on Large Language Models |
|
Emerging |
| 1157 |
zackshen/gguf
a GGUF file parser |
|
Emerging |
| 1158 |
cooelf/AwesomeMRC
IJCAI 2021 Tutorial & code for Retrospective Reader for Machine Reading... |
|
Emerging |
| 1159 |
rednote-hilab/dots.llm1
The official repository of the dots.llm1 base and instruct models proposed... |
|
Emerging |
| 1160 |
google-deepmind/gemma_penzai
A JAX Research Toolkit for Visualizing, Manipulating, and Understanding... |
|
Emerging |
| 1161 |
saqib1707/gpt2-from-scratch
PyTorch Implementation of GPT-2 |
|
Emerging |
| 1162 |
huggingface/datablations
Scaling Data-Constrained Language Models |
|
Emerging |
| 1163 |
VITA-Group/Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank... |
|
Emerging |
| 1164 |
vicgalle/zero-shot-reward-models
ZYN: Zero-Shot Reward Models with Yes-No Questions |
|
Emerging |
| 1165 |
trrahul/llama2.cs
Inference Llama 2 in one file of pure C# |
|
Emerging |
| 1166 |
souzatharsis/tamingLLMs
Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software |
|
Emerging |
| 1167 |
hkust-nlp/deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024] |
|
Emerging |
| 1168 |
aniketmaurya/llm-inference
Large Language Model (LLM) Inference API and Chatbot |
|
Emerging |
| 1169 |
ai4co/parco
[NeurIPS 2025] PARCO: Parallel AutoRegressive Combinatorial Optimization |
|
Emerging |
| 1170 |
Traffic-Alpha/iLLM-TSC
This repository contains the code for the paper“iLLM-TSC: Integration... |
|
Emerging |
| 1171 |
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。 |
|
Emerging |
| 1172 |
teticio/llama-squad
Train Llama 2 & 3 on the SQuAD v2 task as an example of how to specialize a... |
|
Emerging |
| 1173 |
Fsoft-AIC/Grasp-Anything
Dataset and Code for ICRA 2024 paper "Grasp-Anything: Large-scale Grasp... |
|
Emerging |
| 1174 |
Omid-Nejati/BEFUnet
A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation |
|
Emerging |
| 1175 |
SamsungSAILMontreal/ghn3
Code for "Can We Scale Transformers to Predict Parameters of Diverse... |
|
Emerging |
| 1176 |
Bindwell/PLAPT
Codebase and CLI for PLAPT: A state-of-the-art protein-ligand binding... |
|
Emerging |
| 1177 |
OpenBMB/InfiniteBench
Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K... |
|
Emerging |
| 1178 |
aJupyter/ThinkLLM
ThinkLLM:🚀 轻量、高效的大语言模型算法实现 |
|
Emerging |
| 1179 |
l294265421/alpaca-rlhf
Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback)... |
|
Emerging |
| 1180 |
IAAR-Shanghai/Grimoire
Grimoire is All You Need for Enhancing Large Language Models |
|
Emerging |
| 1181 |
prismformore/Multi-Task-Transformer
Code of ICLR2023 paper "TaskPrompter: Spatial-Channel Multi-Task Prompting... |
|
Emerging |
| 1182 |
bigcode-project/selfcodealign
[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation |
|
Emerging |
| 1183 |
vmicheli/delta-iris
Efficient World Models with Context-Aware Tokenization. ICML 2024 |
|
Emerging |
| 1184 |
harishdeivanayagam/rowfill
Open-source spreadsheets platform for deep research and document processing |
|
Emerging |
| 1185 |
takashiishida/paper2slides
Transform any arXiv papers into slides using LLMs |
|
Emerging |
| 1186 |
hongyehu/Machine_Learning_Quantum_State_Tomography
An **unofficial** pytorch implementation of using generative models to do... |
|
Emerging |
| 1187 |
DmitryNekrasov/ai-code-completion-idea-plugin
Implementation of IntelliJ IDEA code completion plugin using a local LLM. |
|
Emerging |
| 1188 |
cgbur/llama2.zig
Inference Llama 2 in one file of pure Zig |
|
Emerging |
| 1189 |
asigalov61/Allegro-Music-Transformer
Full-attention multi-instrumental music transformer featuring asymmetrical... |
|
Emerging |
| 1190 |
alexrozanski/LlamaChat
Chat with your favourite LLaMA models in a native macOS app |
|
Emerging |
| 1191 |
yifanzhang-pro/HLA
Official Project Page for HLA: Higher-order Linear Attention... |
|
Emerging |
| 1192 |
samestrin/llm-pdf-ocr-api
A Python-based REST API for PDF OCR using AI models with PyTorch and... |
|
Emerging |
| 1193 |
hans00/react-native-transformers-example
Example of transformers.js on React Native |
|
Emerging |
| 1194 |
elapse-annals/laravel-plus
Based on Laravel transformation and expansion, more convenient for practical... |
|
Emerging |
| 1195 |
Sachithx/EntroPE
This includes the codebase for EntroPE (Entropy-Guided Dynamic Patch Encoder... |
|
Emerging |
| 1196 |
tlkh/t2t-tuner
Convenient Text-to-Text Training for Transformers |
|
Emerging |
| 1197 |
Traffic-Alpha/LLM-Assisted-Light
This repository contains the code for the paper "LLM-Assisted Light:... |
|
Emerging |
| 1198 |
clabrugere/scratch-llm
Implements a LLM similar to Meta's Llama 2 from the ground up in PyTorch,... |
|
Emerging |
| 1199 |
amithkoujalgi/ollama-pdf-bot
A bot that accepts PDF docs and lets you ask questions on it. |
|
Emerging |
| 1200 |
WENGSYX/LMTuner
LMTuner: Make the LLM Better for Everyone |
|
Emerging |