All Transformer Models
6,429 models ranked by quality score · Page 31 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 3001 |
xufangzhi/Symbol-LLM
[ACL 2024] The project of Symbol-LLM |
|
Experimental |
| 3002 |
visresearch/LLaVA-STF
The official implementation of "Learning Compact Vision Tokens for Efficient... |
|
Experimental |
| 3003 |
Qingfeng-233/KeyAtten
KeyAtten: Attention-based Zero-Shot Keyword & Keyphrase Extraction |
|
Experimental |
| 3004 |
Asha-Gutlapalli/Drug-Recommendation-System-based-on-the-Condition-of-the-Patient-using-BERT
Patients are recommended drugs based on their condition and reviews of the... |
|
Experimental |
| 3005 |
is-leeroy-jenkins/Mathy
Machine-learning algorithms for pre-processing, classification, regression,... |
|
Experimental |
| 3006 |
cleopatra-itn/claim_detection
Code for tasks in the paper "Check\_square at CheckThat! 2020: Claim... |
|
Experimental |
| 3007 |
Curated-Awesome-Lists/Awesome-Llama3
A curated, awesome list of resources, tools, and projects for the AI Large... |
|
Experimental |
| 3008 |
NC0DER/GreekT5
A series of Greek News Summarization Sequence-to-Sequence Models built with... |
|
Experimental |
| 3009 |
artpli/CodeIE
[ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot... |
|
Experimental |
| 3010 |
garyb9/pytorch-transformers
Transformers architecture code playground repository in python using PyTorch. |
|
Experimental |
| 3011 |
marqinhos/MedicalLiverSegmentationToolKit
Medical Toolkit for Liver Volume Segmentation |
|
Experimental |
| 3012 |
bobazooba/shurale
Conversation AI model for open domain dialogs |
|
Experimental |
| 3013 |
CServinL/tbot
A multimodal AI bot for your terminal |
|
Experimental |
| 3014 |
dinhquy-nguyen-1704/ZaloAI2023-Elementary-Math-Solving
Baseline achieving 0.8 accuracy on the private test set in the ZaloAI... |
|
Experimental |
| 3015 |
Human-Centric-Machine-Learning/counterfactual-llms
Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024. |
|
Experimental |
| 3016 |
NamrataThakur/Large_Language_Model_From_Scratch_Implementation
Implementing an LLM from scratch block-by-block using PyTorch |
|
Experimental |
| 3017 |
ayaka14732/TrAVis
TrAVis: Visualise BERT attention in your browser |
|
Experimental |
| 3018 |
byroneverson/Mia
A simple swift app for MacOS/iOS to test large language models (LLM) |
|
Experimental |
| 3019 |
En1gma02/Proteomic-and-Genomic-Drug-Development
An end-to-end generative AI pipeline for Proteomic and Genomic drug... |
|
Experimental |
| 3020 |
mrcabbage972/simple-toolformer
A Python implementation of Toolformer using Huggingface Transformers |
|
Experimental |
| 3021 |
avatsaev/av-local-llm-api
Allows to easily run local REST API with a custom LLM, running locally or... |
|
Experimental |
| 3022 |
Brazilian-willametteriver232/llama.swift
🚀 Access llama.cpp easily in your Swift projects, leveraging precompiled... |
|
Experimental |
| 3023 |
mourga/transformer-uncertainty
Code for evaluating uncertainty estimation methods for Transformer-based... |
|
Experimental |
| 3024 |
GreenScreen410/LYMT
LYMT: Let Your Model Think |
|
Experimental |
| 3025 |
nullHawk/simple-transformer
Implementation of Transformer model in PyTorch |
|
Experimental |
| 3026 |
liziniu/policy_optimization
Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data) |
|
Experimental |
| 3027 |
elijahnzeli1/CausalTorch
CausalTorch is a PyTorch library for building generative models with... |
|
Experimental |
| 3028 |
Katashynskyi/Voice_assistant_UA_EN
No api-keys | local | llama3.1 For language studying and live translation |
|
Experimental |
| 3029 |
LightDopper/skill-codex
🚀 Enable automated code analysis and editing with Claude Code using Codex... |
|
Experimental |
| 3030 |
cvedix/omnisdk
On-device AI deloper platform |
|
Experimental |
| 3031 |
ItzDerock/llama-playground
A simple to use and powerful web-interface to mess around with Meta's LLaMA LLM. |
|
Experimental |
| 3032 |
bpevangelista/vfastml
Inference and Training Engine for LLMs, Image2Image and Other Models |
|
Experimental |
| 3033 |
assembly-automation-hub/Issues-github-actions-Llama
🤖 GitHub Action that analyzes push/PR diffs with Llama AI and auto-creates... |
|
Experimental |
| 3034 |
lazy-guy/chess-llama
Tiny Llama model trained to play chess |
|
Experimental |
| 3035 |
ai4sd/multiscale-byte-lm
A hierarchical LM that scales to training on context windows of +5M tokens |
|
Experimental |
| 3036 |
xarillian/GDLlama
A working and actively maintained GDExtension for running local LLMs in... |
|
Experimental |
| 3037 |
Pragyan2004/Polyglot_AI
Polyglot AI is a developer-focused platform that converts visual coding... |
|
Experimental |
| 3038 |
Aradhye2002/selective-peft-toolkit
Official implementation of the paper "Step-by-Step Unmasking for... |
|
Experimental |
| 3039 |
randomtask2000/MultiShot.AI
This project creates a real-time conversational AI, either serverless via... |
|
Experimental |
| 3040 |
SauravP97/toy-transformer
A decoder only Transformer implementing masked attention |
|
Experimental |
| 3041 |
SciCrunch/bio_electra
Bio-Electra - Small and efficient discriminatively pre-trained language... |
|
Experimental |
| 3042 |
Giyanellow/llama-chatbot-with-ui
This project provides a comprehensive template for self-hosting a Large... |
|
Experimental |
| 3043 |
MNThomson/chat
chat - platform agnostic "ai" cli |
|
Experimental |
| 3044 |
Nutanpatil06/Fine-Tuning-LLM-with-LLaMA-Factory
Complete LoRA/QLoRA implementation using LLaMA Factory. Fine-tune models... |
|
Experimental |
| 3045 |
Iteranya/AktivaAI
Local LLM Discord Bot |
|
Experimental |
| 3046 |
hplt-project/monolingual-multilingual-instruction-tuning
Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca |
|
Experimental |
| 3047 |
knoveleng/steering
Official repo for the paper: "Selective Steering: Norm-Preserving Control... |
|
Experimental |
| 3048 |
uw-swag/tokdrift
Repository for TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar. |
|
Experimental |
| 3049 |
Ketis21/KetisBot
KetisBot is a powerful Discord AI chatbot using KoboldCpp for text... |
|
Experimental |
| 3050 |
SkillichSE/Lumi-bot
A Telegram bot powered by aiogram integrated with a local LLM (LM Studio).... |
|
Experimental |
| 3051 |
piratheon/LB-llm_training_scripts
A bunch of script to train your own offsec LLM |
|
Experimental |
| 3052 |
Koziev/LM-pretrain
Char-level language model pretraining code and scripts |
|
Experimental |
| 3053 |
tripathiarpan20/self-improvement-4all
Private self-improvement coaching with open-source LLMs |
|
Experimental |
| 3054 |
DoctorLai/SimilarString
Compute the score of similarity between two strings |
|
Experimental |
| 3055 |
toriving/haafor-challenge-2020
The project for HAAFOR CHALLENGE 2020 |
|
Experimental |
| 3056 |
NellyW8/VeriReason
This is the Github Repo for the paper: VeriReason: Reinforcement Learning... |
|
Experimental |
| 3057 |
ashimmortallp/mHC-manifold-constrained-hyper-connections
🔍 Explore mHC for manifold-constrained hyper-connections in PyTorch,... |
|
Experimental |
| 3058 |
calcuis/llama-core
solo connector core built on llama.cpp |
|
Experimental |
| 3059 |
gmontamat/poor-mans-transformers
Implement Transformers (and Deep Learning) from scratch in NumPy |
|
Experimental |
| 3060 |
jaketae/tupe
PyTorch implementation of Rethinking Positional Encoding in Language Pre-training |
|
Experimental |
| 3061 |
kurnevsky/llama-cpp.el
A client for llama-cpp server |
|
Experimental |
| 3062 |
trialandsuccess/verysimpletransformers
Very Simple Transformers provides a simplified interface for packaging,... |
|
Experimental |
| 3063 |
Ultron09/Mirror_mind
A production-ready adaptive meta-learning framework for continuous... |
|
Experimental |
| 3064 |
rishabkr/Attention-Is-All-You-Need-Explained-PyTorch
A paper implementation and tutorial from scratch combining various great... |
|
Experimental |
| 3065 |
LazerLambda/Promptzl
Turn LLMs into zero-shot PyTorch classifiers! |
|
Experimental |
| 3066 |
bandirevanth/aiml-codex
A curated collection of my AI & ML projects - crafting tomorrow’s smart... |
|
Experimental |
| 3067 |
Victorwz/VaLM
VaLM: Visually-augmented Language Modeling. ICLR 2023. |
|
Experimental |
| 3068 |
Boykadakim/User-Clustering-with-BERT-Models
User Clustering Pipelines with BERT Models on Long and Heterogeneous Tweets... |
|
Experimental |
| 3069 |
seanpm2001/DALL-E_LLaMA
🤖️🦙️🧠️ DALL-E LLaMA is a combination of DALL-E and LLaMA (Large Language... |
|
Experimental |
| 3070 |
tunib-ai/joker
AI model designed to test the effectiveness in handling external ethical attacks. |
|
Experimental |
| 3071 |
Uokoroafor/transformer_from_scratch
This is a PyTorch implementation of the Transformer model in the paper... |
|
Experimental |
| 3072 |
iqbal-sk/Detecting-Persuasion-Techniques-in-Memes
Hierarchical, multilingual, multimodal detection of persuasion techniques in... |
|
Experimental |
| 3073 |
kyegomez/open_qwen
A non-official implementation of Qwen 3.5, as there doesn’t seem to be a... |
|
Experimental |
| 3074 |
PCfVW/plip-rs
Mechanistic interpretability toolkit for code LLMs, in Rust. Analysis of... |
|
Experimental |
| 3075 |
unsanitary-bek/mlx-skills
🚀 Enhance your machine learning workflow with essential MLX skills from this... |
|
Experimental |
| 3076 |
bowen-upenn/llm_token_bias
[EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet... |
|
Experimental |
| 3077 |
lucky-verma/SaastIE
Document understanding system using Donut transformer architecture |
|
Experimental |
| 3078 |
th789/mbr-for-nmt
Characterizing the performance of minimum Bayes risk (MBR) decoding for... |
|
Experimental |
| 3079 |
dhakalnirajan/LLaMA-BitNet
LLaMA-BitNet is a repository dedicated to empowering users to train their... |
|
Experimental |
| 3080 |
seanpm2001/DALL-E_LLaMA_Docs
🤖️🦙️🧠️📖️ The official documentation source repository for DALL-E LLaMA, a... |
|
Experimental |
| 3081 |
Spico197/MoE-SFT
🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction... |
|
Experimental |
| 3082 |
sitammeur/gliner-litserve
Leverage ModernGLiNER's capabilities using LitServe. |
|
Experimental |
| 3083 |
liuqidong07/Awesome-LLM-Enhanced-Recommender-Systems
[KDD'25] Large Language Model Enhanced Recommender Systems: Methods,... |
|
Experimental |
| 3084 |
yihedeng9/rlhf-summary-notes
A brief and partial summary of RLHF algorithms. |
|
Experimental |
| 3085 |
naderabdelghany/project-rev
A proof-of-concept audio-interactive personalized chatbot based on Ted... |
|
Experimental |
| 3086 |
neeleshbhalla/transformers_for_time_series_forecasting
Inferencing 'PatchTST' and 'Informer' to harness the power of transformers... |
|
Experimental |
| 3087 |
s-omranpour/MIDI-Transformer
Another implementation of the paper "Compound Word Transformer: Learning to... |
|
Experimental |
| 3088 |
sarnikowski/danish_transformers
A collection of Danish Transformers |
|
Experimental |
| 3089 |
avrtt/QASATIK
LLM-based Q&A on preloaded docs, raw data, Wikipedia articles and scraped... |
|
Experimental |
| 3090 |
twitter-research/multilingual-alignment-tpp
Code for reproducing the paper Improved Multilingual Language Model... |
|
Experimental |
| 3091 |
Stamir36/CursusAI-ChatBot
Chatbot based on artificial intelligence (AI) for communication, image... |
|
Experimental |
| 3092 |
raimbekovm/cs231n-2025-notes
📚 Comprehensive lecture notes for Stanford CS231n: Deep Learning for... |
|
Experimental |
| 3093 |
declare-lab/della
DELLA-Merging: Reducing Interference in Model Merging through... |
|
Experimental |
| 3094 |
Beomi/megatronlm_dataset_autotokenizer
Megatron-LM/GPT-NeoX compatible Text Encoder with 🤗Transformers AutoTokenizer. |
|
Experimental |
| 3095 |
sahsaeedi/TPO
[TMLR] Triple Preference Optimization |
|
Experimental |
| 3096 |
franciellevargas/MOL
Multilingual Offensive Lexicon consists of the first contextual lexicon for... |
|
Experimental |
| 3097 |
load1n9/chat
leverage llama3.2 and other large language models to generate responses to... |
|
Experimental |
| 3098 |
mfekadu/nimbus-transformer
it's like Nimbus but uses a transformer language model |
|
Experimental |
| 3099 |
s4um1l/aya-cross-lingual-probe
Mechanistic interpretability of cross-lingual concept representations in... |
|
Experimental |
| 3100 |
GregorKobsik/ImageTransformer
This notebook shows a basic implementation of a transformer (decoder)... |
|
Experimental |