All Transformer Models

6,429 models ranked by quality score · Page 31 of 65

Showing 3001–3100 of 6,429
# Model Score Tier
3001 xufangzhi/Symbol-LLM

[ACL 2024] The project of Symbol-LLM

24
Experimental
3002 visresearch/LLaVA-STF

The official implementation of "Learning Compact Vision Tokens for Efficient...

24
Experimental
3003 Qingfeng-233/KeyAtten

KeyAtten: Attention-based Zero-Shot Keyword & Keyphrase Extraction

24
Experimental
3004 Asha-Gutlapalli/Drug-Recommendation-System-based-on-the-Condition-of-the-Patient-using-BERT

Patients are recommended drugs based on their condition and reviews of the...

24
Experimental
3005 is-leeroy-jenkins/Mathy

Machine-learning algorithms for pre-processing, classification, regression,...

24
Experimental
3006 cleopatra-itn/claim_detection

Code for tasks in the paper "Check\_square at CheckThat! 2020: Claim...

24
Experimental
3007 Curated-Awesome-Lists/Awesome-Llama3

A curated, awesome list of resources, tools, and projects for the AI Large...

24
Experimental
3008 NC0DER/GreekT5

A series of Greek News Summarization Sequence-to-Sequence Models built with...

24
Experimental
3009 artpli/CodeIE

[ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot...

24
Experimental
3010 garyb9/pytorch-transformers

Transformers architecture code playground repository in python using PyTorch.

24
Experimental
3011 marqinhos/MedicalLiverSegmentationToolKit

Medical Toolkit for Liver Volume Segmentation

24
Experimental
3012 bobazooba/shurale

Conversation AI model for open domain dialogs

24
Experimental
3013 CServinL/tbot

A multimodal AI bot for your terminal

24
Experimental
3014 dinhquy-nguyen-1704/ZaloAI2023-Elementary-Math-Solving

Baseline achieving 0.8 accuracy on the private test set in the ZaloAI...

24
Experimental
3015 Human-Centric-Machine-Learning/counterfactual-llms

Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.

24
Experimental
3016 NamrataThakur/Large_Language_Model_From_Scratch_Implementation

Implementing an LLM from scratch block-by-block using PyTorch

24
Experimental
3017 ayaka14732/TrAVis

TrAVis: Visualise BERT attention in your browser

24
Experimental
3018 byroneverson/Mia

A simple swift app for MacOS/iOS to test large language models (LLM)

24
Experimental
3019 En1gma02/Proteomic-and-Genomic-Drug-Development

An end-to-end generative AI pipeline for Proteomic and Genomic drug...

24
Experimental
3020 mrcabbage972/simple-toolformer

A Python implementation of Toolformer using Huggingface Transformers

24
Experimental
3021 avatsaev/av-local-llm-api

Allows to easily run local REST API with a custom LLM, running locally or...

24
Experimental
3022 Brazilian-willametteriver232/llama.swift

🚀 Access llama.cpp easily in your Swift projects, leveraging precompiled...

24
Experimental
3023 mourga/transformer-uncertainty

Code for evaluating uncertainty estimation methods for Transformer-based...

24
Experimental
3024 GreenScreen410/LYMT

LYMT: Let Your Model Think

24
Experimental
3025 nullHawk/simple-transformer

Implementation of Transformer model in PyTorch

24
Experimental
3026 liziniu/policy_optimization

Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)

24
Experimental
3027 elijahnzeli1/CausalTorch

CausalTorch is a PyTorch library for building generative models with...

24
Experimental
3028 Katashynskyi/Voice_assistant_UA_EN

No api-keys | local | llama3.1 For language studying and live translation

24
Experimental
3029 LightDopper/skill-codex

🚀 Enable automated code analysis and editing with Claude Code using Codex...

24
Experimental
3030 cvedix/omnisdk

On-device AI deloper platform

24
Experimental
3031 ItzDerock/llama-playground

A simple to use and powerful web-interface to mess around with Meta's LLaMA LLM.

24
Experimental
3032 bpevangelista/vfastml

Inference and Training Engine for LLMs, Image2Image and Other Models

24
Experimental
3033 assembly-automation-hub/Issues-github-actions-Llama

🤖 GitHub Action that analyzes push/PR diffs with Llama AI and auto-creates...

24
Experimental
3034 lazy-guy/chess-llama

Tiny Llama model trained to play chess

24
Experimental
3035 ai4sd/multiscale-byte-lm

A hierarchical LM that scales to training on context windows of +5M tokens

24
Experimental
3036 xarillian/GDLlama

A working and actively maintained GDExtension for running local LLMs in...

24
Experimental
3037 Pragyan2004/Polyglot_AI

Polyglot AI is a developer-focused platform that converts visual coding...

24
Experimental
3038 Aradhye2002/selective-peft-toolkit

Official implementation of the paper "Step-by-Step Unmasking for...

24
Experimental
3039 randomtask2000/MultiShot.AI

This project creates a real-time conversational AI, either serverless via...

24
Experimental
3040 SauravP97/toy-transformer

A decoder only Transformer implementing masked attention

24
Experimental
3041 SciCrunch/bio_electra

Bio-Electra - Small and efficient discriminatively pre-trained language...

24
Experimental
3042 Giyanellow/llama-chatbot-with-ui

This project provides a comprehensive template for self-hosting a Large...

24
Experimental
3043 MNThomson/chat

chat - platform agnostic "ai" cli

24
Experimental
3044 Nutanpatil06/Fine-Tuning-LLM-with-LLaMA-Factory

Complete LoRA/QLoRA implementation using LLaMA Factory. Fine-tune models...

24
Experimental
3045 Iteranya/AktivaAI

Local LLM Discord Bot

24
Experimental
3046 hplt-project/monolingual-multilingual-instruction-tuning

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca

24
Experimental
3047 knoveleng/steering

Official repo for the paper: "Selective Steering: Norm-Preserving Control...

24
Experimental
3048 uw-swag/tokdrift

Repository for TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar.

24
Experimental
3049 Ketis21/KetisBot

KetisBot is a powerful Discord AI chatbot using KoboldCpp for text...

24
Experimental
3050 SkillichSE/Lumi-bot

A Telegram bot powered by aiogram integrated with a local LLM (LM Studio)....

24
Experimental
3051 piratheon/LB-llm_training_scripts

A bunch of script to train your own offsec LLM

24
Experimental
3052 Koziev/LM-pretrain

Char-level language model pretraining code and scripts

24
Experimental
3053 tripathiarpan20/self-improvement-4all

Private self-improvement coaching with open-source LLMs

24
Experimental
3054 DoctorLai/SimilarString

Compute the score of similarity between two strings

24
Experimental
3055 toriving/haafor-challenge-2020

The project for HAAFOR CHALLENGE 2020

24
Experimental
3056 NellyW8/VeriReason

This is the Github Repo for the paper: VeriReason: Reinforcement Learning...

24
Experimental
3057 ashimmortallp/mHC-manifold-constrained-hyper-connections

🔍 Explore mHC for manifold-constrained hyper-connections in PyTorch,...

24
Experimental
3058 calcuis/llama-core

solo connector core built on llama.cpp

24
Experimental
3059 gmontamat/poor-mans-transformers

Implement Transformers (and Deep Learning) from scratch in NumPy

23
Experimental
3060 jaketae/tupe

PyTorch implementation of Rethinking Positional Encoding in Language Pre-training

23
Experimental
3061 kurnevsky/llama-cpp.el

A client for llama-cpp server

23
Experimental
3062 trialandsuccess/verysimpletransformers

Very Simple Transformers provides a simplified interface for packaging,...

23
Experimental
3063 Ultron09/Mirror_mind

A production-ready adaptive meta-learning framework for continuous...

23
Experimental
3064 rishabkr/Attention-Is-All-You-Need-Explained-PyTorch

A paper implementation and tutorial from scratch combining various great...

23
Experimental
3065 LazerLambda/Promptzl

Turn LLMs into zero-shot PyTorch classifiers!

23
Experimental
3066 bandirevanth/aiml-codex

A curated collection of my AI & ML projects - crafting tomorrow’s smart...

23
Experimental
3067 Victorwz/VaLM

VaLM: Visually-augmented Language Modeling. ICLR 2023.

23
Experimental
3068 Boykadakim/User-Clustering-with-BERT-Models

User Clustering Pipelines with BERT Models on Long and Heterogeneous Tweets...

23
Experimental
3069 seanpm2001/DALL-E_LLaMA

🤖️🦙️🧠️ DALL-E LLaMA is a combination of DALL-E and LLaMA (Large Language...

23
Experimental
3070 tunib-ai/joker

AI model designed to test the effectiveness in handling external ethical attacks.

23
Experimental
3071 Uokoroafor/transformer_from_scratch

This is a PyTorch implementation of the Transformer model in the paper...

23
Experimental
3072 iqbal-sk/Detecting-Persuasion-Techniques-in-Memes

Hierarchical, multilingual, multimodal detection of persuasion techniques in...

23
Experimental
3073 kyegomez/open_qwen

A non-official implementation of Qwen 3.5, as there doesn’t seem to be a...

23
Experimental
3074 PCfVW/plip-rs

Mechanistic interpretability toolkit for code LLMs, in Rust. Analysis of...

23
Experimental
3075 unsanitary-bek/mlx-skills

🚀 Enhance your machine learning workflow with essential MLX skills from this...

23
Experimental
3076 bowen-upenn/llm_token_bias

[EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet...

23
Experimental
3077 lucky-verma/SaastIE

Document understanding system using Donut transformer architecture

23
Experimental
3078 th789/mbr-for-nmt

Characterizing the performance of minimum Bayes risk (MBR) decoding for...

23
Experimental
3079 dhakalnirajan/LLaMA-BitNet

LLaMA-BitNet is a repository dedicated to empowering users to train their...

23
Experimental
3080 seanpm2001/DALL-E_LLaMA_Docs

🤖️🦙️🧠️📖️ The official documentation source repository for DALL-E LLaMA, a...

23
Experimental
3081 Spico197/MoE-SFT

🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction...

23
Experimental
3082 sitammeur/gliner-litserve

Leverage ModernGLiNER's capabilities using LitServe.

23
Experimental
3083 liuqidong07/Awesome-LLM-Enhanced-Recommender-Systems

[KDD'25] Large Language Model Enhanced Recommender Systems: Methods,...

23
Experimental
3084 yihedeng9/rlhf-summary-notes

A brief and partial summary of RLHF algorithms.

23
Experimental
3085 naderabdelghany/project-rev

A proof-of-concept audio-interactive personalized chatbot based on Ted...

23
Experimental
3086 neeleshbhalla/transformers_for_time_series_forecasting

Inferencing 'PatchTST' and 'Informer' to harness the power of transformers...

23
Experimental
3087 s-omranpour/MIDI-Transformer

Another implementation of the paper "Compound Word Transformer: Learning to...

23
Experimental
3088 sarnikowski/danish_transformers

A collection of Danish Transformers

23
Experimental
3089 avrtt/QASATIK

LLM-based Q&A on preloaded docs, raw data, Wikipedia articles and scraped...

23
Experimental
3090 twitter-research/multilingual-alignment-tpp

Code for reproducing the paper Improved Multilingual Language Model...

23
Experimental
3091 Stamir36/CursusAI-ChatBot

Chatbot based on artificial intelligence (AI) for communication, image...

23
Experimental
3092 raimbekovm/cs231n-2025-notes

📚 Comprehensive lecture notes for Stanford CS231n: Deep Learning for...

23
Experimental
3093 declare-lab/della

DELLA-Merging: Reducing Interference in Model Merging through...

23
Experimental
3094 Beomi/megatronlm_dataset_autotokenizer

Megatron-LM/GPT-NeoX compatible Text Encoder with 🤗Transformers AutoTokenizer.

23
Experimental
3095 sahsaeedi/TPO

[TMLR] Triple Preference Optimization

23
Experimental
3096 franciellevargas/MOL

Multilingual Offensive Lexicon consists of the first contextual lexicon for...

23
Experimental
3097 load1n9/chat

leverage llama3.2 and other large language models to generate responses to...

23
Experimental
3098 mfekadu/nimbus-transformer

it's like Nimbus but uses a transformer language model

23
Experimental
3099 s4um1l/aya-cross-lingual-probe

Mechanistic interpretability of cross-lingual concept representations in...

23
Experimental
3100 GregorKobsik/ImageTransformer

This notebook shows a basic implementation of a transformer (decoder)...

23
Experimental
« Prev 1 2 3 29 30 31 32 33 63 64 65 Next »