All Transformer Models

6,429 models ranked by quality score · Page 24 of 65

Showing 2301–2400 of 6,429
# Model Score Tier
2301 CognitiveAISystems/RATE

[ICLR 2026] Official implementation of Recurrent Action Transformer with...

30
Emerging
2302 DAMO-NLP-SG/LLM-Multilingual-Knowledge-Boundaries

[ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across...

30
Emerging
2303 sandyresearch/chipmunk

🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E...

30
Emerging
2304 gpustack/gguf-packer-go

Deliver LLMs of GGUF format via Dockerfile.

30
Emerging
2305 Harish25/StudyScreeningLanguageModel

Core LLM for M.A.R.S. (Model Assisted Review System). Utilizes fine-tuned...

30
Emerging
2306 whucs21Mzy/Model-Phase-Transitions

Navigating Model Phase Transitions to Enable Extreme Lossless Compression: A...

30
Emerging
2307 ahazeemi/dPrune

🌿 dPrune: A Framework for Data Pruning

30
Emerging
2308 partarstu/transformers-in-java

Experimental project for AI and NLP based on Transformer Architecture

30
Emerging
2309 OSUPCVLab/MobileUNETR

Official Implementation of MobileUNETR: A Lightweight End-To-End Hybrid...

30
Emerging
2310 gentaiscool/miners

MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual...

30
Emerging
2311 mubingshen/MLC-SLM-Baseline

The project is associated with the recently-launched INTERSPEECH 2025...

30
Emerging
2312 ulab-uiuc/Time-R1

Time-R1: Framework and resources for endowing LLMs with comprehensive...

30
Emerging
2313 DebeshJha/TransNetR

Official implementation of TransNetR: Transformer-based Residual Network for...

30
Emerging
2314 gersteinlab/Struc-Bench

[NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating...

30
Emerging
2315 bfilar/URLTran

PyTorch/HuggingFace Implementation of URLTran: Improving Phishing URL...

30
Emerging
2316 ShinoharaHare/LLM-Training

A distributed training framework for large language models powered by Lightning.

30
Emerging
2317 chziakas/redeval

A library for red-teaming LLM applications with LLMs.

30
Emerging
2318 kyegomez/MC-ViT

Implementation of the model: "(MC-ViT)" from the paper: "Memory...

30
Emerging
2319 trekhleb/homemade-gpt-js

A minimal TensorFlow.js re-implementation of Karpathy's minGPT (Generative...

30
Emerging
2320 NVlabs/HMAR

[CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation

30
Emerging
2321 LMLK-seal/HuggingGGUF

Hugging Face Model downloader and GGUF Converter.

30
Emerging
2322 hukenovs/slovo

Slovo: Russian Sign Language Dataset and Models

30
Emerging
2323 theosorus/GPT2-Hasktorch

GPT2 implementation in Haskell with the Hasktorch library, inspired by...

30
Emerging
2324 westlake-repl/NRPStransformer

A Transformer-Based Predictor for Nonribosomal Peptide Synthetases (NRPS)...

30
Emerging
2325 hsisaberi/single-trait-electra

A complete ELECTRA-based framework for Big Five personality trait...

30
Emerging
2326 xmindflow/SSCT

[ICCV 2023] Self-supervised Semantic Segmentation: Consistency over Transformation

29
Experimental
2327 seedatnabeel/CLLM

Curated LLM (ICML 2024)

29
Experimental
2328 crux82/CLiC-it_2023_tutorial

This repository hosts materials from the CLiC-IT 2023 tutorial

29
Experimental
2329 THU-KEG/WaterBench

[ACL2024-Main] Data and Code for WaterBench: Towards Holistic Evaluation of...

29
Experimental
2330 declare-lab/MM-Align

[EMNLP 2022] This repository contains the official implementation of the...

29
Experimental
2331 PromptMixerDev/prompt-mixer-ollama-connector

Ollama Connector

29
Experimental
2332 saeeddhqan/tiny-transformer

Tiny transformer models implemented in pytorch.

29
Experimental
2333 qizhou000/UniEdit

[NeurIPS 2025 B & D] UniEdit: A Unified Knowledge Editing Benchmark for...

29
Experimental
2334 OSU-STARLAB/Simul-LLM

[ACL 2024] An easily extensible framework for simultaneous, text-to-text...

29
Experimental
2335 francoislanc/midistral

LLM finetuned for generating symbolic music

29
Experimental
2336 ASSERT-KTH/agentic-evals-lab

Framework for training and evaluating LLMs with reinforcement learning in...

29
Experimental
2337 wafflecomposite/langchain-ask-pdf-local

An AI-app that allows you to upload a PDF and ask questions about it. It...

29
Experimental
2338 Aloereed/llama.cpp-server-ohos

Llama.cpp server for OpenHarmony

29
Experimental
2339 antonyvigouret/Pay-Attention-to-MLPs

My implementation of the gMLP model from the paper "Pay Attention to MLPs".

29
Experimental
2340 sashazykov/json-repair-rb

A simple Ruby gem designed to repair broken JSON strings

29
Experimental
2341 Selozhd/FNet-tensorflow

Tensorflow Implementation of "FNet: Mixing Tokens with Fourier Transforms."

29
Experimental
2342 amazon-science/llm-code-preference

Training and Benchmarking LLMs for Code Preference.

29
Experimental
2343 GiannakopoulosIlias/vision-transformer-network-for-mr-electrical-properties-tomography

A 3D Vision Transformer-based neural network for reconstructing electrical...

29
Experimental
2344 dev-sufyaan/Nexlify

Unified API platform for free access to enterprise-grade AI models from...

29
Experimental
2345 krnel-ai/krnel-graph

Lightweight representation engineering dataflow operations for agent developers.

29
Experimental
2346 lrusso/llama3pure

Three inference engines for Llama 3: pure C for desktop systems, pure...

29
Experimental
2347 mehdihosseinimoghadam/AVA-Llama-3

Fine-Tuned Llama 3 Persian Large Language Model LLM / Persian Llama 3

29
Experimental
2348 Ludobico/KakaoChatData

카카오톡 대화 데이터셋

29
Experimental
2349 zwhe99/X-SIR

[ACL 2024] Can Watermarks Survive Translation? On the Cross-lingual...

29
Experimental
2350 wschella/llm-reliability

Code for the paper "Larger and more instructable language models become less...

29
Experimental
2351 abenechehab/dicl

[ICLR 2025] Official implementation of DICL (Disentangled In-Context...

29
Experimental
2352 corl-team/lime

Official implementation of the paper "You Do Not Fully Utilize Transformer's...

29
Experimental
2353 GeorgeMichailidis/multi-task-mixed-freq

Code repository for "Multi-Task Encoder-Dual-Decoder Modeling Framework on...

29
Experimental
2354 martin-wey/peft-llm-code

Replication package of the paper "Exploring Parameter-Efficient Fine-Tuning...

29
Experimental
2355 frankaging/ReCOGS

ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of...

29
Experimental
2356 Gary3410/TaPA

[arXiv 2023] Embodied Task Planning with Large Language Models

29
Experimental
2357 Silvestre17/BDA_AmazonReviews_DatabricksPySparkAnalysis_MasterProject

🛍️ Big Data project analyzing Amazon tech reviews using Databricks, PySpark,...

29
Experimental
2358 pluja/maestro

Turn natual language into commands. Your CLI tasks, now as easy as a...

29
Experimental
2359 sigeisler/reinforce-attacks-llms

REINFORCE Adversarial Attacks on Large Language Models: An Adaptive,...

29
Experimental
2360 teddykoker/grokking

PyTorch implementation of "Grokking: Generalization Beyond Overfitting on...

29
Experimental
2361 surrey-nlp/PLOD-AbbreviationDetection

This repository contains the PLOD Dataset for Abbreviation Detection...

29
Experimental
2362 kyegomez/M2PT

Implementation of M2PT in PyTorch from the paper: "Multimodal Pathway:...

29
Experimental
2363 antofuller/configaformers

A python library for highly configurable transformers - easing model...

29
Experimental
2364 DreamerGPT/DreamerGPT

🌱 梦想家(DreamerGPT):中文大语言模型指令精调

29
Experimental
2365 WangRongsheng/Chinese-LLaMA-Alpaca-Usage

📔 对Chinese-LLaMA-Alpaca进行使用说明和核心代码注解

29
Experimental
2366 Praveengovianalytics/falcon-evaluate

Falcon Evaluate is an open-source Python library aims to revolutionise the...

29
Experimental
2367 sampathkethineedi/bert-topic-sentiment

Topic Based Sentiment Detection using BERT

29
Experimental
2368 Am1n3e/active-learning-transformer

A hands-on tutorial on how to use Active Learning with Transformer models.

29
Experimental
2369 mcbal/spin-model-transformers

Physics-inspired transformer modules based on mean-field dynamics of...

29
Experimental
2370 PRIME-RL/Entropy-Mechanism-of-RL

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

29
Experimental
2371 kaist-cvml/I-HallA-v1.0

[AAAI 2025] Official Implementation of I-HallA v1.0

29
Experimental
2372 avilum/llama-saas

A client/server for LLaMA (Large Language Model Meta AI) that can run ANYWHERE.

29
Experimental
2373 camenduru/alpaca-lora-colab

Alpaca Lora

29
Experimental
2374 fabienfrfr/tptt

😊 TPTT: Transforming Pretrained Transformers into Titans

29
Experimental
2375 MaxiDonkey/DelphiGroqCloud

The GroqCloud API wrapper for Delphi provides access to models from Meta,...

29
Experimental
2376 AristotelisPap/Question-Answering-with-BERT-and-Knowledge-Distillation

Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD)...

29
Experimental
2377 zjunlp/DynamicKnowledgeCircuits

[ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits...

29
Experimental
2378 shahriargolchin/time-travel-in-llms

The official repository for the paper entitled "Time Travel in LLMs: Tracing...

29
Experimental
2379 teilomillet/retrain

a Python library that uses Reinforcement Learning (RL) to train LLMs.

29
Experimental
2380 davidjosipovic/news-trend-analysis

Automated NLP pipeline for news analysis with sentiment detection, topic...

29
Experimental
2381 cakshat/AlloyBERT

Introducing AlloyBERT: a transformer encoder-based model for predicting...

29
Experimental
2382 ai-forever/model-zoo

NLP model zoo for Russian

29
Experimental
2383 actypedef/ARCQuant

Code for the paper "ARCQuant: Boosting NVFP4 Quantization with Augmented...

29
Experimental
2384 LoserCheems/WonderfulMatrices

Wonderful Matrices to Build Small Language Models

29
Experimental
2385 abcsys/libem

Compound AI toolchain for fast and accurate entity matching, powered by LLMs.

29
Experimental
2386 jinzhuoran/RWKU

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language...

29
Experimental
2387 zrr1999/emotion-recognition

多模态情绪识别方法研究(Multimodal Emotion Recognition)

29
Experimental
2388 alphasecio/groq

A Streamlit chatbot with memory for running open-source text models on Groq.

29
Experimental
2389 sitammeur/qwen2.5-web

Qwen2.5 Instruct, large language model, operates within web browsers via 🤗...

29
Experimental
2390 xlang-ai/text2reward

[ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for...

29
Experimental
2391 ananttripathi/Resume-Analyzer-MLOps

Resume Analyzer is an AI-powered MLOps platform that optimizes your resume...

29
Experimental
2392 DataArcTech/ChartMoE

[ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for...

29
Experimental
2393 astorfi/LLM-Alignment-Project

A comprehensive template for aligning large language models (LLMs) using...

29
Experimental
2394 SALT-NLP/Adaptive-Compositional-Modules

Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive...

29
Experimental
2395 GU-DataLab/stance-detection-KE-MLM

Official resource of the paper "Knowledge Enhanced Masked Language Model for...

29
Experimental
2396 jaketae/vit-breast-cancer

Transfer learning pretrained vision transformers for breast histopathology

29
Experimental
2397 affjljoo3581/Inverse-DALL-E-for-Optical-Character-Recognition

Inverse DALL-E for Optical Character Recognition

29
Experimental
2398 MileBench/MileBench

This repo contains evaluation code for the paper "MileBench: Benchmarking...

29
Experimental
2399 gentaiscool/few-shot-lm

The source code of "Language Models are Few-shot Multilingual Learners" (MRL...

29
Experimental
2400 jordandeklerk/SwinViT

Modified Swin Transformer model in PyTorch on CIFAR-10 for image classification

29
Experimental
« Prev 1 2 3 22 23 24 25 26 63 64 65 Next »