All Transformer Models
6,429 models ranked by quality score · Page 26 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 2501 |
ai8hyf/llm_split_recall_test
Split and Recall: A simple and efficient benchmark to evaluate in-context... |
|
Experimental |
| 2502 |
fattorib/fusedswiglu
Fused SwiGLU Triton kernels |
|
Experimental |
| 2503 |
tnsaai/OpenArchX-BETA
Official Repo of OpenArchX Framework. |
|
Experimental |
| 2504 |
LaMP-Benchmark/LaMP
Codes for papers on Large Language Models Personalization (LaMP) |
|
Experimental |
| 2505 |
tgautam03/Transformers
A Gentle Introduction to Transformers Neural Network |
|
Experimental |
| 2506 |
Anshita1Saxena/transformer_time_series_forecasting
Transformers applied on Time Series Forecasting |
|
Experimental |
| 2507 |
IDSIA/recurrent-fwp
Official repository for the paper "Going Beyond Linear Transformers with... |
|
Experimental |
| 2508 |
X-iZhang/CCD
📷 CCD: Mitigating Hallucinations in Radiology MLLMs via Clinical Contrastive... |
|
Experimental |
| 2509 |
Furyton/awesome-language-model-analysis
This paper list focuses on the theoretical and empirical analysis of... |
|
Experimental |
| 2510 |
TobyYang7/Llava_Qwen2
Visual Instruction Tuning for Qwen2 Base Model |
|
Experimental |
| 2511 |
tsinghua-fib-lab/AAAI2025_MIA-Tuner
[AAAI'25 Oral] "MIA-Tuner: Adapting Large Language Models as Pre-training... |
|
Experimental |
| 2512 |
BoHuangLab/Protein-Localization-Transformer
Code for CELL-E: Biological Zero-Shot Text-to-Image Synthesis for Protein... |
|
Experimental |
| 2513 |
SkywalkerLuke/TransHLA
TransHLA: A hybrid transformer model for peptide-HLA epitope detection. |
|
Experimental |
| 2514 |
hhy-huang/GraphJudge
[EMNLP'25 main] This is the official repo for the paper, Can LLMs be Good... |
|
Experimental |
| 2515 |
Akshint0407/Automated-Answer-Checker
AI-powered grading system for educators 🔹 Streamlit web app that automates... |
|
Experimental |
| 2516 |
FuxiaoLiu/VisualNews-Repository
[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning |
|
Experimental |
| 2517 |
Cre4T3Tiv3/unsloth-llama3-alpaca-lora
Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with... |
|
Experimental |
| 2518 |
Beomi/easy-lm-trainer
🤗 최소한의 세팅으로 LM을 학습하기 위한 샘플코드 |
|
Experimental |
| 2519 |
srvCodes/continual_learning_with_vit
Code for our CVPR 2022 workshop paper "Towards Exemplar-Free Continual... |
|
Experimental |
| 2520 |
apanariello4/merge-and-rebase
Model merging, task-vector rebasin, and fine-tuning for vision and LLM models. |
|
Experimental |
| 2521 |
StringNLPLAB/MGS
Repository for the paper "Advancing General-Purpose Reasoning Models with... |
|
Experimental |
| 2522 |
linydub/azureml-greenai-txtsum
Samples for fine-tuning HuggingFace models with AzureML |
|
Experimental |
| 2523 |
InflixOP/ContentSnap
ContentSnap is a powerful browser extension that leverages cutting-edge NLP... |
|
Experimental |
| 2524 |
KillerShoaib/RLM-From-Scratch
Implementation of Recursive Language Model paper from scratch |
|
Experimental |
| 2525 |
LehengTHU/AlphaRec
[ICLR 2025 Oral 🏆] The implementation of paper "Language Representations Can... |
|
Experimental |
| 2526 |
TamSiuhin/OPPU
Official Implementation of "Democratizing Large Language Models via... |
|
Experimental |
| 2527 |
TayeeChang/keras_transformers
the implement of transformer family such as bert, alber, roberta, nezha, etc. |
|
Experimental |
| 2528 |
Beomi/exbert-transformers
exBERT on Transformers🤗 |
|
Experimental |
| 2529 |
psychbruce/FMAT
😷 The Fill-Mask Association Test (FMAT): Measuring Propositions in Natural Language. |
|
Experimental |
| 2530 |
Warren-SJ/SLAM3R
A study of the research paper SLAM3R:Real-Time Dense Scene Reconstruction... |
|
Experimental |
| 2531 |
nipunsadvilkar/roberta-base-mr
RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x ... |
|
Experimental |
| 2532 |
HacktivSpace/multidisciplinary-deepfake-detection
A solution for deepfake detection across multiple modalities, including... |
|
Experimental |
| 2533 |
nlp-with-transformers/website
Website for the Natural Language Processing with Transformers book |
|
Experimental |
| 2534 |
kaistAI/LangBridge
[ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision |
|
Experimental |
| 2535 |
CogitoNTNU/course-on-large-language-models
This is a course on how to to program with Large Language Models. |
|
Experimental |
| 2536 |
ngoanpv/llama2_vietnamese
A fine-tuned Large Language Model (LLM) for the Vietnamese language based on... |
|
Experimental |
| 2537 |
DEV-D-GR8/SignSense
This repository contains a transformer-based model for real-time American... |
|
Experimental |
| 2538 |
sam575/axial-gan
Code for "Simultaneous Face Hallucination and Translation for Thermal to... |
|
Experimental |
| 2539 |
XavierZXY/Zero2Hero
从0到1学习大模型 |
|
Experimental |
| 2540 |
bishwenduk029/anyscale-chat
Vercel AI chatbot with Anyscale endpoints |
|
Experimental |
| 2541 |
aws-samples/sample-for-multi-modal-document-to-json-with-sagemaker-ai
This open-source project delivers a complete pipeline for converting... |
|
Experimental |
| 2542 |
RAHB-REALTORS-Association/email-autodrafts
Email Auto-ReplAI is a Python tool that uses AI to automate drafting... |
|
Experimental |
| 2543 |
RobinSmits/Dutch-LLMs
Various training, inference and validation code and results related to Open... |
|
Experimental |
| 2544 |
AmericanPresidentJimmyCarter/yal-discord-bot
Yet Another LLaMA/ALPACA Discord Bot |
|
Experimental |
| 2545 |
nubs4dayz/company-classification-research
This project explores the use of various NLP techniques to classify... |
|
Experimental |
| 2546 |
bloomberg/MixCE-acl2023
Implementation of MixCE method described in ACL 2023 paper by Zhang et al. |
|
Experimental |
| 2547 |
izmttk/ullm
Lightweight LLM inference engine inspired by nano-vllm, with radix-tree... |
|
Experimental |
| 2548 |
honghanhh/fsdl_2022_solution
Solution of Full Stack Deep Learning - Course 2022 |
|
Experimental |
| 2549 |
CLDiego/SPE_GeoHackathon_2025
Foundational bootcamp on LLM usage (prompting & inference) → tooling &... |
|
Experimental |
| 2550 |
LuluW8071/Text-Sentiment-Analysis
Text Sentiment Analysis with RNNs Models + Additive Attention and Transformers |
|
Experimental |
| 2551 |
leliuga/cohere-configurations
Co:Here Inference configurations |
|
Experimental |
| 2552 |
SlytherinGe/RSTeller
Vision-Language Dataset for Remote Sensing |
|
Experimental |
| 2553 |
rekalantar/MedSegmentAnything_SAM_LungCT
The code to finetune SAM with bounding box prompt for segmentation of the lungs on CT |
|
Experimental |
| 2554 |
ManashJKonwar/NLP-Transformers
Transformer (BERT, GPT2, etc.) based Training Module for popular NLP tasks |
|
Experimental |
| 2555 |
GhTara/Dose_Prediction
A Cascade Transformer-based Model for 3D Dose Distribution Prediction in... |
|
Experimental |
| 2556 |
cifkao/context-probing
Black-box language model explanation by context length probing |
|
Experimental |
| 2557 |
haozheji/exact-optimization
ICML 2024 - Official Repository for EXO: Towards Efficient Exact... |
|
Experimental |
| 2558 |
florist-notes/aicore_n
Artificial Intelligence > Machine Learning > Deep Learning |
|
Experimental |
| 2559 |
intel/document-level-sentiment-analysis
Document Level Sentiment Analysis is an End-to-End deep learning workflow... |
|
Experimental |
| 2560 |
forgi86/sysid-transformers-transfer
Code of the paper "On the adaptation of in-context learners for system... |
|
Experimental |
| 2561 |
kuvaus/llama-chat
Simple chat program for LLaMa models |
|
Experimental |
| 2562 |
Marker-Inc-Korea/KO-Platypus
[KO-Platy🥮] Korean-Open-platypus를 활용하여 llama-2-ko를 fine-tuning한 KO-platypus model |
|
Experimental |
| 2563 |
Agora-Lab-AI/HydraNet
HydraNet is a state-of-the-art transformer architecture that combines... |
|
Experimental |
| 2564 |
CristiVlad25/ai-papers
Tracing the evolution of AI and large language models from early neural... |
|
Experimental |
| 2565 |
jmnolte/HCCNet
Early prediction of liver cancer using longitudinal MRI |
|
Experimental |
| 2566 |
maifeeulasad/LocalLLaMA
📚 LocalLLaMA Archive — Community-powered static archive for r/LocalLLaMA |
|
Experimental |
| 2567 |
bayartsogt-ya/albert-mongolian
ALBERT trained on Mongolian text corpus |
|
Experimental |
| 2568 |
lechmazur/bazaar
The BAZAAR challenges LLMs to navigate the double-auction marketplace, where... |
|
Experimental |
| 2569 |
an-yongqi/systematic-outliers
[ICLR 2025] Systematic Outliers in Large Language Models. |
|
Experimental |
| 2570 |
guyoung/AIMatrices
AIMatrices is a lightweight, high-performance, scalable, and open source AI... |
|
Experimental |
| 2571 |
ilya16/deephumor
DeepHumor: Image-based Meme Generation using Deep Learning |
|
Experimental |
| 2572 |
ScottyWITHBIGD/DGA_Diagnostic
🔍 Automate dissolved gas analysis for transformer health assessment with a... |
|
Experimental |
| 2573 |
muna-ai/muna-predictors
Interesting Python functions compiled to run anywhere with Muna. |
|
Experimental |
| 2574 |
bobazooba/xllm-demo
Demo project using XLLM |
|
Experimental |
| 2575 |
joeljang/continual-knowledge-learning
[ICLR 2022] Towards Continual Knowledge Learning of Language Models |
|
Experimental |
| 2576 |
JarvisPei/MemDLM
MemDLM: Memory-enhanced Diffusion Language Model |
|
Experimental |
| 2577 |
Adversing/hf-model-checker
A tool to analyze HuggingFace models and determine their compatibility with... |
|
Experimental |
| 2578 |
Hamtech-ai/Persian-Image-Captioning
A Persian Image Captioning model based on Vision Encoder Decoder Models of... |
|
Experimental |
| 2579 |
jaygala24/fed-hate-speech
The official code repository for the paper titled "A Federated Approach for... |
|
Experimental |
| 2580 |
Meaquadddd/DPO-Shift
DPO-Shift: Shifting the Distribution of Direct Preference Optimization |
|
Experimental |
| 2581 |
The-Martyr/Awesome-Modality-Priors-in-MLLMs
Latest Advances on Modality Priors in Multimodal Large Language Models |
|
Experimental |
| 2582 |
kingabzpro/French-to-Fongbe-and-Ewe-MT
The objective of this challenge is to create a machine translation system... |
|
Experimental |
| 2583 |
jlamprou/Infini-Attention
Efficient Infinite Context Transformers with Infini-attention Pytorch... |
|
Experimental |
| 2584 |
xmindflow/MMCFormer
[MIDL 2023] MMCFormer: Missing Modality Compensation Transformer for Brain... |
|
Experimental |
| 2585 |
jmaczan/tiny-vllm
High performance LLM inference engine, a younger sibling of vLLM |
|
Experimental |
| 2586 |
Agora-Lab-AI/OmniByteGPT
An implementation of an all-new foundation model architecture that trains on... |
|
Experimental |
| 2587 |
tojiboyevf/image_captioning
Deep Learning Final project 2022 |
|
Experimental |
| 2588 |
datatrigger/nlp_hugging_face
Text classification with the transformers library from Hugging Face, by... |
|
Experimental |
| 2589 |
shahrukhx01/bert-probe
BERT Probe: A python package for probing attention based robustness to... |
|
Experimental |
| 2590 |
gao-g/prelude
Code for the paper "Aligning LLM Agents by Learning Latent Preference from... |
|
Experimental |
| 2591 |
yfedoseev/llmkit
Production-grade LLM client - Rust, Python, TypeScript. 100+ providers,... |
|
Experimental |
| 2592 |
codepawl/turboquant-torch
Unofficial PyTorch implementation of TurboQuant (Google Research, ICLR... |
|
Experimental |
| 2593 |
EvilFreelancer/rugpt3-custom
Pre-training custom ruGPT3 model on books written by F.M. Dostoevski |
|
Experimental |
| 2594 |
mpociot/llamero
A GUI application to easily try out Facebook's LLaMA models. |
|
Experimental |
| 2595 |
TingjiaInFuture/pixrep
Let LLMs see your codebase just like you do. |
|
Experimental |
| 2596 |
feifeibear/Odysseus-Transformer
Odysseus: Playground of LLM Sequence Parallelism |
|
Experimental |
| 2597 |
hydropix/AutoDescribe-Images
Tool to automatically generate text descriptions for images using Ollama... |
|
Experimental |
| 2598 |
zhuang-li/SCAR
[ACL 2025 main] SCAR: Data Selection via Style Consistency-Aware Response... |
|
Experimental |
| 2599 |
NiuTrans/LaMaTE
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine... |
|
Experimental |
| 2600 |
ma2za/torch-adapters
Small Library of PyTorch Adaptation modules |
|
Experimental |