All Transformer Models

6,429 models ranked by quality score · Page 26 of 65

Showing 2501–2600 of 6,429
# Model Score Tier
2501 ai8hyf/llm_split_recall_test

Split and Recall: A simple and efficient benchmark to evaluate in-context...

28
Experimental
2502 fattorib/fusedswiglu

Fused SwiGLU Triton kernels

28
Experimental
2503 tnsaai/OpenArchX-BETA

Official Repo of OpenArchX Framework.

28
Experimental
2504 LaMP-Benchmark/LaMP

Codes for papers on Large Language Models Personalization (LaMP)

28
Experimental
2505 tgautam03/Transformers

A Gentle Introduction to Transformers Neural Network

28
Experimental
2506 Anshita1Saxena/transformer_time_series_forecasting

Transformers applied on Time Series Forecasting

28
Experimental
2507 IDSIA/recurrent-fwp

Official repository for the paper "Going Beyond Linear Transformers with...

28
Experimental
2508 X-iZhang/CCD

📷 CCD: Mitigating Hallucinations in Radiology MLLMs via Clinical Contrastive...

28
Experimental
2509 Furyton/awesome-language-model-analysis

This paper list focuses on the theoretical and empirical analysis of...

28
Experimental
2510 TobyYang7/Llava_Qwen2

Visual Instruction Tuning for Qwen2 Base Model

28
Experimental
2511 tsinghua-fib-lab/AAAI2025_MIA-Tuner

[AAAI'25 Oral] "MIA-Tuner: Adapting Large Language Models as Pre-training...

28
Experimental
2512 BoHuangLab/Protein-Localization-Transformer

Code for CELL-E: Biological Zero-Shot Text-to-Image Synthesis for Protein...

28
Experimental
2513 SkywalkerLuke/TransHLA

TransHLA: A hybrid transformer model for peptide-HLA epitope detection.

28
Experimental
2514 hhy-huang/GraphJudge

[EMNLP'25 main] This is the official repo for the paper, Can LLMs be Good...

28
Experimental
2515 Akshint0407/Automated-Answer-Checker

AI-powered grading system for educators 🔹 Streamlit web app that automates...

28
Experimental
2516 FuxiaoLiu/VisualNews-Repository

[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning

28
Experimental
2517 Cre4T3Tiv3/unsloth-llama3-alpaca-lora

Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with...

28
Experimental
2518 Beomi/easy-lm-trainer

🤗 최소한의 세팅으로 LM을 학습하기 위한 샘플코드

28
Experimental
2519 srvCodes/continual_learning_with_vit

Code for our CVPR 2022 workshop paper "Towards Exemplar-Free Continual...

28
Experimental
2520 apanariello4/merge-and-rebase

Model merging, task-vector rebasin, and fine-tuning for vision and LLM models.

28
Experimental
2521 StringNLPLAB/MGS

Repository for the paper "Advancing General-Purpose Reasoning Models with...

28
Experimental
2522 linydub/azureml-greenai-txtsum

Samples for fine-tuning HuggingFace models with AzureML

28
Experimental
2523 InflixOP/ContentSnap

ContentSnap is a powerful browser extension that leverages cutting-edge NLP...

28
Experimental
2524 KillerShoaib/RLM-From-Scratch

Implementation of Recursive Language Model paper from scratch

28
Experimental
2525 LehengTHU/AlphaRec

[ICLR 2025 Oral 🏆] The implementation of paper "Language Representations Can...

28
Experimental
2526 TamSiuhin/OPPU

Official Implementation of "Democratizing Large Language Models via...

28
Experimental
2527 TayeeChang/keras_transformers

the implement of transformer family such as bert, alber, roberta, nezha, etc.

28
Experimental
2528 Beomi/exbert-transformers

exBERT on Transformers🤗

28
Experimental
2529 psychbruce/FMAT

😷 The Fill-Mask Association Test (FMAT): Measuring Propositions in Natural Language.

28
Experimental
2530 Warren-SJ/SLAM3R

A study of the research paper SLAM3R:Real-Time Dense Scene Reconstruction...

28
Experimental
2531 nipunsadvilkar/roberta-base-mr

RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x ...

28
Experimental
2532 HacktivSpace/multidisciplinary-deepfake-detection

A solution for deepfake detection across multiple modalities, including...

28
Experimental
2533 nlp-with-transformers/website

Website for the Natural Language Processing with Transformers book

28
Experimental
2534 kaistAI/LangBridge

[ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision

28
Experimental
2535 CogitoNTNU/course-on-large-language-models

This is a course on how to to program with Large Language Models.

28
Experimental
2536 ngoanpv/llama2_vietnamese

A fine-tuned Large Language Model (LLM) for the Vietnamese language based on...

28
Experimental
2537 DEV-D-GR8/SignSense

This repository contains a transformer-based model for real-time American...

28
Experimental
2538 sam575/axial-gan

Code for "Simultaneous Face Hallucination and Translation for Thermal to...

28
Experimental
2539 XavierZXY/Zero2Hero

从0到1学习大模型

28
Experimental
2540 bishwenduk029/anyscale-chat

Vercel AI chatbot with Anyscale endpoints

27
Experimental
2541 aws-samples/sample-for-multi-modal-document-to-json-with-sagemaker-ai

This open-source project delivers a complete pipeline for converting...

27
Experimental
2542 RAHB-REALTORS-Association/email-autodrafts

Email Auto-ReplAI is a Python tool that uses AI to automate drafting...

27
Experimental
2543 RobinSmits/Dutch-LLMs

Various training, inference and validation code and results related to Open...

27
Experimental
2544 AmericanPresidentJimmyCarter/yal-discord-bot

Yet Another LLaMA/ALPACA Discord Bot

27
Experimental
2545 nubs4dayz/company-classification-research

This project explores the use of various NLP techniques to classify...

27
Experimental
2546 bloomberg/MixCE-acl2023

Implementation of MixCE method described in ACL 2023 paper by Zhang et al.

27
Experimental
2547 izmttk/ullm

Lightweight LLM inference engine inspired by nano-vllm, with radix-tree...

27
Experimental
2548 honghanhh/fsdl_2022_solution

Solution of Full Stack Deep Learning - Course 2022

27
Experimental
2549 CLDiego/SPE_GeoHackathon_2025

Foundational bootcamp on LLM usage (prompting & inference) → tooling &...

27
Experimental
2550 LuluW8071/Text-Sentiment-Analysis

Text Sentiment Analysis with RNNs Models + Additive Attention and Transformers

27
Experimental
2551 leliuga/cohere-configurations

Co:Here Inference configurations

27
Experimental
2552 SlytherinGe/RSTeller

Vision-Language Dataset for Remote Sensing

27
Experimental
2553 rekalantar/MedSegmentAnything_SAM_LungCT

The code to finetune SAM with bounding box prompt for segmentation of the lungs on CT

27
Experimental
2554 ManashJKonwar/NLP-Transformers

Transformer (BERT, GPT2, etc.) based Training Module for popular NLP tasks

27
Experimental
2555 GhTara/Dose_Prediction

A Cascade Transformer-based Model for 3D Dose Distribution Prediction in...

27
Experimental
2556 cifkao/context-probing

Black-box language model explanation by context length probing

27
Experimental
2557 haozheji/exact-optimization

ICML 2024 - Official Repository for EXO: Towards Efficient Exact...

27
Experimental
2558 florist-notes/aicore_n

Artificial Intelligence > Machine Learning > Deep Learning

27
Experimental
2559 intel/document-level-sentiment-analysis

Document Level Sentiment Analysis is an End-to-End deep learning workflow...

27
Experimental
2560 forgi86/sysid-transformers-transfer

Code of the paper "On the adaptation of in-context learners for system...

27
Experimental
2561 kuvaus/llama-chat

Simple chat program for LLaMa models

27
Experimental
2562 Marker-Inc-Korea/KO-Platypus

[KO-Platy🥮] Korean-Open-platypus를 활용하여 llama-2-ko를 fine-tuning한 KO-platypus model

27
Experimental
2563 Agora-Lab-AI/HydraNet

HydraNet is a state-of-the-art transformer architecture that combines...

27
Experimental
2564 CristiVlad25/ai-papers

Tracing the evolution of AI and large language models from early neural...

27
Experimental
2565 jmnolte/HCCNet

Early prediction of liver cancer using longitudinal MRI

27
Experimental
2566 maifeeulasad/LocalLLaMA

📚 LocalLLaMA Archive — Community-powered static archive for r/LocalLLaMA

27
Experimental
2567 bayartsogt-ya/albert-mongolian

ALBERT trained on Mongolian text corpus

27
Experimental
2568 lechmazur/bazaar

The BAZAAR challenges LLMs to navigate the double-auction marketplace, where...

27
Experimental
2569 an-yongqi/systematic-outliers

[ICLR 2025] Systematic Outliers in Large Language Models.

27
Experimental
2570 guyoung/AIMatrices

AIMatrices is a lightweight, high-performance, scalable, and open source AI...

27
Experimental
2571 ilya16/deephumor

DeepHumor: Image-based Meme Generation using Deep Learning

27
Experimental
2572 ScottyWITHBIGD/DGA_Diagnostic

🔍 Automate dissolved gas analysis for transformer health assessment with a...

27
Experimental
2573 muna-ai/muna-predictors

Interesting Python functions compiled to run anywhere with Muna.

27
Experimental
2574 bobazooba/xllm-demo

Demo project using XLLM

27
Experimental
2575 joeljang/continual-knowledge-learning

[ICLR 2022] Towards Continual Knowledge Learning of Language Models

27
Experimental
2576 JarvisPei/MemDLM

MemDLM: Memory-enhanced Diffusion Language Model

27
Experimental
2577 Adversing/hf-model-checker

A tool to analyze HuggingFace models and determine their compatibility with...

27
Experimental
2578 Hamtech-ai/Persian-Image-Captioning

A Persian Image Captioning model based on Vision Encoder Decoder Models of...

27
Experimental
2579 jaygala24/fed-hate-speech

The official code repository for the paper titled "A Federated Approach for...

27
Experimental
2580 Meaquadddd/DPO-Shift

DPO-Shift: Shifting the Distribution of Direct Preference Optimization

27
Experimental
2581 The-Martyr/Awesome-Modality-Priors-in-MLLMs

Latest Advances on Modality Priors in Multimodal Large Language Models

27
Experimental
2582 kingabzpro/French-to-Fongbe-and-Ewe-MT

The objective of this challenge is to create a machine translation system...

27
Experimental
2583 jlamprou/Infini-Attention

Efficient Infinite Context Transformers with Infini-attention Pytorch...

27
Experimental
2584 xmindflow/MMCFormer

[MIDL 2023] MMCFormer: Missing Modality Compensation Transformer for Brain...

27
Experimental
2585 jmaczan/tiny-vllm

High performance LLM inference engine, a younger sibling of vLLM

27
Experimental
2586 Agora-Lab-AI/OmniByteGPT

An implementation of an all-new foundation model architecture that trains on...

27
Experimental
2587 tojiboyevf/image_captioning

Deep Learning Final project 2022

27
Experimental
2588 datatrigger/nlp_hugging_face

Text classification with the transformers library from Hugging Face, by...

27
Experimental
2589 shahrukhx01/bert-probe

BERT Probe: A python package for probing attention based robustness to...

27
Experimental
2590 gao-g/prelude

Code for the paper "Aligning LLM Agents by Learning Latent Preference from...

27
Experimental
2591 yfedoseev/llmkit

Production-grade LLM client - Rust, Python, TypeScript. 100+ providers,...

27
Experimental
2592 codepawl/turboquant-torch

Unofficial PyTorch implementation of TurboQuant (Google Research, ICLR...

27
Experimental
2593 EvilFreelancer/rugpt3-custom

Pre-training custom ruGPT3 model on books written by F.M. Dostoevski

27
Experimental
2594 mpociot/llamero

A GUI application to easily try out Facebook's LLaMA models.

27
Experimental
2595 TingjiaInFuture/pixrep

Let LLMs see your codebase just like you do.

27
Experimental
2596 feifeibear/Odysseus-Transformer

Odysseus: Playground of LLM Sequence Parallelism

27
Experimental
2597 hydropix/AutoDescribe-Images

Tool to automatically generate text descriptions for images using Ollama...

27
Experimental
2598 zhuang-li/SCAR

[ACL 2025 main] SCAR: Data Selection via Style Consistency-Aware Response...

27
Experimental
2599 NiuTrans/LaMaTE

Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine...

27
Experimental
2600 ma2za/torch-adapters

Small Library of PyTorch Adaptation modules

27
Experimental
« Prev 1 2 3 24 25 26 27 28 63 64 65 Next »