All Transformer Models

6,429 models ranked by quality score · Page 38 of 65

Showing 3701–3800 of 6,429
# Model Score Tier
3701 OpenNLG/OpenBA-v2

OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing...

20
Experimental
3702 CameLLM/CameLLM

Run your favourite LLMs locally on macOS from Swift

20
Experimental
3703 hitz-zentroa/This-is-not-a-Dataset

We introduce a large semi-automatically generated dataset of ~400,000...

20
Experimental
3704 LookUpMark/dylem-grid

DYLEM-GRID is a deep learning project for dynamic hand gesture recognition...

20
Experimental
3705 yyy01/PAC

The official implementation of the paper "Data Contamination Calibration for...

20
Experimental
3706 thevasudevgupta/transformers-adapters

This repositary hosts my experiments for the project, I did with OffNote Labs.

20
Experimental
3707 TRISTAN-ORF/RiboTIE

Scripts and instructions to apply RiboTIE on Ribo-seq data

20
Experimental
3708 gersongerardcruz/extractive_and_abstractive_text_summarization

A combination of extractive and abstractive text summarization for...

20
Experimental
3709 UCSC-VLAA/Sight-Beyond-Text

[TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal...

20
Experimental
3710 DanHrmti/SenTransformer-VAE-pytorch

Sentence VAE using the Transformer encoder-decoder architecture.

20
Experimental
3711 AntonioVFranco/elamonica

Production-ready test-time compute optimization framework for LLM inference....

20
Experimental
3712 dunktra/attention-binding-a11y

Code for tracking concept emergence via attention-head binding (EB*). Pythia...

20
Experimental
3713 chizkidd/fastai

Implementation of fast.ai deep learning courses: "Practical Deep Learning...

20
Experimental
3714 plutonium-239/memsave_torch

Lowering PyTorch's Memory Consumption for Selective Differentiation

20
Experimental
3715 CanvaChen/chinese-llama-tokenizer

目标:构建一个更符合语言学的小而美的 llama 分词器,支持中英日三国语言

20
Experimental
3716 vbuyel/PointerLM-AI-File-Assistant

Your helpful file and web chatbot assistant. Made with DDD architecture

20
Experimental
3717 navamai/navamai

Use NavamAI to supercharge your productivity and workflow with personal,...

20
Experimental
3718 januverma/transformers-for-sequential-recommendation

Notebooks on using transformers for sequential recommendation tasks

20
Experimental
3719 Traffic-Alpha/VLMLight

Official implementation of VLMLight

20
Experimental
3720 StarLight1212/LLM-and-Generative-Models-Community

AI Community Tutorial, including: LoRA/Qlora LLM fine-tuning, Training GPT-2...

20
Experimental
3721 DDDOH/LLM_News

LOLA_ LLM-Assisted Online Learning Algorithm for Content Experiments

20
Experimental
3722 noah-hein/mazeGPT

AI model for making mazes that extends OpenAIs GPT2 model

20
Experimental
3723 llap4585/T5-Refiner-DomainFocus-TrainOnly

This project provides code for fine-tuning T5/mT5 models on data...

20
Experimental
3724 Orfeous/llamacpp.net

C#/.NET binding of llama.cpp

20
Experimental
3725 waybarrios/dgx-spark-finetune-llm

LLM fine-tuning with LoRA + NVFP4/MXFP8 on NVIDIA DGX Spark (Blackwell GB10)

20
Experimental
3726 LSquaredM/mutual_info_scaling_law

(NeurIPS 2025) Official Code for L²M: Mutual Information Scaling Law for...

20
Experimental
3727 illoonego/gemma-finetune-emails

LoRA fine-tuning pipeline for Google’s Gemma-2B language model to classify...

20
Experimental
3728 sandeeppanem/qwen3-resume-extraction

Fine-tune Qwen3-0.6B for resume parsing using LoRA

20
Experimental
3729 mamounyosef/commit-message-llm

Fine-tuning Qwen2.5-Coder-0.5B LLM using QLoRA (4-bit quantization + LoRA)...

20
Experimental
3730 mazurkin/ptn

train own virtual "PTN" LLM model

20
Experimental
3731 jdleo/tinysafe-2

141M param safety model (not much better than v1, but a great learning)

20
Experimental
3732 GGBond2424648901/transformers-29-tasks

🎨 Transformers实战训练项目 -...

20
Experimental
3733 tobifinn/ensemble_transformer

Official PyTorch implementation of "Self-Attentive Ensemble Transformer:...

20
Experimental
3734 gsarti/pecore

Materials for "Quantifying the Plausibility of Context Reliance in Neural...

20
Experimental
3735 JamesVorder/python-tddpp

This LLM generates code based on tests, and makes sure they pass.

20
Experimental
3736 twitter-research/lmsoc

Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining

20
Experimental
3737 leonjovanovic/keywords-extraction

Keyword extraction using Scake, KeyBERT, Fine-tuning Transformer BERT-like...

20
Experimental
3738 AlexIoannides/llm-regression

Exploring the classical regression capabilities of LLMs.

20
Experimental
3739 rachel-pai/T5Elasticsearch

Elasticsearch with T5/Bert/Other models provided by huggingface Transfomers.

20
Experimental
3740 MitulNakrani003/AI-Enhanced-IR-System

AI-enhanced search pipeline using hybrid retrieval + transformer models for...

20
Experimental
3741 MysterionRise/transformers-nlp-suite

Enterprise NLP Platform - Production REST API with auth, rate limiting,...

20
Experimental
3742 ThaminduR/mt5-simplification

Scripts related to training and predicting Google's mt5 model

20
Experimental
3743 isaacus-dev/terge

An easy-to-use Python library for merging PyTorch models.

20
Experimental
3744 fbotathome/butia_speech

This package provides some tools to make the robot DoRIS speak and listen....

20
Experimental
3745 pszemraj/decoder-pytorch-template

Hackable PyTorch template for decoder-only transformer architecture...

20
Experimental
3746 Sarah111-AHM/ZakeyTeam-arabic-qa-system-arabert

an AI powered Arabic Question Answering system built by fine tuning the...

20
Experimental
3747 oooranz/Baby-CoThought

🍼 Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models...

20
Experimental
3748 TheDarkchip/nfp

Lean 4 library + CLI for rigorous bounds in transformer computations...

20
Experimental
3749 Praful932/llmsearch

Find better generation parameters for your LLM

20
Experimental
3750 abhilashpuli98/Deep-Learning-Paper-Implementations

A collection of paper implementations using the PyTorch framework

20
Experimental
3751 mohsenMahmoodzadeh/image-and-text-classifier

Deep learning models(CNN, LSTM, BERT) for image and text classification task...

20
Experimental
3752 loryanstrant/ha-transformers-theme

A Transformers theme for Home Assistant

20
Experimental
3753 cronenberg64/SciBERT-CTFT

SciBERT-based scientific abstract classification using SetFit framework with...

20
Experimental
3754 givkashi/Awesome-unet-like-transformers

Awesome UNet with Transformer

20
Experimental
3755 JuliusScheuerer/nlp-job-classifier

Text classification with fine-tuned DistilBERT — FastAPI + Streamlit

20
Experimental
3756 jankstar/pydocu

fastapi server for classification of documents and extraction of data

20
Experimental
3757 atomlayer/llama_cute_voice_assistant

Llama cute voice assistant

20
Experimental
3758 eigencore/Tlama_124M

Tlama (124M) is a language model based on LlaMa3 (127M) optimized by...

20
Experimental
3759 ImMohammadHosseini/MKP-RL

:sparkles: Solve multi_dimensional multiple knapsack problem using...

20
Experimental
3760 bagh2178/GC-VLN

[CoRL 2025] GC-VLN: Instruction as Graph Constraints for Training-free...

20
Experimental
3761 HROlive/Advanced-Deep-Learning-with-Transformers

Workshop that will take you from Graph Neural Networks (GNNs) to...

20
Experimental
3762 lrusso/LlamaWebServer

Web server implementation of Llama

20
Experimental
3763 januverma/transformers-stuff

Codes, scripts, and notebooks on various aspects of transformer models.

20
Experimental
3764 sreekarvamsi/vehiclebert

Domain-Specific NLP for Automotive Entities

20
Experimental
3765 devsynck/llama-panel

A web-based control panel for managing and monitoring local llama.cpp server...

20
Experimental
3766 Timothy-Logan/Sentiment-Analyzer

Text sentiment analysis CLI tool using pre-trained transformers. Classify...

20
Experimental
3767 daedalus/distiller

Model distiller automator

20
Experimental
3768 newfull5/NLLB-200-Distilled-350M-en-ko

nllb-200 distilled 350M for English to Korean translation

20
Experimental
3769 Hyun-Ryu/clover

Official code for "Divide and Translate: Compositional First-Order Logic...

20
Experimental
3770 othmanelhoufi/LM-for-FactChecking

An automated solution for fact-checking using available claims and fake-news...

20
Experimental
3771 James-Crockett/Support-Ticket-Auto-Triage

An automated ticket classification system using NLP. Compares traditional...

20
Experimental
3772 alipay/fin_domain_llm

Implementation of the paper: WeaverBird: Empowering Financial...

20
Experimental
3773 Arlchoose-code/Indonesian-LLM-Starter

A starter kit for building your own Indonesian Large Language Model (LLM)...

20
Experimental
3774 svn05/vietnamese-nmt

Vietnamese-English-Japanese NMT with fine-tuned NLLB-200, beam search, and...

20
Experimental
3775 MusfiqDehan/Llama2-Finetuned-for-Translation

Fine-Tuned Llama-2 For Machine Translation

20
Experimental
3776 rivas-lab/Smiles2Dock

Smiles2Dock: an open large-scale multi-task dataset for ML-based molecular...

20
Experimental
3777 isaaccorley/segmenter-pytorch

PyTorch implementation of "Segmenter: Transformer for Semantic Segmentation"...

20
Experimental
3778 cui-shaobo/causal-strength

evaluating the causal strength between cause and effect

20
Experimental
3779 ZifanL/TSDS

Implementation of TSDS: Data Selection for Task-Specific Model Finetuning....

20
Experimental
3780 Adam-maz/GenAI-assisted-tool-for-Virtual-Screening

This repository introduces a proof-of-concept toolkit for generative virtual...

20
Experimental
3781 bandirevanth/Verbix

AI Grammar Scoring Engine with Evaluation & Feedback

20
Experimental
3782 kozodoi/Text_Readability_Prediction

Predicting text reading complexity with transformers (top-9% Kaggle solution...

20
Experimental
3783 nishantb06/smolLM

Reverse Engineering SmolLM2 model and training it from scratch

20
Experimental
3784 johnbrodowski/DuckAiChatAPI

A .NET console application for chatting with DuckDuckGo's AI Chat service...

20
Experimental
3785 csm9493/efficient-llm-unlearning

Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs (ICLR 2025)

20
Experimental
3786 CeMOS-IS/GenFormer

[ICPR 2024] Official repository of the paper "GenFormer - Generated Images...

20
Experimental
3787 snexus/nlp-question-answering-system

Question answering system with transformers

20
Experimental
3788 khairulislam/Timeseries-Explained

Interpreting Deep Learning timeseries models using Local Interpretation methods

20
Experimental
3789 aswinvinodd/emotion-detection-system

AI-based Emotion Detection and Sentiment Analysis System using NLP and Streamlit

20
Experimental
3790 Manohara-Ai/Reinforcement_Learning_Framework_to_Prevent_Jailbreaks

A reinforcement learning-based system designed to detect and prevent...

20
Experimental
3791 Ankit6174/Domain-Specific-Mini-LLM-for-Genomic-Mutation-Prediction

An encoder only transformer model that accurately predicts the type, genomic...

20
Experimental
3792 skyline-GTRr32/OKI-TRACE

OKI TRACE: Local LLM observability. See step-by-step, layer-by-layer what...

20
Experimental
3793 AbineshSivakumar/Llama-2-7B-QLoRA-Vicuna

This repository contains code to fine-tune a Llama-7B-Uncensored model using...

20
Experimental
3794 YASSER-27/LLMs

A high-performance, cross-platform desktop application for chatting with...

20
Experimental
3795 kyegomez/MLXTransformer

Simple Implementation of a Transformer in the new framework MLX by Apple

20
Experimental
3796 himanshu231204/hk-devbrain

HK-DevBrain is a lightweight AI developer assistant built on Llama 3.2 (3B)...

20
Experimental
3797 IsmaelMousa/TTL

Full-stack simulator for a todo task list application using FastAPI, I built...

20
Experimental
3798 mytechnotalent/mechanistic_interpretability

Mechanistic Interpretability (MI) is a subfield of AI alignment and safety...

20
Experimental
3799 li-plus/nanoRLHF

Train a tiny LLaMA model from scratch to repeat your words using...

20
Experimental
3800 theanasuddin/Advanced-Deep-Learning

Computer exercises for Advanced Deep Learning. Includes implementations of...

20
Experimental
« Prev 1 2 3 36 37 38 39 40 63 64 65 Next »