All Transformer Models
6,429 models ranked by quality score · Page 28 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 2701 |
wangcongcong123/transection
Transection: Transformers for English to Chinese Translation |
|
Experimental |
| 2702 |
Linxyhaha/DEALRec
Data-efficient Fine-tuning for LLM-based Recommendation (SIGIR'24) |
|
Experimental |
| 2703 |
pooya-mohammadi/audio-classification-pytorch
In this project, several approaches for training/finetuning an audio gender... |
|
Experimental |
| 2704 |
CoderFatherBB/Crop-Doctor-Final-Year-Project-
This project is a comprehensive Flask-based application designed to help... |
|
Experimental |
| 2705 |
kyegomez/Open-NAMM
An open source implementation of the paper: "AN EVOLVED UNIVERSAL TRANSFORMER MEMORY" |
|
Experimental |
| 2706 |
ES7/LLaMA-from-Scratch
In this repository, I have explained the working of the LLaMA Model,... |
|
Experimental |
| 2707 |
Armaggheddon/BricksFinder
BricksFinder is your ultimate LEGO sidekick 🧱🔍—a magical tool that lets you... |
|
Experimental |
| 2708 |
aniass/Spam-detection
Spam detection in SMS messages with BERT model and Machine Learning algorithms |
|
Experimental |
| 2709 |
mrkorzun/Multi-AI-Telegram-Bot
Multi-model Telegram bot (aiogram v3) with OpenRouter model picker (Llama,... |
|
Experimental |
| 2710 |
micahondiwa/applied-ai
Deep Learning for Computer Vision: A collection of 6 end-to-end applied AI... |
|
Experimental |
| 2711 |
isaacus-dev/emubert-creator
The training code behind EmuBert, the largest open-source masked language... |
|
Experimental |
| 2712 |
deterministic-algorithms-lab/NLP-Journey
This repository provides a selection of very basic and minimal notebooks for... |
|
Experimental |
| 2713 |
rabiloo/llm-finetuning
Sample for Fine-Tuning LLMs & VLMs |
|
Experimental |
| 2714 |
detsutut/ama-bot
A modern and lightweight NLP interface for Question-Answering systems and... |
|
Experimental |
| 2715 |
ProGamerGov/VLM-Captioning-Tools
Python scripts to use for captioning images with VLMs |
|
Experimental |
| 2716 |
anandshah98/MedQA
Answer medical queries through a simple LLM chatbot rather than searching... |
|
Experimental |
| 2717 |
spark-engine-ai/ai-discord-bot
An AI powered Discord bot that chats, can search the web and generate both... |
|
Experimental |
| 2718 |
dalisoft/awesome-chatbot
List of awesome AI Chat-bots |
|
Experimental |
| 2719 |
smitkiri/news-qa
Reading comprehension based question-answering model for news articles. |
|
Experimental |
| 2720 |
rakibnsajib/MediBot-AI-Doctor-with-Vision-and-Voice
AI-powered medical assistant using LLaMA-3.2-11B-Vision, Whisper, and... |
|
Experimental |
| 2721 |
nanowell/Q-Sparse-LLM
My Implementation of Q-Sparse: All Large Language Models can be Fully... |
|
Experimental |
| 2722 |
docusealco/rllama
Ruby FFI bindings for llama.cpp to run open-source LLMs such as GPT-OSS,... |
|
Experimental |
| 2723 |
North-Shore-AI/tinkex_cookbook
Elixir port of tinker-cookbook: training and evaluation recipes for the... |
|
Experimental |
| 2724 |
declare-lab/TEAM
Our EMNLP 2022 paper on MCQA |
|
Experimental |
| 2725 |
askblocks/askblocks-core
LLM API backend for Askblocks Q&A widget system. |
|
Experimental |
| 2726 |
AndrewBoessen/PerfectRep
PerfectRep is a 3D pose estimation model tailored specifically for... |
|
Experimental |
| 2727 |
nicholasyager/llama-cpp-guidance
A guidance compatibility layer for llama-cpp-python |
|
Experimental |
| 2728 |
jose-blockchain/cerebras-coding-agent
A Cerebras AI LLM coding agent for the command line |
|
Experimental |
| 2729 |
0xJakuzya/sentiment-analysis-tg-news
Sentiment analysis tool for Telegram news: scraping with Telethon, text... |
|
Experimental |
| 2730 |
di37/ner-electrical-engineering-finetuning
This repository includes notebooks starting from data tokenization and... |
|
Experimental |
| 2731 |
gia-uh/cecilia
The Cuban Language Model |
|
Experimental |
| 2732 |
KasraAhmadi/PII-360
An open-source Chrome Extension that identifies Personally Identifiable... |
|
Experimental |
| 2733 |
AbhinavTheDev/DevCompass
spend less on wondering, more on working |
|
Experimental |
| 2734 |
maciekt07/Lecture-Note-Generator-POC
📒 A proof-of-concept app that transcribes lecture recordings into text and... |
|
Experimental |
| 2735 |
graphcore-research/jax-scalify
JAX Scalify: end-to-end scaled arithmetics |
|
Experimental |
| 2736 |
AlgonetLabs/Cable
Context-aware Biases for Length Extrapolation |
|
Experimental |
| 2737 |
zTgx/llmweb-rs
Webpage to structured data in Rust & LLM |
|
Experimental |
| 2738 |
TheAnkurGoswami/Neural-Networks-from-Scratch
Implementation of different neural networks with back-propagation logic. |
|
Experimental |
| 2739 |
daskol/llama.py
Python bindings to llama.cpp |
|
Experimental |
| 2740 |
ITMO-NSS-team/sea_ice_transformers
This repository contains code for the research of transformer effectiveness... |
|
Experimental |
| 2741 |
aniquetahir/JORA
JORA: JAX Tensor-Parallel LoRA Library (ACL 2024) |
|
Experimental |
| 2742 |
li-plus/flash-preference
Accelerate LLM preference tuning via prefix sharing with a single line of code |
|
Experimental |
| 2743 |
zerob13/modelinfo-cli
A CLI to query AI model capabilities, context limits, and pricing from... |
|
Experimental |
| 2744 |
hasanisaeed/C-Transformer
Implementation of the core Transformer architecture in pure C |
|
Experimental |
| 2745 |
VITA-Group/TAPE
[ICML'25] "Rethinking Addressing in Language Models via Contextualized... |
|
Experimental |
| 2746 |
Uralstech/vid-orca
Deploy LLaMA-2 Chat on Google Cloud. |
|
Experimental |
| 2747 |
SachinKalsi/annotated-research-papers
This repository is a comprehensive collection of research papers,... |
|
Experimental |
| 2748 |
CYFARE/PDXTRACT
Extract From PDF's Using Ollama Local LLM |
|
Experimental |
| 2749 |
telekom/transformer-tools
Transformers Training Tools |
|
Experimental |
| 2750 |
jseeio/gpt2-tfjs
GPT2 with Tensorflow.js |
|
Experimental |
| 2751 |
codewithdark-git/QuantLLM
QuantLLM is a Python library designed for developers, researchers, and teams... |
|
Experimental |
| 2752 |
cgjosephlee/ollama-save-load
Save and load ollama models just like operating docker images. |
|
Experimental |
| 2753 |
parameterlab/apricot
Source code of "Calibrating Large Language Models Using Their Generations... |
|
Experimental |
| 2754 |
CharlesYuan02/eve-bot
A Discord bot I created in Python. Her name is Eve. |
|
Experimental |
| 2755 |
lpalbou/model-quantizer
Effortlessly quantize, benchmark, and publish Hugging Face models with... |
|
Experimental |
| 2756 |
Argo-Robot/foundation_models
Overview about state-of-art imitation learning techniques for robotic... |
|
Experimental |
| 2757 |
BoHuangLab/CELL-E_2
Multimodal encoder-only transformer model for image-based protein predictions |
|
Experimental |
| 2758 |
s-macke/GoPT
GPT-2 Model Inference |
|
Experimental |
| 2759 |
ArneBinder/pytorch-ie-hydra-template-1
PyTorch-IE Hydra Template |
|
Experimental |
| 2760 |
HyperMink/inferenceable
Scalable AI Inference Server for CPU and GPU with Node.js | Utilizes... |
|
Experimental |
| 2761 |
alex-snd/TRecover
📜 A python library for distributed training of a Transformer neural network... |
|
Experimental |
| 2762 |
ybubnov/metalchat
Pure C++23 Llama inference for Apple Silicon chips |
|
Experimental |
| 2763 |
avaapm/TurkishNamedEntityRecognition
Source code and the details of the results in the paper "Named entity... |
|
Experimental |
| 2764 |
SapienzaNLP/MaTESe
MaTESe: Machine Translation Evaluation as a Sequence Tagging Problem |
|
Experimental |
| 2765 |
KCLabMTU/LMCrot
Protein Language Model (pLM) Powered Protein Crotonylation (Kcr) Modified... |
|
Experimental |
| 2766 |
EdvardOlsen/Horoscope_generator
This is a horoscope generating code |
|
Experimental |
| 2767 |
pleisto/yuren-13b
Yuren 13B is an information synthesis large language model that has been... |
|
Experimental |
| 2768 |
EmbeddedLLM/embeddedllm
EmbeddedLLM: API server for Embedded Device Deployment. Currently support... |
|
Experimental |
| 2769 |
tasketh/tasketh
tasketh is a simple discord bot that lets moderators assign, and users claim tasks. |
|
Experimental |
| 2770 |
BenChaliah/Superposition-Transformer
a novel architecture that leverages Autoencoders to superimpose the hidden... |
|
Experimental |
| 2771 |
michaelhly/FarGlot
A Transformer-based SocialNLP toolkit for Farcaster |
|
Experimental |
| 2772 |
zhiyuanhubj/LongRecipe
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models |
|
Experimental |
| 2773 |
taishan1994/qlora-chinese-LLM
使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE |
|
Experimental |
| 2774 |
FareedKhan-dev/Understanding-Transformers-Step-by-Step-math-example
Understanding Large Language Transformer Architecture like a child |
|
Experimental |
| 2775 |
PeterGriffinJin/Heterformer
Heterformer: Transformer-based Deep Node Representation Learning on... |
|
Experimental |
| 2776 |
yubainu/sibainu-engine
Real-time hallucination detection for LLMs via Geometric Drift Analysis in... |
|
Experimental |
| 2777 |
Shaurya-Sethi/transqlate
End-to-end natural language to SQL system: schema-aware model fine-tuning,... |
|
Experimental |
| 2778 |
ScottCampit/personalized-marketing-chatbot
personalized marketing chatbot |
|
Experimental |
| 2779 |
HenryNdubuaku/super-lazy-autograd
Hand-derived memory-efficient VJPs for tuning LLMs on laptops. |
|
Experimental |
| 2780 |
Aaronhuang-778/SliM-LLM
[ICML 2025] SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large... |
|
Experimental |
| 2781 |
pittisl/GreenTrainer
Code for paper "Towards Green AI in Fine-tuning Large Language Models via... |
|
Experimental |
| 2782 |
Bruce-Lee-LY/cutlass_gemm
Multiple GEMM operators are constructed with cutlass to support LLM inference. |
|
Experimental |
| 2783 |
huluhuluzhi/EmoScape-HCI_Final_Project-2025
🌦️ EmoScape: A multimodal AI system that visualizes emotions as generative... |
|
Experimental |
| 2784 |
codyjk/ChessGPT
♟️ A transformer that plays chess 🤖 |
|
Experimental |
| 2785 |
kassane/ollama-d
D bindings for the Ollama API |
|
Experimental |
| 2786 |
Mcpasi/egoMorph-
Eine emotionale KI für den Browser: erkennt Gefühle, passt ihre... |
|
Experimental |
| 2787 |
StarxSky/ANE-GPT-New
New ANE GPT |
|
Experimental |
| 2788 |
mahsasheikh/DrugGen
DrugGen: Advancing Drug Discovery with Large Language Models and... |
|
Experimental |
| 2789 |
khiwniti/kaggle-llm-api
🤖 Comprehensive solution for running Ollama/vLLM API servers in Kaggle... |
|
Experimental |
| 2790 |
zzteam-rccup-2024/aurora-echo
We propose a new feedback system, named Aurora Echo} which provides... |
|
Experimental |
| 2791 |
deepmancer/tweet-disaster-detection
fine-tuned BERT and scikit-learn models for real-time classification of... |
|
Experimental |
| 2792 |
theboringhumane/echoOLlama
🦙 echoOLlama: A real-time voice AI platform powered by local LLMs. Features... |
|
Experimental |
| 2793 |
chris-santiago/met
Reproducing the MET framework with PyTorch |
|
Experimental |
| 2794 |
xdevfaheem/Transformers
A Comprehensive Implementation of Transformers Architecture from Scratch |
|
Experimental |
| 2795 |
ant-louis/netbert
📶 NetBERT: a domain-specific BERT model for computer networking. |
|
Experimental |
| 2796 |
BrightBlueCheese/transformers_and_chemistry
The Role of Model Architecture and Scale in Predicting Molecular Properties:... |
|
Experimental |
| 2797 |
datasig-ac-uk/nlpsig
Package for constructing paths of embeddings obtained from transformers. |
|
Experimental |
| 2798 |
RJain12/choformer
Cho codon optimization WIP |
|
Experimental |
| 2799 |
jpwahle/emnlp23-paraphrase-types
The official implementation of the EMNLP 2023 paper "Paraphrase Types for... |
|
Experimental |
| 2800 |
py-lama/weblama
A web-based Markdown editor with syntax highlighting, Mermaid diagram... |
|
Experimental |