All Transformer Models

6,429 models ranked by quality score · Page 28 of 65

Showing 2701–2800 of 6,429
# Model Score Tier
2701 wangcongcong123/transection

Transection: Transformers for English to Chinese Translation

26
Experimental
2702 Linxyhaha/DEALRec

Data-efficient Fine-tuning for LLM-based Recommendation (SIGIR'24)

26
Experimental
2703 pooya-mohammadi/audio-classification-pytorch

In this project, several approaches for training/finetuning an audio gender...

26
Experimental
2704 CoderFatherBB/Crop-Doctor-Final-Year-Project-

This project is a comprehensive Flask-based application designed to help...

26
Experimental
2705 kyegomez/Open-NAMM

An open source implementation of the paper: "AN EVOLVED UNIVERSAL TRANSFORMER MEMORY"

26
Experimental
2706 ES7/LLaMA-from-Scratch

In this repository, I have explained the working of the LLaMA Model,...

26
Experimental
2707 Armaggheddon/BricksFinder

BricksFinder is your ultimate LEGO sidekick 🧱🔍—a magical tool that lets you...

26
Experimental
2708 aniass/Spam-detection

Spam detection in SMS messages with BERT model and Machine Learning algorithms

26
Experimental
2709 mrkorzun/Multi-AI-Telegram-Bot

Multi-model Telegram bot (aiogram v3) with OpenRouter model picker (Llama,...

26
Experimental
2710 micahondiwa/applied-ai

Deep Learning for Computer Vision: A collection of 6 end-to-end applied AI...

26
Experimental
2711 isaacus-dev/emubert-creator

The training code behind EmuBert, the largest open-source masked language...

26
Experimental
2712 deterministic-algorithms-lab/NLP-Journey

This repository provides a selection of very basic and minimal notebooks for...

26
Experimental
2713 rabiloo/llm-finetuning

Sample for Fine-Tuning LLMs & VLMs

26
Experimental
2714 detsutut/ama-bot

A modern and lightweight NLP interface for Question-Answering systems and...

26
Experimental
2715 ProGamerGov/VLM-Captioning-Tools

Python scripts to use for captioning images with VLMs

26
Experimental
2716 anandshah98/MedQA

Answer medical queries through a simple LLM chatbot rather than searching...

26
Experimental
2717 spark-engine-ai/ai-discord-bot

An AI powered Discord bot that chats, can search the web and generate both...

26
Experimental
2718 dalisoft/awesome-chatbot

List of awesome AI Chat-bots

26
Experimental
2719 smitkiri/news-qa

Reading comprehension based question-answering model for news articles.

26
Experimental
2720 rakibnsajib/MediBot-AI-Doctor-with-Vision-and-Voice

AI-powered medical assistant using LLaMA-3.2-11B-Vision, Whisper, and...

26
Experimental
2721 nanowell/Q-Sparse-LLM

My Implementation of Q-Sparse: All Large Language Models can be Fully...

26
Experimental
2722 docusealco/rllama

Ruby FFI bindings for llama.cpp to run open-source LLMs such as GPT-OSS,...

26
Experimental
2723 North-Shore-AI/tinkex_cookbook

Elixir port of tinker-cookbook: training and evaluation recipes for the...

26
Experimental
2724 declare-lab/TEAM

Our EMNLP 2022 paper on MCQA

26
Experimental
2725 askblocks/askblocks-core

LLM API backend for Askblocks Q&A widget system.

26
Experimental
2726 AndrewBoessen/PerfectRep

PerfectRep is a 3D pose estimation model tailored specifically for...

26
Experimental
2727 nicholasyager/llama-cpp-guidance

A guidance compatibility layer for llama-cpp-python

26
Experimental
2728 jose-blockchain/cerebras-coding-agent

A Cerebras AI LLM coding agent for the command line

26
Experimental
2729 0xJakuzya/sentiment-analysis-tg-news

Sentiment analysis tool for Telegram news: scraping with Telethon, text...

26
Experimental
2730 di37/ner-electrical-engineering-finetuning

This repository includes notebooks starting from data tokenization and...

26
Experimental
2731 gia-uh/cecilia

The Cuban Language Model

26
Experimental
2732 KasraAhmadi/PII-360

An open-source Chrome Extension that identifies Personally Identifiable...

26
Experimental
2733 AbhinavTheDev/DevCompass

spend less on wondering, more on working

26
Experimental
2734 maciekt07/Lecture-Note-Generator-POC

📒 A proof-of-concept app that transcribes lecture recordings into text and...

26
Experimental
2735 graphcore-research/jax-scalify

JAX Scalify: end-to-end scaled arithmetics

26
Experimental
2736 AlgonetLabs/Cable

Context-aware Biases for Length Extrapolation

26
Experimental
2737 zTgx/llmweb-rs

Webpage to structured data in Rust & LLM

26
Experimental
2738 TheAnkurGoswami/Neural-Networks-from-Scratch

Implementation of different neural networks with back-propagation logic.

26
Experimental
2739 daskol/llama.py

Python bindings to llama.cpp

26
Experimental
2740 ITMO-NSS-team/sea_ice_transformers

This repository contains code for the research of transformer effectiveness...

26
Experimental
2741 aniquetahir/JORA

JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)

26
Experimental
2742 li-plus/flash-preference

Accelerate LLM preference tuning via prefix sharing with a single line of code

26
Experimental
2743 zerob13/modelinfo-cli

A CLI to query AI model capabilities, context limits, and pricing from...

26
Experimental
2744 hasanisaeed/C-Transformer

Implementation of the core Transformer architecture in pure C

26
Experimental
2745 VITA-Group/TAPE

[ICML'25] "Rethinking Addressing in Language Models via Contextualized...

26
Experimental
2746 Uralstech/vid-orca

Deploy LLaMA-2 Chat on Google Cloud.

26
Experimental
2747 SachinKalsi/annotated-research-papers

This repository is a comprehensive collection of research papers,...

26
Experimental
2748 CYFARE/PDXTRACT

Extract From PDFs Using Ollama Local LLM

26
Experimental
2749 telekom/transformer-tools

Transformers Training Tools

26
Experimental
2750 jseeio/gpt2-tfjs

GPT-2 with TensorFlow.js

26
Experimental
2751 codewithdark-git/QuantLLM

QuantLLM is a Python library designed for developers, researchers, and teams...

26
Experimental
2752 cgjosephlee/ollama-save-load

Save and load ollama models just like operating docker images.

26
Experimental
2753 parameterlab/apricot

Source code of "Calibrating Large Language Models Using Their Generations...

26
Experimental
2754 CharlesYuan02/eve-bot

A Discord bot I created in Python. Her name is Eve.

25
Experimental
2755 lpalbou/model-quantizer

Effortlessly quantize, benchmark, and publish Hugging Face models with...

25
Experimental
2756 Argo-Robot/foundation_models

Overview of state-of-the-art imitation learning techniques for robotic...

25
Experimental
2757 BoHuangLab/CELL-E_2

Multimodal encoder-only transformer model for image-based protein predictions

25
Experimental
2758 s-macke/GoPT

GPT-2 Model Inference

25
Experimental
2759 ArneBinder/pytorch-ie-hydra-template-1

PyTorch-IE Hydra Template

25
Experimental
2760 HyperMink/inferenceable

Scalable AI Inference Server for CPU and GPU with Node.js | Utilizes...

25
Experimental
2761 alex-snd/TRecover

📜 A python library for distributed training of a Transformer neural network...

25
Experimental
2762 ybubnov/metalchat

Pure C++23 Llama inference for Apple Silicon chips

25
Experimental
2763 avaapm/TurkishNamedEntityRecognition

Source code and the details of the results in the paper "Named entity...

25
Experimental
2764 SapienzaNLP/MaTESe

MaTESe: Machine Translation Evaluation as a Sequence Tagging Problem

25
Experimental
2765 KCLabMTU/LMCrot

Protein Language Model (pLM) Powered Protein Crotonylation (Kcr) Modified...

25
Experimental
2766 EdvardOlsen/Horoscope_generator

This is a horoscope generating code

25
Experimental
2767 pleisto/yuren-13b

Yuren 13B is an information synthesis large language model that has been...

25
Experimental
2768 EmbeddedLLM/embeddedllm

EmbeddedLLM: API server for Embedded Device Deployment. Currently support...

25
Experimental
2769 tasketh/tasketh

tasketh is a simple Discord bot that lets moderators assign tasks and users claim them.

25
Experimental
2770 BenChaliah/Superposition-Transformer

a novel architecture that leverages Autoencoders to superimpose the hidden...

25
Experimental
2771 michaelhly/FarGlot

A Transformer-based SocialNLP toolkit for Farcaster

25
Experimental
2772 zhiyuanhubj/LongRecipe

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

25
Experimental
2773 taishan1994/qlora-chinese-LLM

Fine-tuning Chinese large language models with QLoRA, including ChatGLM, Chinese-LLaMA-Alpaca, and BELLE

25
Experimental
2774 FareedKhan-dev/Understanding-Transformers-Step-by-Step-math-example

Understanding Large Language Transformer Architecture like a child

25
Experimental
2775 PeterGriffinJin/Heterformer

Heterformer: Transformer-based Deep Node Representation Learning on...

25
Experimental
2776 yubainu/sibainu-engine

Real-time hallucination detection for LLMs via Geometric Drift Analysis in...

25
Experimental
2777 Shaurya-Sethi/transqlate

End-to-end natural language to SQL system: schema-aware model fine-tuning,...

25
Experimental
2778 ScottCampit/personalized-marketing-chatbot

personalized marketing chatbot

25
Experimental
2779 HenryNdubuaku/super-lazy-autograd

Hand-derived memory-efficient VJPs for tuning LLMs on laptops.

25
Experimental
2780 Aaronhuang-778/SliM-LLM

[ICML 2025] SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large...

25
Experimental
2781 pittisl/GreenTrainer

Code for paper "Towards Green AI in Fine-tuning Large Language Models via...

25
Experimental
2782 Bruce-Lee-LY/cutlass_gemm

Multiple GEMM operators are constructed with cutlass to support LLM inference.

25
Experimental
2783 huluhuluzhi/EmoScape-HCI_Final_Project-2025

🌦️ EmoScape: A multimodal AI system that visualizes emotions as generative...

25
Experimental
2784 codyjk/ChessGPT

♟️ A transformer that plays chess 🤖

25
Experimental
2785 kassane/ollama-d

D bindings for the Ollama API

25
Experimental
2786 Mcpasi/egoMorph-

An emotional AI for the browser: recognizes feelings, adapts its...

25
Experimental
2787 StarxSky/ANE-GPT-New

New ANE GPT

25
Experimental
2788 mahsasheikh/DrugGen

DrugGen: Advancing Drug Discovery with Large Language Models and...

25
Experimental
2789 khiwniti/kaggle-llm-api

🤖 Comprehensive solution for running Ollama/vLLM API servers in Kaggle...

25
Experimental
2790 zzteam-rccup-2024/aurora-echo

We propose a new feedback system, named Aurora Echo, which provides...

25
Experimental
2791 deepmancer/tweet-disaster-detection

fine-tuned BERT and scikit-learn models for real-time classification of...

25
Experimental
2792 theboringhumane/echoOLlama

🦙 echoOLlama: A real-time voice AI platform powered by local LLMs. Features...

25
Experimental
2793 chris-santiago/met

Reproducing the MET framework with PyTorch

25
Experimental
2794 xdevfaheem/Transformers

A Comprehensive Implementation of Transformers Architecture from Scratch

25
Experimental
2795 ant-louis/netbert

📶 NetBERT: a domain-specific BERT model for computer networking.

25
Experimental
2796 BrightBlueCheese/transformers_and_chemistry

The Role of Model Architecture and Scale in Predicting Molecular Properties:...

25
Experimental
2797 datasig-ac-uk/nlpsig

Package for constructing paths of embeddings obtained from transformers.

25
Experimental
2798 RJain12/choformer

Cho codon optimization WIP

25
Experimental
2799 jpwahle/emnlp23-paraphrase-types

The official implementation of the EMNLP 2023 paper "Paraphrase Types for...

25
Experimental
2800 py-lama/weblama

A web-based Markdown editor with syntax highlighting, Mermaid diagram...

25
Experimental