All Transformer Models

6,429 models ranked by quality score · Page 33 of 65

Showing 3201–3300 of 6,429
# Model Score Tier
3201 bosszii2709/ai-dataset-generator

🤖 Generate tailored AI training datasets quickly and easily, transforming...

23
Experimental
3202 PradipKumarDas/Competitions

This repository is the home for all competitions.

23
Experimental
3203 sodascience/social_science_inferences_with_llms

Addressing LLM-related measurement error in social science modeling research.

23
Experimental
3204 kardSIM/Trading_RL_agent_with_transformers

An RL agent that can trade using Deep Q-Network (DQN) and a decoder-only...

23
Experimental
3205 mjglatzmaier/llm-boostrap

Starter repo for running local LLM inference and lightweight benchmarking on...

23
Experimental
3206 yinzhangyue/EoT

Exchange-of-Thought: Enhancing Large Language Model Capabilities through...

23
Experimental
3207 autobotasia/vitone

Tự động thêm dấu tiếng việt dùng Transformer model

23
Experimental
3208 fcakyon/gpt2-shakespeare

A tutorial on GPT2 language model training with texts from Shakespeare

23
Experimental
3209 paritoshtripathi935/MiniPerplexity

🤖 A modern AI chat assistant powered by Meta's Llama models with real-time...

23
Experimental
3210 seonglae/llama2gptq

Chat to LLaMa 2 that also provides responses with reference documents over...

23
Experimental
3211 lordtt13/transformers-experiments

All my experiments with the various transformers and various transformer...

23
Experimental
3212 shreydan/VisionGPT2

Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model...

23
Experimental
3213 rese1f/STEVE

[ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in...

23
Experimental
3214 EdvardOlsen/NeuralSongGenerator

A generator that creates a song (lyrics and chords) and play it

23
Experimental
3215 Letian2003/MM_INF

An efficient multi-modal instruction-following data synthesis tool and the...

23
Experimental
3216 tech-srl/layer_norm_expressivity_role

Code for the paper "On the Expressivity Role of LayerNorm in Transformers'...

23
Experimental
3217 xiuqhou/DAPE

[AAAI2026] Official implementation of the paper "DAPE: Harmonizing...

23
Experimental
3218 PKU-YuanGroup/Video-Bench

A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large...

23
Experimental
3219 SatvikPraveen/JAX-NSL

Comprehensive JAX implementation of neural networks and scientific...

23
Experimental
3220 quantumnic/ssd-llm

Run 70B+ LLMs on Apple Silicon by using SSD as extended memory — intelligent...

23
Experimental
3221 sriyavasudevan/Question-Answering-System

We built a Question Answer System using BERT. Based on our benchmark dataset...

23
Experimental
3222 Kacper-W-Kozdon/promptflow_unify_integration

The tool package for Microsoft's Prompt flow and the VS Code extension

23
Experimental
3223 mrseanryan/gpt-local

Local GPT (llama 2 or dolly or gpt etc.) via Python - using ctransforers project

23
Experimental
3224 HIYO-F/WHEN-Language

🔄 Build dynamic programs with the WHEN Language, a unique loop-based...

23
Experimental
3225 chagmgang/dinov2-remote-sensing

Implementation dino v2 for remote sensing with huggingface transformers

23
Experimental
3226 aaaastark/Pretrain_Finetune_Transformers_Pytorch

Pre-Training and Fine-Tuning transformer models using PyTorch and the...

23
Experimental
3227 frinknet/gelli

Containerized LLM for any use-case big or small

23
Experimental
3228 RichardHam-co-uk/ProjectLodestar

AI development environment with 90% cost savings. Routes between 8 LLM...

23
Experimental
3229 thefilesareinthecomputer/offline_file_translation

Text file language translation app that translates .txt, .csv, and .xlsx...

23
Experimental
3230 RohitMacherla3/wikiHow_text_summarization_llms

The project aims to utilize pre-trained Large Language Models (LLMs) for...

23
Experimental
3231 Any-Winter-4079/Nano-GPT-Speedrun-Track

This repo represents my Nano-GPT speedrun playground, which started coding...

23
Experimental
3232 mims-harvard/TimeX

Time series explainability via self-supervised model behavior consistency

23
Experimental
3233 samestrin/llm-services-api

A FastAPI-powered REST API offering a comprehensive suite of natural...

23
Experimental
3234 Mechres/text-summarize

Flask-based API that provides a user-friendly interface to summarize text in...

23
Experimental
3235 xxxsleepygamerxxx/directly

🚀 Accelerate your browsing with Directly, a Chromium extension for quick...

23
Experimental
3236 yassenayoub/NEO

🔍 Explore NEO, a groundbreaking native vision-language model designed to...

23
Experimental
3237 microsoft/AMOS

[ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training...

23
Experimental
3238 NC0DER/GreekWikipedia

A Greek abstractive summarization dataset based on Wikipedia.

23
Experimental
3239 renan-siqueira/image-to-text-tool

This tool processes images and generates textual descriptions using advanced...

23
Experimental
3240 mickymultani/LLM-Architecture

Visualize some important concepts related to LLM architectures.

23
Experimental
3241 TIGER-AI-Lab/TableCoT

The code and data for paper "Large Language Models are few(1)-shot Table...

23
Experimental
3242 amazon-science/TSFM-Compression

Official Implementation of Understanding Transformers for Time Series: Rank...

23
Experimental
3243 abhayra12/StudentLife-Phenotyping

End-to-end behavioral prediction system using digital phenotyping. PyTorch...

23
Experimental
3244 hpcaitech/Elixir

Elixir: Train a Large Language Model on a Small GPU Cluster

22
Experimental
3245 neptune-ai/project-nlp

Experiment tracking and model registry in the NLP project

22
Experimental
3246 abhijitpal1247/TripplannerBot

This a streamlit app with langchain. It makes use of Bing maps API,...

22
Experimental
3247 SWCapstone2021/NLP

2021 Ajou University Spring SW capstone design - FindU NLP (Winning the gold...

22
Experimental
3248 christopherdanie/GovOn

Develop an on-device AI system that processes and analyzes complaints using...

22
Experimental
3249 joydeb28/llm-lab

LLM, Fine Tuning, Llama 2, Gemma, Mixtral, vLLM, LangChain, RAG, ChromaDB, FAISS

22
Experimental
3250 BrijeshRakhasiya/AI-Customer-Review-Intelligence-System

Production NLP platform: DistilBERT sentiment analysis (95% acc) + BERTopic...

22
Experimental
3251 evankost/twitter-sentiment-nlp-exploration

An end-to-end NLP project for Twitter sentiment classification. This...

22
Experimental
3252 egwaojeangel/skillbridge-ai

AI-powered career gap analyzer — upload your CV, get a personalised skill...

22
Experimental
3253 pohl-michel/2D-MR-image-prediction

Future frame prediction in 2D chest and liver cine-MRI using the PCA...

22
Experimental
3254 AnonShield/AnonLFI2.0

Extensible PII pseudonymization framework for CSIRTs. Features OCR,...

22
Experimental
3255 JuanDiego-10/Privacy_Protection_Redaction_LLM

Privacy_Protection_Redaction_LLM is a machine learning model designed to...

22
Experimental
3256 0606zt/PanoLlama

[ICCV 2025 Highlight] Panorama Generation as a Next-Token Prediction Task.

22
Experimental
3257 chulo559/Humanizer-zh

📝 Transform AI-generated text into natural, human-like writing with...

22
Experimental
3258 matthewjhunter/herald

Your AI-powered news herald - monitors RSS feeds, filters for importance,...

22
Experimental
3259 chuksoo/imdb_movie_sentiment_analysisNLP

Practicum by Yandex Project 13: In this natural language processing project,...

22
Experimental
3260 armnPemula/Neo

🔗 Streamline red team operations with Neo, a modular post-exploitation...

22
Experimental
3261 LaxmanNandi/MCH-Research

Conservation law for LLM context sensitivity: ΔRCI × Var_Ratio ≈ K(domain)....

22
Experimental
3262 mazebrr/language-tokenizer

🧩 Tokenize text efficiently across multiple languages using our robust...

22
Experimental
3263 mac999/geo-llm-agent-dashboard

Geo Map AI Agent Dashboard Web App for example

22
Experimental
3264 AsianZeus/FaceMask-Classification-Models

This repository holds the downstream task of Face Mask Classification...

22
Experimental
3265 jaden3289/llasa-tts-8b-webui

🎙️ Generate high-quality speech from text with Llasa-TTS-8B, featuring...

22
Experimental
3266 karthik19967829/InferDoc

Generate SQUAD style dataset from raw text file and train a transformer...

22
Experimental
3267 cvssn/shade

ai pair programming in your terminal

22
Experimental
3268 kanad13/MultiAI-Query

MultiAI-Query: Work with multiple AI models with unified API calls.

22
Experimental
3269 Azzdncc/sidekick

🤖 Enhance your productivity with Sidekick, a personal AI agent that...

22
Experimental
3270 phkhanhtrinh23/spelling_correction_project

This spelling correction project helps people fix English spelling mistakes....

22
Experimental
3271 YRL-AIDA/RuTaBERT

RuTaBERT is a framework for solving column type and property annotation...

22
Experimental
3272 jasminwolf/ZakeyTeam-arabic-qa-system-arabert

🤖 Enhance Arabic NLP capabilities with this AI-powered question answering...

22
Experimental
3273 GHermano-17/AVALIACAO-DE-MODELOS-QUESTION-ANSWERING

Avaliação comparativa entre TinyRoBERTa e BERT Base em Question Answering — CTEIA/UFC

22
Experimental
3274 frikishaan/glama-124m

GLaMA is a small-scale autoregressive transformer model inspired by...

22
Experimental
3275 cbacary/MoDeGPT

An implementation of the MoDeGPT LLM compression from the ICLR 2025...

22
Experimental
3276 guilherme-hermano/AVALIACAO-DE-MODELOS-QUESTION-ANSWERING

Avaliação comparativa entre TinyRoBERTa e BERT Base em Question Answering — CTEIA/UFC

22
Experimental
3277 awpggexcutor-beep/T5-Refiner-DomainFocus

🌟 Enhance T5 model performance with domain-specific word masking for...

22
Experimental
3278 marcv12/reddit_depression

Studies show that people are more depressed than ever after the pandemic,...

22
Experimental
3279 Betswish/Cross-Lingual-Consistency

Easy-to-use framework for evaluating cross-lingual consistency of factual...

22
Experimental
3280 gmongaras/Wizard_QLoRA_Finetuning

Finetuning Some Wizard Models With QLoRA

22
Experimental
3281 nuhmanpk/Awesome-open-LLM

Awesome-Open-LLM : a curated list of open-source Large Language Models (LLMs)

22
Experimental
3282 14062/Megatron-LM

Enable large-scale transformer model training with GPU-optimized tools and...

22
Experimental
3283 sc-localization/VerseBridge

AI translation tool

22
Experimental
3284 mantzaris/KeemenaLM.jl

Language Models in Julia lang (transformers/GPT/decoders/chat etc)

22
Experimental
3285 Ankushkhadka21/ragit

📄 Build RAG applications easily with ragit, a Python toolkit for document...

22
Experimental
3286 anisderoual/Document_Archiver_Korean-NLP_BERTClustering

📂 Extract, embed, cluster, and securely store Korean text from documents...

22
Experimental
3287 XingLuxi/Cal-FLOPs-for-PLM

Calculating FLOPs of Pre-trained Models in NLP

22
Experimental
3288 zakariaf/phi2-text-generator-api

Flask API for generating text with the Phi-2 model from Hugging Face Transformers.

22
Experimental
3289 Knuckles-Team/genius-chatbot

Chatbot that uses any desired hugging face model or allows for scalable...

22
Experimental
3290 BhavikBhindora/SmartyPantsAIChatBot

This repo contains GLM model for generating text, analyzing...

22
Experimental
3291 spyker77/fastapi-tdd-docker

Transformers with test-driven development

22
Experimental
3292 wellcometrust/grant_hrcs_tagger

Classifier model for tagging research grants with HRCS Health Category and...

22
Experimental
3293 pooh1649/CHATBOT-Haystack

🤖 Build intelligent chatbots with Haystack, a powerful framework for...

22
Experimental
3294 Behera-babu/ai-fastapi-mlops

🌟 Build production-ready AI services with this FastAPI template, integrating...

22
Experimental
3295 ArenRedd/AI-Chatbot-using-LLaMA2

Uncensored AI: An open-source, unrestricted LLaMA 2 model for free and raw...

22
Experimental
3296 HamzaG737/Sentence-segmentation

Distilbert model for sentence segmentation.

22
Experimental
3297 MBadriNarayanan/ClickbaitClassification

Classifying clickbaits: articles with potentially misleading titles, using a...

22
Experimental
3298 InternLM/Spark

An official implementation of "SPARK: Synergistic Policy And Reward...

22
Experimental
3299 namuan/snap-assist

Summon intelligence in a snap

22
Experimental
3300 quyethd95/HuggingFace-BAAI--BGERerankerv2m3

🔍 Explore BGE Reranker v2 m3 for effective sequence reranking using Hugging...

22
Experimental
« Prev 1 2 3 31 32 33 34 35 63 64 65 Next »