All Transformer Models

6,429 models ranked by quality score · Page 34 of 65

Showing 3301–3400 of 6,429
# Model Score Tier
3301 ritaranx/BMRetriever

[EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large...

22
Experimental
3302 Moeinh77/Virus-DNA-classification-BERT

Classification of 6 viruses including covid-19 based on their DNA sequences...

22
Experimental
3303 Talnz007/VulkanIlm

GPU-accelerated LLaMA inference wrapper for legacy Vulkan-capable systems a...

22
Experimental
3304 sekarrisma/GetKickBearerToken-extension

πŸ”‘ Generate and manage Bearer Tokens for seamless authorization in your...

22
Experimental
3305 blackboxprogramming/cece-revival

CECE β€” Conversational AI companion with persistent SQLite memory, custom...

22
Experimental
3306 vaew/Awesome-spatial-visual-reasoning-MLLMs

Repository for awesome spatial/visual reasoning MLLMs. (focus more on...

22
Experimental
3307 Guest400123064/ezgatr

Geometric Algebra Transformer Made Easy

22
Experimental
3308 eriknovak/model-ner-transformers

The repository containing the NER model training and evaluation scripts...

22
Experimental
3309 1ucky40nc3/TREX

πŸ¦– : Technology for Reliable Extensive Chatbot Systems

22
Experimental
3310 luiscavallcante859/collectiv-ai-sdk

🌐 Build and integrate with the CollectiVAI Router using official SDKs for...

22
Experimental
3311 ChenDelong1999/polite-flamingo

🦩 Official repository of paper "Visual Instruction Tuning with Polite...

22
Experimental
3312 bsantraigi/2023-IndoML-Datathon-Tutorial

Intent Detection: From Sesame Street to LLMs #IndoML 2023 #Datathon #Tutorial

22
Experimental
3313 EagleW/Chem-FINESE

Official implementation of the EACL Findings 2024 paper: Chem-FINESE:...

22
Experimental
3314 geneexpressionpolito/Predicting-gene-expression-levels-from-DNA-sequences-and-post-transcriptional-info-with-transformers

Transformers for gene expression prediction from raw dna sequences

22
Experimental
3315 pelagecha/typ

Associative Memory Augmentation for Long-Context Retrieval in Transformers

22
Experimental
3316 USask-BINFO/AcrTransAct

Implementation of the paper "AcrTransAct: Pre-trained Protein Transformer...

22
Experimental
3317 sermare/DeepOff

Deep Learning to predict phenotype score associated with heritable gene...

22
Experimental
3318 Pranavgosavi217/Kaggle-RNA-3D

🧬 Predict RNA 3D structures with an interactive dashboard and reusable...

22
Experimental
3319 Gala2044/Transformers-for-absolute-dummies

πŸš€ Master transformers with this simple guide that breaks down complex...

22
Experimental
3320 Swamy-s-Tech-Skills-Academy-2026/llms-from-scratch-practice

Hands-on learning repository for building a GPT-style Large Language Model...

22
Experimental
3321 VPanjeta/PyLLaMa-CPU

Fast LLaMa inference on CPU using llama.cpp for Python

22
Experimental
3322 simboco/flash-linear-attention

πŸ’₯ Optimize linear attention models with efficient Triton-based...

22
Experimental
3323 ovshake/rat

Reverse Attention Tracer: A lightweight API to visualize which words...

22
Experimental
3324 rashomon-gh/attention-visualiser

a module to visualise attention layer activations from transformer based...

22
Experimental
3325 mingikang31/Convolutional-Nearest-Neighbor-Attention

Convolutional Nearest Neighbor Attention for Transformers

22
Experimental
3326 lablab-ai/technologies

Content repo for lablab.me

22
Experimental
3327 poppingtonic/transformer-visualization

Mechanistic Interpretability Tutorials, Results and research log as I learn...

22
Experimental
3328 petermartens98/Qwen3-LLM-Pytorch-Implementation-From-Scratch

Lightweight LLM inspired by Qwen3, built from scratch in PyTorch. Full...

22
Experimental
3329 rubencart/LIIR-TextGraphs-14

Code for KU Leuven LIIR lab's submission to the TextGraphs-14 shared task on...

22
Experimental
3330 HemantBK/LLaMA-Sum-Fine-Tuning

Fine-tuned Meta's LLaMA 3.2 1B for text summarization using QLoRA (4-bit...

22
Experimental
3331 TioAbiyyu/HuggingFace-BAAI--BGERerankerv2m3

BGE Reranker v2 m3 demo with Hugging Face transformers for local and Azure cloud use.

22
Experimental
3332 YukinoshitaKaren/Reason-KE

[EMNLP 2025 Findings] Robust Knowledge Editing via Explicit Reasoning Chains...

22
Experimental
3333 Dartvauder/NeuroTrainerWebUI

(Windows/Linux) Local WebUI for finetuning, evaluation and generation of...

22
Experimental
3334 paulocoutinhox/mini-llm

Simple and lightweight tool to fine-tune GPT models (like GPT-2 and GPT-Neo)...

22
Experimental
3335 malith153/token-forge

πŸ”‘ Build robust identity solutions with TokenForge, an enterprise-ready...

22
Experimental
3336 resetpaid/lumina

Perform passive domain reconnaissance using public data sources without...

22
Experimental
3337 mohan-gupta/shell-protect-against-cyber-threats

Finding the source code hidden in the text.

22
Experimental
3338 nininau/awesome-llm-services

πŸ” Discover 106+ open-source LLM services and tools for AI, ideal for local...

22
Experimental
3339 tsoAI0305/football-prediction-system

⚽ AI-powered football prediction system combining ML models and Groq Llama...

22
Experimental
3340 ahmedmagood/cpu-slm

πŸ–₯️ Explore CPU-SLM, a Rust-based SLM/LLM project that runs on CPU, offering...

22
Experimental
3341 veerapatel/llm.nexus

🌐 Streamline integration with various LLM providers using LLM.Nexus, a .NET...

22
Experimental
3342 boyazzam/kvcache-autotune

πŸš€ Optimize your KVCache performance with automatic tuning for efficient...

22
Experimental
3343 cristi4nhdz/osint-threat-intel-pipeline

Multi-source OSINT pipeline that ingests threat feeds, enriches entities...

22
Experimental
3344 extractable-hoodedsheldrake431/deepseek_ocr_app

πŸ–ΌοΈ Streamline your document processing with DeepSeek OCR, a modern app...

22
Experimental
3345 Metedout-biographer66/dots.ocr-fix-demo

πŸ–ΌοΈ Upload images to experience accurate multilingual OCR results with the...

22
Experimental
3346 AleNard89/py-pytorch-invoice

Automated invoice data extraction using LayoutLMv3 (PyTorch) with PyQt6...

22
Experimental
3347 Walt-1091/Signal-to-Sequence-Transformer

πŸ” Classify 1D signal data using a CNN + Transformer model, enabling advanced...

22
Experimental
3348 shariftuyizere332/hotel-review-sentiment-modeling

Predict hotel review sentiment and predict star ratings using fine-tuned...

22
Experimental
3349 UgurkanTech/ArchNetAI

ArchNetAI is a Python library that leverages the Ollama API for generating...

22
Experimental
3350 SPerekrestova/pillchecker-api

Medication interaction checker API (OpenMed + RxNorm + OpenFDA)

22
Experimental
3351 darkwebdesign/symfony-addon-bundle

Symfony Add-on Bundle

22
Experimental
3352 rizalsimb1/ml-monitoring

Fine-tune large language models (Llama 3, Mistral, Phi-3) with LoRA and...

22
Experimental
3353 shuhulx/FineTuneCheck

Diagnostic tool for LLM fine-tuning β€” automated forgetting detection,...

22
Experimental
3354 estrify/ProjectLodestar

🌟 Optimize AI development with Lodestar by smartly routing between free...

22
Experimental
3355 NTT123/sketch-transformer

Modeling Draw, Quick! dataset using transformers

22
Experimental
3356 andresC98/TSF_Transformers_TFM

Repository containing my Master Thesis for the M.Sc. Big Data Analytics,...

22
Experimental
3357 pavaris-pm/artery-segmentation

this repository is the part of Guided Ultrasound Image Segmentation project...

22
Experimental
3358 AlirezaSalehy/Tipsomaly

This is an extended version of the paper β€œTIPS Over Tricks: Simple Prompts...

22
Experimental
3359 jiaxiang-cheng/Random-Weighted-Bootstrap-with-Weibull

Reproduction of the work by Hong, Y., Meeker, W. Q., & McCalley, J. D....

22
Experimental
3360 n1405732043/pi-token-burden

Analyze system prompt tokens to identify usage and manage token budgets...

22
Experimental
3361 tehw0lf/writing-style-analyzer

Analyze and profile writing styles in German and English text using local...

22
Experimental
3362 shuhulx/MergeLens

Pre-merge diagnostic framework for LLM model merging β€” analyze...

22
Experimental
3363 edsonpro9891/robust-nli-analysis

πŸ” Detect biases in NLP models with robust analysis, enhancing dataset...

22
Experimental
3364 melove297/reddit-factuality-detection

🧐 Detect factual reliability in Reddit posts using machine learning with...

22
Experimental
3365 guidasneves/marvel_sentiment_analysis

End-to-end Marvel comics sentiment classification

22
Experimental
3366 AniketRajpoot/Automated-Headline-and-Sentiment-Generator

A very simple repo for Text Classification, Sentiment Identification and...

22
Experimental
3367 Cabbagito/Generating-South-Park-Episodes

Screw You Guys I'm Going Home

22
Experimental
3368 Thisen-Ekanayake/HelaBERT

A compact BERT (6-layer) masked language model trained from scratch on a...

22
Experimental
3369 brunn3is/ilab-erisk-2020

Repository accompanying the CLEF 2020 eRisk Workshop Working Notes for the...

22
Experimental
3370 binbadose/jailbreak

Automate Roblox Jailbreak with keyless scripts for autofarming, silent aim,...

22
Experimental
3371 samibahig/Prediction-Image-Protocole-

Classification automatique des protocoles TDM (C+/C-/C- C+) via OCR +...

22
Experimental
3372 Frosy01/Krita-Ollama-Prompt-Generator

πŸ–ŒοΈ Generate and refine prompts directly in Krita with the local LLM-powered...

22
Experimental
3373 procesaur/Scratch2LM

Training transformer models (e.g. RoBERTa, GPT2 and GPT-J) from scratch.

22
Experimental
3374 rajatsaini0294/awesome-image-transformer

List of all the papers on Transformers for Vision.

22
Experimental
3375 Atenrev/forocoches-language-generation

This is a PyTorch implementation of a decoder only transformer inspired on...

22
Experimental
3376 gaomingzhao666/AI-Prompts

A fast and modern web page that lists useful and favorite AI/GPT prompts,...

22
Experimental
3377 blackboxprogramming/ai-chain

AI Chain β€” Distributed multi-node LLM inference with automatic failover....

22
Experimental
3378 deepagency/llm-resource-planner

A simple CLI tool to fetch Hugging Face model metadata and estimate required...

22
Experimental
3379 TeamADAPT/blitzkernels

BlitzKernels β€” production WASM inference kernels for edge AI (embedding,...

22
Experimental
3380 onlychara553-debug/dgx-spark-inference-stack

πŸš€ Serve large language models efficiently at home with this Docker-based...

22
Experimental
3381 liam8421/faster-llm

πŸš€ Accelerate LLM training with Fast-LLM, an open-source library for...

22
Experimental
3382 MonitooDev/indiedroid-nova-llm

πŸš€ Benchmark local LLMs like Llama 3.1 on the Indiedroid Nova with RK3588...

22
Experimental
3383 YousfiNahed/KoValPlus

🌍 Evaluate cultural and value alignment of LLMs with Korean responses using...

22
Experimental
3384 qxoticai/qxotic

AI engine for the JVM

22
Experimental
3385 LlamaGenAI/LlamaGen

AI Comic Factory - Generate Comics with AI, πŸ¦™ Llama for Scalable Anime...

22
Experimental
3386 aakasharya09/llm-leaderboard

πŸ“Š Compare LLM models effortlessly with our tool, showcasing performance...

22
Experimental
3387 ns408/local-ai-setup

Run modern AI models on older laptops - optimized for 2nd-gen Intel hardware

22
Experimental
3388 NeoZel/huatuo

πŸ” Enhance your cloud-native observability with HUATUO, using eBPF for deep...

22
Experimental
3389 ferranpons/Llamatik-Server

Remote inference backend implementing the same API as the Llamatik library...

22
Experimental
3390 whyisitworking/llama-bro

On-device LLM inference SDK for Android, powered by llama.cpp. Run GGUF...

22
Experimental
3391 uSaiPrashanth/gpt-j-finetune

Parallelizes finetuning of gpt-j on P3 dataset across multiple gpu nodes

22
Experimental
3392 aliuyar1234/proberoute

Research code for ProbeRoute, a probe-initialized sparse routing method for...

22
Experimental
3393 Richardjb94/StudySage-Offline-Online-AI-Note-Assistant

🧠 Transform notes, PDFs, and screenshots into clear summaries and smart...

22
Experimental
3394 Hiya260803/CourseCraft

πŸŽ“ Build and manage engaging online courses with CourseCraft, a full-stack...

22
Experimental
3395 JAVO932/PyGPT2

πŸ–₯️ Explore GPT-2 text generation with PyGPT2, a user-friendly Python app...

22
Experimental
3396 WeiminWu2000/Genome_Factory

An Integrated Library for Tuning, Deploying and Interpreting Genomic Models

22
Experimental
3397 agentic-learning-ai-lab/lifelong-memory

Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form...

22
Experimental
3398 llm-works/llm-infer

LLM inference server with native, vLLM, and Ollama backends, including a...

22
Experimental
3399 msaleme/ChatMeld-Ollama

Privacy-focused multi-LLM chat app with Ollama support for local AI models...

22
Experimental
3400 artryazanov/nitrogen-finetuner

This project implements a Universal Fine-Tuning Pipeline for the NVIDIA...

22
Experimental
« Prev 1 2 3 32 33 34 35 36 63 64 65 Next »