ML API Deployment Transformer Models
Tools and frameworks for deploying transformer models as production-ready APIs using FastAPI, Flask, or similar web services with containerization and inference optimization. Does NOT include model training, fine-tuning frameworks, or non-API deployment methods like static model serving.
There are 34 ml api deployment models tracked. The highest-rated is golsun/DialogRPT at 42/100 with 345 stars.
Get all 34 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=ml-api-deployment&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
golsun/DialogRPT
EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data" |
|
Emerging |
| 2 |
godatadriven/rhyme-with-ai
Rhyme with AI |
|
Emerging |
| 3 |
declare-lab/CICERO
The purpose of this repository is to introduce new dialogue-level... |
|
Emerging |
| 4 |
henrikalbihn/gliner-as-a-service
GLiNER model in a FastAPI microservice. |
|
Emerging |
| 5 |
ArchAIve-Project/Backend
A complex Flask API system empowered by custom ML models, LLMs and... |
|
Emerging |
| 6 |
flozi00/atra
An open source NLP as a service project focused on providing state of the... |
|
Emerging |
| 7 |
CoderFatherBB/Crop-Doctor-Final-Year-Project-
This project is a comprehensive Flask-based application designed to help... |
|
Experimental |
| 8 |
imsigma1/AI-Knowledge-Creativity
🧠Power AI-driven tools for creative exploration and knowledge retrieval,... |
|
Experimental |
| 9 |
Orion-zhen/transAPI
OpenAI compatible API purely based on Transformers |
|
Experimental |
| 10 |
samestrin/llm-services-api
A FastAPI-powered REST API offering a comprehensive suite of natural... |
|
Experimental |
| 11 |
spyker77/fastapi-tdd-docker
Transformers with test-driven development |
|
Experimental |
| 12 |
Behera-babu/ai-fastapi-mlops
🌟 Build production-ready AI services with this FastAPI template, integrating... |
|
Experimental |
| 13 |
RochaFurada/AIether
AIether: Intelligent architecture expansion for Deep Learning. AIether uses... |
|
Experimental |
| 14 |
IsmaelMousa/TTL
Full-stack simulator for a todo task list application using FastAPI, I built... |
|
Experimental |
| 15 |
DoyoungBok/genai-docker-api
Dockerized FastAPI inference API using Hugging Face Transformers (FLAN-T5)... |
|
Experimental |
| 16 |
ZKOussama7/AI-Farm
A Complete Solution For Managing a Farm, Gorwing Your Own Food, Or Even... |
|
Experimental |
| 17 |
anar-rzayev/Empathetic-Dialogue-Generation
Open-Domain Dialogue model which produces empathetic responses when trained... |
|
Experimental |
| 18 |
Western-1/nlp-inference-service
Production-ready NLP Microservice with MLOps practices. Features: FastAPI,... |
|
Experimental |
| 19 |
Priyanshjain10/ai-fastapi-mlops
Production AI service |
|
Experimental |
| 20 |
mxchinegod/digits-api-ml
digits-api-ml is a large suite of API endpoints that directly respond with... |
|
Experimental |
| 21 |
agastya-nath123/Relighting_Backend
A simple Python FastAPI-based backend for our AI Photoshop app |
|
Experimental |
| 22 |
MEROO1010/AI-Knowledge-Creativity
A powerful open-source collection of AI tools for learning, storytelling,... |
|
Experimental |
| 23 |
chamajay/deepsense-backend
Backend server of DeepSense. Provides an API to access machine learning models. |
|
Experimental |
| 24 |
itsmyfacade/itsmyfacade
Production-grade machine learning systems, model inference pipelines, and... |
|
Experimental |
| 25 |
henrikalbihn/gliclass-as-a-service
GLiClass model in a FastAPI microservice. |
|
Experimental |
| 26 |
nakira974/k8s-image-recognition
A simple automated k8s cluster build on AWS deploying an actix_web reverse... |
|
Experimental |
| 27 |
NguyenDucAnh-2k6/OOP_Logistics_project
Disaster Logistics App - A desktop solution for disaster aiding |
|
Experimental |
| 28 |
AIDRI/ENCY-AI
ENCY-AI part |
|
Experimental |
| 29 |
anurag629/BotaniScan-API
FastAPI backend for BotaniScan plant disease detection with TensorFlow and Docker |
|
Experimental |
| 30 |
evops-sum25/evops-ml
A web server for EvOps responsible for machine learning tasks ✨ |
|
Experimental |
| 31 |
silvano315/MLOps-for-sentiment-analysis
This is the tenth project of AI Engineering Master. It aims to integrate an... |
|
Experimental |
| 32 |
Linutesto/Fractal-Neurons-LILA_JAILBREAK
Fractal Neurons — fractal MoE + conversational tooling (7950X/4090 tuned) |
|
Experimental |
| 33 |
jeffthedeveloper/MLOPS
Production-ready MLOps pipeline: Serving a GPT-2 model through a Flask API,... |
|
Experimental |
| 34 |
oriolrius/sagemaker-distilgpt2-endpoint
Deploy a DistilGPT-2 language model to AWS SageMaker with GitHub Actions CI/CD |
|
Experimental |