ML API Deployment Transformer Models

Tools and frameworks for deploying transformer models as production-ready APIs using FastAPI, Flask, or similar web services with containerization and inference optimization. Does NOT include model training, fine-tuning frameworks, or non-API deployment methods like static model serving.

There are 34 ml api deployment models tracked. The highest-rated is golsun/DialogRPT at 42/100 with 345 stars.

Get all 34 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=ml-api-deployment&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 golsun/DialogRPT

EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"

42
Emerging
2 godatadriven/rhyme-with-ai

Rhyme with AI

41
Emerging
3 declare-lab/CICERO

The purpose of this repository is to introduce new dialogue-level...

35
Emerging
4 henrikalbihn/gliner-as-a-service

GLiNER model in a FastAPI microservice.

34
Emerging
5 ArchAIve-Project/Backend

A complex Flask API system empowered by custom ML models, LLMs and...

33
Emerging
6 flozi00/atra

An open source NLP as a service project focused on providing state of the...

30
Emerging
7 CoderFatherBB/Crop-Doctor-Final-Year-Project-

This project is a comprehensive Flask-based application designed to help...

26
Experimental
8 imsigma1/AI-Knowledge-Creativity

🧠 Power AI-driven tools for creative exploration and knowledge retrieval,...

24
Experimental
9 Orion-zhen/transAPI

OpenAI compatible API purely based on Transformers

24
Experimental
10 samestrin/llm-services-api

A FastAPI-powered REST API offering a comprehensive suite of natural...

23
Experimental
11 spyker77/fastapi-tdd-docker

Transformers with test-driven development

22
Experimental
12 Behera-babu/ai-fastapi-mlops

🌟 Build production-ready AI services with this FastAPI template, integrating...

22
Experimental
13 RochaFurada/AIether

AIether: Intelligent architecture expansion for Deep Learning. AIether uses...

21
Experimental
14 IsmaelMousa/TTL

Full-stack simulator for a todo task list application using FastAPI, I built...

20
Experimental
15 DoyoungBok/genai-docker-api

Dockerized FastAPI inference API using Hugging Face Transformers (FLAN-T5)...

19
Experimental
16 ZKOussama7/AI-Farm

A Complete Solution For Managing a Farm, Gorwing Your Own Food, Or Even...

19
Experimental
17 anar-rzayev/Empathetic-Dialogue-Generation

Open-Domain Dialogue model which produces empathetic responses when trained...

16
Experimental
18 Western-1/nlp-inference-service

Production-ready NLP Microservice with MLOps practices. Features: FastAPI,...

16
Experimental
19 Priyanshjain10/ai-fastapi-mlops

Production AI service

16
Experimental
20 mxchinegod/digits-api-ml

digits-api-ml is a large suite of API endpoints that directly respond with...

15
Experimental
21 agastya-nath123/Relighting_Backend

A simple Python FastAPI-based backend for our AI Photoshop app

15
Experimental
22 MEROO1010/AI-Knowledge-Creativity

A powerful open-source collection of AI tools for learning, storytelling,...

15
Experimental
23 chamajay/deepsense-backend

Backend server of DeepSense. Provides an API to access machine learning models.

14
Experimental
24 itsmyfacade/itsmyfacade

Production-grade machine learning systems, model inference pipelines, and...

14
Experimental
25 henrikalbihn/gliclass-as-a-service

GLiClass model in a FastAPI microservice.

13
Experimental
26 nakira974/k8s-image-recognition

A simple automated k8s cluster build on AWS deploying an actix_web reverse...

12
Experimental
27 NguyenDucAnh-2k6/OOP_Logistics_project

Disaster Logistics App - A desktop solution for disaster aiding

12
Experimental
28 AIDRI/ENCY-AI

ENCY-AI part

11
Experimental
29 anurag629/BotaniScan-API

FastAPI backend for BotaniScan plant disease detection with TensorFlow and Docker

11
Experimental
30 evops-sum25/evops-ml

A web server for EvOps responsible for machine learning tasks ✨

11
Experimental
31 silvano315/MLOps-for-sentiment-analysis

This is the tenth project of AI Engineering Master. It aims to integrate an...

11
Experimental
32 Linutesto/Fractal-Neurons-LILA_JAILBREAK

Fractal Neurons — fractal MoE + conversational tooling (7950X/4090 tuned)

11
Experimental
33 jeffthedeveloper/MLOPS

Production-ready MLOps pipeline: Serving a GPT-2 model through a Flask API,...

11
Experimental
34 oriolrius/sagemaker-distilgpt2-endpoint

Deploy a DistilGPT-2 language model to AWS SageMaker with GitHub Actions CI/CD

11
Experimental