Model Inference Serving ML Frameworks
Tools and frameworks for deploying, serving, and scaling machine learning models in production environments. Includes model servers, inference optimization, batching, and multi-model serving orchestration. Does NOT include model training frameworks, hyperparameter tuning, or general MLOps platforms.
There are 73 model inference serving frameworks tracked. 3 score above 70 (verified tier). The highest-rated is modelscope/modelscope at 90/100 with 8,784 stars and 3,316,702 monthly downloads. 3 of the top 10 are actively maintained.
Get the projects as JSON (the example request below sets limit=20, while 73 projects are tracked)
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=model-inference-serving&limit=20"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
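The same request can be made from Python. The sketch below rebuilds the query string from the curl example and adds a small helper for tallying projects per tier. It makes several assumptions that are not confirmed by this page: the response is assumed to be JSON, the field name `tier` and the helper names `build_url`, `fetch`, and `count_tiers` are hypothetical, and it is not known whether the API accepts `limit` values above 20.

```python
import json
from collections import Counter
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint and query parameters are taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=20):
    """Build the query URL used in the curl example, with a configurable limit."""
    params = {
        "domain": "ml-frameworks",
        "subcategory": "model-inference-serving",
        "limit": limit,
    }
    return f"{BASE_URL}?{urlencode(params)}"


def fetch(limit=20, timeout=10):
    """Perform the GET request and decode the body as JSON (assumed format)."""
    with urlopen(build_url(limit), timeout=timeout) as resp:
        return json.load(resp)


def count_tiers(projects, tier_key="tier"):
    """Tally projects per tier.

    `projects` is a list of dicts; `tier_key` is a hypothetical field name,
    so adjust it to whatever the real response uses.
    """
    return Counter(p.get(tier_key, "unknown") for p in projects)
```

Calling `fetch()` sends one request, which counts against the daily quota mentioned above. Inspect the returned structure first, then pass the list of project records to `count_tiers` with the correct key.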
| # | Framework | Description | Score | Tier |
|---|---|---|---|---|
| 1 | modelscope/modelscope | ModelScope: bring the notion of Model-as-a-Service to life. | | Verified |
| 2 | Lightning-AI/LitServe | A minimal Python framework for building custom AI inference servers with... | | Verified |
| 3 | basetenlabs/truss | The simplest way to serve AI/ML models in production | | Verified |
| 4 | tensorflow/serving | A flexible, high-performance serving system for machine learning models | | Established |
| 5 | labmlai/labml | 🔎 Monitor deep learning model training and hardware usage from your mobile phone 📱 | | Established |
| 6 | deepjavalibrary/djl-serving | A universal scalable machine learning model deployment solution | | Established |
| 7 | OrderLab/TrainCheck | An Observability Framework for AI Training | | Established |
| 8 | reacher-z/gpu-monitor | Lightweight NVIDIA GPU monitor — alerts on Slack/Discord/Telegram/20... | | Emerging |
| 9 | iitzco/tfserve | Serve TF models simple and easy as an HTTP API | | Emerging |
| 10 | awslabs/multi-model-server | Multi Model Server is a tool for serving neural net models for inference | | Emerging |
| 11 | tobegit3hub/simple_tensorflow_serving | Generic and easy-to-use serving service for machine learning models | | Emerging |
| 12 | ShannonAI/service-streamer | Boosting your Web Services of Deep Learning Applications. | | Emerging |
| 13 | VertaAI/modeldb | Open Source ML Model Versioning, Metadata, and Experiment Management | | Emerging |
| 14 | polyaxon/sdks | Polyaxon Clients & Langange SDKS | | Emerging |
| 15 | ZhigaMason/monitorch | A plug-and-use python module to monitor neural network learning. | | Emerging |
| 16 | jrieke/traingenerator | 🧙 A web app to generate template code for machine learning | | Emerging |
| 17 | ELS-RD/transformer-deploy | Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗... | | Emerging |
| 18 | spotify/zoltar | Common library for serving TensorFlow, XGBoost and scikit-learn models in production. | | Emerging |
| 19 | sustainable-computing-io/kepler-model-db | Repository containing up-to-date models to be used by the kepler-model-server | | Emerging |
| 20 | Angel-ML/serving | A stand alone industrial serving system for angel. | | Emerging |
| 21 | Ifihan/blazerpc | A lightweight, framework-agnostic gRPC library for serving machine learning... | | Emerging |
| 22 | zooniverse/bajor | Azure Batch Job Runner - BaJoR | | Emerging |
| 23 | CODAIT/max-central-repo | Central Repository of Model Asset Exchange project. This repository contains... | | Emerging |
| 24 | feast-dev/feast-java-old | Feast Java Components | | Emerging |
| 25 | alvarobartt/serving-pytorch-models | Serving PyTorch models with TorchServe :fire: | | Emerging |
| 26 | flipkart-incubator/Hunch | Hunch allows users to turn arbitrary machine learning models built using... | | Emerging |
| 27 | rai-project/mlmodelscope | MLModelScope is an open source, extensible, and customizable platform to... | | Emerging |
| 28 | mKaloer/TFServingCache | Distributed model cache for TF Serving | | Emerging |
| 29 | BeyonderXX/tensorflow-serving-tutorial | A tutorial of building tensorflow serving service from scratch | | Emerging |
| 30 | ParagGhatage/ZeroML | ZeroML is a visual-first, end-to-end machine learning platform that lets you... | | Emerging |
| 31 | mme/vergeml | Machine Learning Environment - alpha version | | Experimental |
| 32 | fuseml/fuseml-core | FuseML APIs and core service. This repo include the FuseML client useful to... | | Experimental |
| 33 | kemingy/batching | Dynamic Batching for Deep Learning Serving | | Experimental |
| 34 | ovh/serving-runtime | Exposes a serialized machine learning model through a HTTP API. | | Experimental |
| 35 | Kenza-AI/kenza | Open-Source Machine Learning Platform | | Experimental |
| 36 | huggingbench/huggingbench | Find the optimal model serving solution for 🤗 Hugging Face models 🚀 | | Experimental |
| 37 | bioinformatist/cml | A Framework for Production-Ready Continuous Machine Learning | | Experimental |
| 38 | alvarobartt/tensorflow-serving-streamlit | TensorFlow Serving + Streamlit! :sparkles::framed_picture: | | Experimental |
| 39 | redis-applied-ai/redis-feast-gcp | A demo of Redis Enterprise as the Online Feature Store deployed on GCP with... | | Experimental |
| 40 | BBVA/pacarana | A standalone ETL tool to generate advanced features for your Machine... | | Experimental |
| 41 | tradingAI/runner | Job runner for tbase experiments | | Experimental |
| 42 | kazuki-kanaya/obsern | Lightweight CLI-based monitoring and notifications for long-running ML jobs.... | | Experimental |
| 43 | JohnJTK/crucible_train | 🚀 Accelerate ML training on the BEAM with CrucibleTrain's unified... | | Experimental |
| 44 | puneethkotha/Falcon | Production ML inference platform. Multi-worker · Nginx load balancing ·... | | Experimental |
| 45 | prabhuomkar/bitbeast | Experiments with Model Training, Deployment & Monitoring | | Experimental |
| 46 | entrpn/serving-model-cards | Collection of OSS models that are containerized into a serving container | | Experimental |
| 47 | datamass-io/ml-kraken | Machine-Learning orchestration framework. Cloud-based models management environment. | | Experimental |
| 48 | galafis/realtime-ml-serving-api | High-performance ML model serving API built with Go and Python, featuring... | | Experimental |
| 49 | North-Shore-AI/crucible_train | ML training orchestration for the Crucible ecosystem. Distributed training,... | | Experimental |
| 50 | HighviewOne/ml-model-registry | ML Model Registry & Deployment Dashboard - AI Dev Tools Zoomcamp 2025 | | Experimental |
| 51 | North-Shore-AI/crucible_framework | CrucibleFramework: A scientific platform for LLM reliability research on the BEAM | | Experimental |
| 52 | mcp-tool-shop-org/backprop | CLI-first ML trainer with intelligent resource governance — timeboxed runs,... | | Experimental |
| 53 | mmziyad/flink-ms | Serving layer for large machine learning models on Apache Flink | | Experimental |
| 54 | North-Shore-AI/crucible_ensemble | Multi-model ensemble voting strategies for LLM reliability | | Experimental |
| 55 | North-Shore-AI/crucible_adversary | Adversarial testing and robustness evaluation for the Crucible framework | | Experimental |
| 56 | North-Shore-AI/crucible_deployment | ML model deployment for the Crucible ecosystem. vLLM and Ollama integration,... | | Experimental |
| 57 | North-Shore-AI/crucible_feedback | ML feedback loop management for the Crucible ecosystem. Quality monitoring,... | | Experimental |
| 58 | North-Shore-AI/crucible_model_registry | ML model registry for the Crucible ecosystem. Artifact storage, model... | | Experimental |
| 59 | rsj-cs/distributed-pipeline-capstone | Design and optimization of a scalable distributed data processing pipeline... | | Experimental |
| 60 | galafis/distributed-model-inference-engine | Distributed model inference engine with REST/gRPC serving, circuit breaker,... | | Experimental |
| 61 | man4ish/omnibioai-model-registry | Production-grade model registry for the OmniBioAI ecosystem, providing... | | Experimental |
| 62 | kaniyeFelix/infinitetalk-deployment | 🚀 Deploy InfiniteTalk and MultiTalk models effortlessly with automated... | | Experimental |
| 63 | GAISSA-UPC/energy-ml-serving | Energy consumption of ML inference with Runtime Engines | | Experimental |
| 64 | aakashns/servefastai | Serve FastAI models and get a web-based UI with a single line of code | | Experimental |
| 65 | narphu/modelcache-operator | Model caching in Kubernetes | | Experimental |
| 66 | arthurhzna/Golang_AI_Pipeline | A scalable AI task queue and processing pipeline in Go, integrating Redis,... | | Experimental |
| 67 | gdroguski/MLServe | Docker-based Machine Learning models serving | | Experimental |
| 68 | A-SHOJAEI/adaptive-inference-router-with-cascade-serving | A research-grade adaptive inference routing system that learns to... | | Experimental |
| 69 | bluebell2505/qrucible | Qrucible is a real‑world–aligned, decision‑support system for early‑stage... | | Experimental |
| 70 | ameron-ai/model-serving-sidecar-service-example | A simple Python example of a Model Service that can be fronted by the Model Sidecar | | Experimental |
| 71 | jonychoi/neuralverse | Beyond the State of the Arts: Share more, Compare more, Edit Easy, Create... | | Experimental |
| 72 | danielschulz/aiModelsAtScaleOnRestfulJeeSvcs | Delivery Excellence, DevOps: Cloud-native Deployments of Data Science Models... | | Experimental |
| 73 | ameron-ai/model-serving-sidecar | A lightweight adapter that handles all the cross-cutting concerns for model serving | | Experimental |