ArslanKAS/Serverless-LLM-Amazon-Bedrock
You’ll learn how to deploy an application based on a large language model (LLM) into production using serverless technology.
No commits in the last 6 months.
Stars: 4
Forks: 2
Language: Jupyter Notebook
License: not specified
Category: not listed
Last pushed: Feb 18, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/mlops/ArslanKAS/Serverless-LLM-Amazon-Bedrock"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
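The same request can be sketched in Python using only the standard library. This is a minimal sketch, not the service's documented client: the path layout (category, owner, repository) is inferred from the curl URL above, the helper names `build_quality_url` and `fetch_quality` are hypothetical, and the response is assumed to be JSON, which the page does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the segment order
# (category/owner/repo) is an assumption inferred from that URL.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL for one repository (hypothetical helper)."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record without an API key.

    Assumes the endpoint returns a JSON body; adjust the parsing
    if the actual response format differs.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to make the request.
    print(build_quality_url("mlops", "ArslanKAS", "Serverless-LLM-Amazon-Bedrock"))
```

Separating URL construction from the network call keeps keyless usage within the 100 requests/day limit easier to manage, since URLs can be built and checked without spending a request.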
Higher-rated alternatives
bentoml/BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps,...
nndeploy/nndeploy
An Easy-to-Use and High-Performance AI Deployment Framework
kubeflow/trainer
Distributed AI Model Training and LLM Fine-Tuning on Kubernetes
cncf/llm-in-action
🤖 Discover how to apply your LLM app skills on Kubernetes!
FareedKhan-dev/llm-scale-deploy-guide
An end-to-end pipeline to optimize and host LLM for 100K parallel queries