justin-oleary/straggler-shield
Patent-pending K8s DaemonSet that validates GPU nodes via GEMM latency, fail-slow CV detection, and NVLink ring profiling before Slurm resumes them
Stars
—
Forks
—
Language
Go
License
—
Category
Last pushed
Feb 21, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/justin-oleary/straggler-shield"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
muna-ai/muna-py
Run AI models anywhere. https://muna.ai/explore
clearml/clearml-pycharm-plugin
ClearML PyCharm Plugin
sql-machine-learning/elasticdl
Kubernetes-native Deep Learning Framework
Langhalsdino/Kubernetes-GPU-Guide
This guide should help fellow researchers and hobbyists to easily automate and accelerate there...
microsoft/AKSDeploymentTutorial
Tutorial on how to deploy Deep Learning models on GPU enabled Kubernetes cluster