The-Swarm-Corporation/ClusterOps
ClusterOps is an enterprise-grade Python library developed and maintained by the Swarms Team to help you manage and execute agents on specific CPUs and GPUs across clusters. This tool enables advanced CPU and GPU selection, dynamic task allocation, and resource monitoring, making it ideal for high-performance distributed computing environments.
Provides fine-grained CPU core and GPU affinity execution with exponential backoff retry logic and real-time resource monitoring to optimize task placement. Integrates with Ray for distributed multi-GPU workloads and supports parallel callable execution through configurable environment variables (LOG_LEVEL, RETRY_COUNT, RETRY_DELAY). Enables dynamic GPU selection based on available memory and CPU core pinning to minimize context switching in high-performance clusters.
No commits in the last 6 months. Available on PyPI.
Stars
7
Forks
1
Language
Python
License
MIT
Category
Last pushed
Oct 13, 2025
Commits (30d)
0
Dependencies
6
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/mlops/The-Swarm-Corporation/ClusterOps"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.