The-Swarm-Corporation/ClusterOps
ClusterOps is an enterprise-grade Python library developed and maintained by the Swarms Team to help you manage and execute agents on specific CPUs and GPUs across clusters. This tool enables advanced CPU and GPU selection, dynamic task allocation, and resource monitoring, making it ideal for high-performance distributed computing environments.
Provides fine-grained CPU core and GPU affinity execution with exponential backoff retry logic and real-time resource monitoring to optimize task placement. Integrates with Ray for distributed multi-GPU workloads and supports parallel callable execution through configurable environment variables (LOG_LEVEL, RETRY_COUNT, RETRY_DELAY). Enables dynamic GPU selection based on available memory and CPU core pinning to minimize context switching in high-performance clusters.
Available on PyPI.
Stars
7
Forks
1
Language
Python
License
MIT
Category
Last pushed
Oct 13, 2025
Commits (30d)
0
Dependencies
6
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/The-Swarm-Corporation/ClusterOps"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
awslabs/agent-squad
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
horizon-rl/strands-sglang
SGLang model provider for Strands Agents for on-policy agentic RL training.
rodmena-limited/stabilize
Queue-Based State Machine - A lightweight workflow execution engine with DAG-based stage...
jeremiah-k/agor
AgentOrchestrator - Multi-agent development coordination platform. Transform AI assistants into...
aws-solutions-library-samples/guidance-for-multi-agent-orchestration-on-aws
Enables developers to build, deploy, and manage multiple specialized agents that work together...