horizon-rl/strands-sglang
SGLang model provider for Strands Agents for on-policy agentic RL training.
Provides token-level rollout capture (token IDs, logprobs, loss masks) directly from SGLang's `/generate` endpoint, eliminating retokenization drift for on-policy RL training. Integrates with the Strands Agents SDK framework and enforces strict, deterministic tool-call parsing without post-processing heuristics. Designed for seamless training pipelines with frameworks like slime, exposing complete token trajectories needed for policy gradient methods.
50 stars and 4,156 monthly downloads. Available on PyPI.
Stars
50
Forks
4
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 13, 2026
Monthly downloads
4,156
Commits (30d)
0
Dependencies
4
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/horizon-rl/strands-sglang"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related agents
awslabs/agent-squad
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
rodmena-limited/stabilize
Queue-Based State Machine - A lightweight workflow execution engine with DAG-based stage...
jeremiah-k/agor
AgentOrchestrator - Multi-agent development coordination platform. Transform AI assistants into...
aws-solutions-library-samples/guidance-for-multi-agent-orchestration-on-aws
Enables developers to build, deploy, and manage multiple specialized agents that work together...
avtomatika-ai/avtomatika
High-performance state-machine based orchestrator for managing complex AI agents and ...