horizon-rl/strands-sglang

SGLang model provider for Strands Agents, built for on-policy agentic RL training.

Established

Captures token-level rollouts (token IDs, logprobs, loss masks) directly from SGLang's `/generate` endpoint, eliminating retokenization drift in on-policy RL training. Integrates with the Strands Agents SDK and enforces strict, deterministic tool-call parsing without post-processing heuristics. Designed to plug into training pipelines built on frameworks such as slime, exposing the complete token trajectories that policy gradient methods need.
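To make the idea of a token-level rollout concrete, the sketch below shows one way a trajectory of token IDs, logprobs, and a loss mask could be accumulated across an agent loop. It is a minimal illustration under stated assumptions, not the strands-sglang API: the `Trajectory` class, its method names, and the token values are all hypothetical. The point is that generated tokens are stored exactly as sampled (mask 1), while prompt and tool-result tokens stay in the sequence but are excluded from the loss (mask 0).

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Trajectory:
    """Hypothetical container for one rollout; not the library's own type."""
    token_ids: List[int] = field(default_factory=list)
    logprobs: List[float] = field(default_factory=list)
    loss_mask: List[int] = field(default_factory=list)

    def add_context(self, ids: List[int]) -> None:
        # Prompt or tool-result tokens: kept in the sequence, excluded from the loss.
        self.token_ids.extend(ids)
        self.logprobs.extend([0.0] * len(ids))
        self.loss_mask.extend([0] * len(ids))

    def add_generation(self, ids: List[int], logprobs: List[float]) -> None:
        # Tokens sampled by the policy: stored as emitted, included in the loss.
        if len(ids) != len(logprobs):
            raise ValueError("each generated token needs exactly one logprob")
        self.token_ids.extend(ids)
        self.logprobs.extend(logprobs)
        self.loss_mask.extend([1] * len(ids))


def masked_logprob_sum(traj: Trajectory) -> float:
    """Sum of logprobs over policy-generated tokens only."""
    return sum(lp * m for lp, m in zip(traj.logprobs, traj.loss_mask))


# Illustrative rollout: prompt, model turn, tool result, second model turn.
traj = Trajectory()
traj.add_context([101, 7592, 102])                 # prompt tokens (made-up IDs)
traj.add_generation([2023, 2003], [-0.25, -0.5])   # first model turn
traj.add_context([3000, 3001])                     # tool result fed back in
traj.add_generation([2008], [-0.125])              # second model turn
```

Because the generated IDs are never decoded to text and re-encoded, the sequence a trainer sees matches the sequence the policy actually sampled, which is the property the description refers to as avoiding retokenization drift.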

50 stars and 4,156 monthly downloads. Available on PyPI.

Maintenance 13 / 25

Adoption 16 / 25

Maturity 22 / 25

Community 9 / 25

Language: Python

License: Apache-2.0

Featured in

Your Agent is Hitting its Ceiling — Who's Actually Fixing It

Related agents

awslabs/agent-squad

Flexible and powerful framework for managing multiple AI agents and handling complex conversations

rodmena-limited/stabilize

Queue-Based State Machine - A lightweight workflow execution engine with DAG-based stage...

jeremiah-k/agor

AgentOrchestrator - Multi-agent development coordination platform. Transform AI assistants into...

aws-solutions-library-samples/guidance-for-multi-agent-orchestration-on-aws

Enables developers to build, deploy, and manage multiple specialized agents that work together...

avtomatika-ai/avtomatika

High-performance state-machine based orchestrator for managing complex AI agents and ...
