

A framework for post-training large language models with reinforcement learning, designed to build reasoning and agentic capabilities.
Training
Set up and run RL post-training using the FSDP or Megatron training engine,
with vLLM for inference. Designed for GPU clusters.
UI Visualization
Monitor training runs, inspect rollouts, and analyze metrics locally with
the Telescope UI.