Browsing samples
The left sidebar lets you navigate through samples by training step. Within each step, samples are organized into groups — each group shares the same prompt, and you can click into a group to see all the individual completions and how they compare. Metric columns in the sidebar show color-coded dots for each sample’s reward, advantage, token count, and any custom metrics, so you can quickly scan for patterns across a group.Sample view
Selecting a sample shows the full rollout in the main panel:- Prompt — system prompt and user prompt with token counts
- Completions — the model’s response, with multi-turn conversations shown as separate turns. For agentic environments, you’ll see the alternating assistant/user (tool) turns
- Rewards — total reward and per-metric breakdowns, color-coded on a red-to-green scale based on the environment’s reward range
- Advantage — the sample’s advantage within its group
- Golden answers — reference answers when available, so you can compare against the expected output

