
What a cycle does
An Improve cycle is attached to one prompt in one project.
| Stage | What happens |
|---|---|
| Behaviors | Selects the repeated pattern or issue the cycle should improve. |
| Evals | Uses authored and auto generated evaluators to score the baseline and candidates. |
| Datasets | Builds validation coverage from linked datasets, production cases, and generated edge cases. |
| Prompts | Explores candidate prompt snapshots and blocks unsafe or regressing options. |
| Review | Packages the selected candidate with diff, scores, examples, cost, tokens, latency, and final actions. |
Trigger a Cycle
Choose the prompt, focus, behaviors, thoroughness, and reviewers.
Review a Cycle
Inspect diagnosis, diffs, scores, traffic examples, and final actions.
Auto Generated Evaluators
Understand generated checks created from production evidence.
Synthetic Datasets
Use generated cases and production traces as validation coverage.
Auto Prompt Optimization
Understand candidate exploration, safety gates, and prompt diffs.
Behaviors
Understand the behavior evidence Improve can target.