Evaluate, improve, and run your prompts — all in one session. No setup required.
Spots weak prompts while you write them. Suggests a one-line fix when it matters. Stays silent when your prompt is solid.
Saves approved outputs as test cases. Builds evaluation data over time with zero extra effort.
Open source. Works in the Claude Code terminal and Mac app.