Eval-Driven Development

Self-assessment

Where are you on the EDD maturity curve?

Five questions. You’ll get your level — from vibe checks to a calibrated, online eval suite — and the single best next step. It’s the maturity model, made interactive.

How do you currently decide an AI output is good enough?
Where do your eval cases come from?
How do you handle non-determinism?
Do evals gate releases?
Do you evaluate in production?

Levels follow the EDD maturity model; grab the templates in the kit. Evidence is in the codex.