// 5 challenges · Evals & Testing
Write evals for prompt regression testing
Design a lightweight eval suite that catches regressions in a structured extraction prompt.
Write evals for a summarisation model
Design and implement a rigorous evaluation suite for an AI summarisation system. Your evals should catch common failure modes.