1 article tagged with "evals"
Unit tests can't verify an LLM gave a good answer. Here's how to build evals that catch regressions before your users do.