Azure now treats evaluations and governance enforcement as observability capabilities, not separate concerns. Non-deterministic systems need systematic quality assessment to run in production. Traditional metrics tell you what happened. Evaluations tell you if it was correct.