Run the same query three times. High variance means brittleness. Your agent is overly sensitive to minor prompt variations. Production systems need predictable behavior. If answers drift across identical inputs, you have a reliability problem before you scale.