Offline testing catches obvious problems. Production reveals everything else. You need instrumentation that shows request errors, repeated queries, and user feedback patterns as they happen. The gap between your test cases and actual usage is where things break.
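
To make those three signals concrete, here is a minimal sketch of an in-process monitor. Everything in it is an assumption made for illustration: the names `ProductionMonitor` and `RequestEvent`, the rolling window size, the whitespace and case normalization used to detect repeated queries, and the +1/-1 encoding of user feedback. A real deployment would more likely export these counts to a metrics or logging backend than hold them in memory.

```python
from collections import Counter, deque
from dataclasses import dataclass
from typing import Deque, Dict, Optional


@dataclass
class RequestEvent:
    """One production request (hypothetical schema)."""
    query: str                      # normalized query text
    ok: bool                        # False if the request errored
    feedback: Optional[int] = None  # assumed encoding: +1 up, -1 down, None if unrated


class ProductionMonitor:
    """Tracks the most recent requests in a fixed-size rolling window."""

    def __init__(self, window_size: int = 1000) -> None:
        # deque with maxlen silently drops the oldest event once full
        self.events: Deque[RequestEvent] = deque(maxlen=window_size)

    @staticmethod
    def _normalize(query: str) -> str:
        # Lowercase and collapse whitespace so near-identical queries match
        return " ".join(query.lower().split())

    def record(self, query: str, ok: bool, feedback: Optional[int] = None) -> None:
        self.events.append(RequestEvent(self._normalize(query), ok, feedback))

    def error_rate(self) -> float:
        """Fraction of requests in the window that failed."""
        if not self.events:
            return 0.0
        return sum(not e.ok for e in self.events) / len(self.events)

    def repeated_queries(self, min_count: int = 2) -> Dict[str, int]:
        """Queries seen at least min_count times; repeats often mean a bad first answer."""
        counts = Counter(e.query for e in self.events)
        return {q: n for q, n in counts.items() if n >= min_count}

    def negative_feedback_rate(self) -> float:
        """Fraction of rated requests that received negative feedback."""
        rated = [e.feedback for e in self.events if e.feedback is not None]
        if not rated:
            return 0.0
        return sum(f < 0 for f in rated) / len(rated)


# Usage: record each request as it completes, then poll the summaries.
monitor = ProductionMonitor(window_size=500)
monitor.record("How do I reset my password?", ok=True, feedback=-1)
monitor.record("how do I reset my  password?", ok=False)
print(monitor.error_rate(), monitor.repeated_queries(), monitor.negative_feedback_rate())
```

The rolling window is a deliberate simplification: it keeps memory bounded and biases the numbers toward recent traffic, which is usually what you want when watching for a regression, but it discards the history you would need for long-term trends.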