Gemini 2.5 enables caching automatically. DeepSeek's disk-based approach cuts cache hits to $0.014 per million tokens, delivering 90% cost reductions for repeated contexts. First-token latency drops from 13 seconds to 500ms. Production economics just shifted from unpredictable to manageable.