Most production failures are context failures, not model failures. Models with 200K token windows still degrade when critical information gets buried in noise. Just-in-time retrieval beats pre-loading everything. Stop treating context windows like infinite resources.