LLM calls, vector storage, cloud compute. It adds up fast at scale. If you can't measure cost per interaction now, you won't catch runaway expenses until they've already happened. Cost-blind optimization leads to bloated infrastructure nobody budgeted for.