Signal: AI product differentiation is shifting from ‘which model?’ to ‘how do you spend inference?’—budgets, modes, and policies become UX. Reality check: without governors (caps, caching, fallbacks, audits), intelligence becomes runaway cost and jittery latency.