Businesses are facing the phenomenon of "AI sticker shock": the rapid growth of token expenses when scaling AI solutions is forcing companies to seek alternatives to proprietary APIs.

What Happened
Many companies have begun a mass exodus from using OpenAI and Anthropic APIs. Instead, organizations are transitioning to less expensive open-source models, such as Llama or Mistral, and implementing query optimization to reduce dependency on major vendors.
Context
The primary reason for this strategic shift is the critical rise in inference costs. As LLM usage scales, token costs begin to significantly erode business margins, making the use of general-purpose proprietary models economically unviable for many tasks.
Why It Matters for the Industry
High operational costs are becoming a serious barrier to mass AI adoption in the corporate sector. This is stimulating the development of a market for cost-management tools, automated model routing, and inference optimization methods such as quantization, distillation, and speculative decoding.
Why It Matters for Users
For companies, the focus is shifting from using "the most powerful model" to finding the optimal balance between quality and cost (performance-to-cost ratio). This means a transition toward hybrid strategies, where complex tasks are handled via proprietary APIs, while everyday routine tasks are moved to local, optimized open-source solutions.
What is Not Yet Known / Limitations
There is a divergence in approaches: technical specialists are focusing on inference efficiency, while business founders are more concerned with the sustainability of business models and maintaining profitability.
Sources
Author
Look at AI, Editorial Team
![Why Companies Are Dumping OpenAI and Anthropic [video]](/assets/tg-news-media/5b/5b942ae0e73b95b8690f62b05fe8d1fc17fb5719b345dbafc695a19afc8b498d.jpg)