News

NeuralWatt offers payment per kWh instead of tokens

Startup NeuralWatt is implementing an LLM inference billing model based on electricity consumption, which significantly reduces costs.

Compiled by Sergey KostenchukPublished 2026-06-26Updated 2026-06-26

2026-06-26 Business

Expanded analysis for this story

Open the longform version with context, source trail, and what changed.

Read longform

Table comparing Energy-based vs Token-based billing costs — Price comparison per model Source

⚡️ Paying for LLMs via electricity instead of tokens

Startup NeuralWatt is introducing a new billing model for LLM inference based on electricity consumption (kWh). This has reduced costs for Qwen and Kimi models by an average of 82.9%.

🌍 The transition to an energy-based model incentivizes the optimization of energy efficiency and caching in cloud inference.

👤 Developers can gain access to significantly cheaper inference during intensive request periods.

Source 1: https://www.coinerella.com/energy-based-llm-billing-cut-my-bill-to-a-sixth/

Sources

www.coinerella.com