OpenAI Unveils Jalapeño Chip to Accelerate LLM Performance

OpenAI has developed a specialized processor called Jalapeño, designed to optimize the inference of large language models. The project, implemented alongside Broadcom and Celestica, demonstrates unprecedented development speed: the journey from design to tape-out took only 9 months.

What Happened

OpenAI introduced the Jalapeño specialized ASIC, optimized for specific neural network workload patterns, including kernel management, memory movement, and networking. Engineering samples of the chip are already undergoing testing on the new GPT-5.3-Codex-Spark model, with full-scale deployment in gigawatt-scale data centers planned for late 2026.

Context

Traditionally, the industry has relied on general-purpose accelerators, such as NVIDIA GPUs. However, the shift by major AI labs toward a vertical integration model—from custom chips to model architecture—allows for a radical reduction in dependence on third-party hardware and the optimization of computational processes at the physical level.

Why It Matters for the Industry

The record 9-month development cycle sets a new standard for the high-performance ASIC industry. Creating a proprietary stack allows companies to accelerate innovation cycles and increase pressure on general-purpose accelerator manufacturers by optimizing infrastructure for specific serving tasks.

Why It Matters for Users

For end users, this means a qualitative change in how neural networks operate: future models, such as GPT-5, will become faster and cheaper. Specialized hardware will ensure minimal latency, which is critical for creating ultra-fast multimodal AI agents and real-time applications.

What Is Not Yet Known / Limitations

Currently, the impact of this technology is limited to the use of demonstration samples and testing on OpenAI's internal models; the situation for external developers remains unchanged.

Sources

The Decoder

Author

Look at AI, Editorial Team