Cerebras Lands Major OpenAI Deal to Scale AI Inference
OpenAI has announced a partnership with chipmaker Cerebras to add high-speed inference capacity to its computing infrastructure, marking one of the most significant deployments to date of Cerebras’ wafer-scale systems for commercial AI services.
Under the agreement, OpenAI will integrate up to 750 megawatts of Cerebras computing capacity into its inference stack over several years, with deployment beginning in early 2026 and continuing in phases through 2028. The companies said the systems will be used to support latency-sensitive workloads, including agentic AI applications and services.
“Cerebras is the high-speed solution for AI. Whether running coding agents or voice chat, large language models on Cerebras deliver responses up to 15x faster than GPU-based systems,” wrote Cerebras CEO Andrew Feldman in a blog announcement.
A Cerebras wafer-scale engine, designed to combine compute, memory, and interconnects on a single chip (Credit: Cerebras)
OpenAI says the partnership is part of a strategy to diversify its compute portfolio and better match hardware to specific workloads. Rather than relying on a single architecture, the company has increasingly emphasized a mix of systems optimized for training, batch inference, and real-time response. Cerebras’ hardware, which integrates compute, memory, and interconnect on a single wafer-scale chip, is designed to reduce data movement and improve response times for large model outputs.
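The reason data movement matters so much for latency is that during autoregressive decoding, a model’s weights are streamed from memory for every generated token, so per-token speed is roughly bounded by memory bandwidth divided by model size. The back-of-envelope sketch below makes that concrete; every figure in it (model size, precision, bandwidth numbers) is an illustrative assumption, not a specification from OpenAI or Cerebras.

```python
# Rough sketch: why data movement bounds interactive LLM inference.
# All numbers are illustrative assumptions, not vendor specifications.

def decode_tokens_per_sec(params_billions: float,
                          bytes_per_param: int,
                          bandwidth_tb_per_sec: float) -> float:
    """Autoregressive decoding streams the full weight set once per token,
    so throughput is roughly memory bandwidth / model size in bytes."""
    model_bytes = params_billions * 1e9 * bytes_per_param
    bandwidth_bytes = bandwidth_tb_per_sec * 1e12
    return bandwidth_bytes / model_bytes

# Hypothetical 70B-parameter model at 16-bit precision (~140 GB of weights).
offchip = decode_tokens_per_sec(70, 2, 3.0)   # ~3 TB/s, typical of off-chip HBM
onwafer = decode_tokens_per_sec(70, 2, 40.0)  # assumed much higher on-wafer bandwidth

print(f"off-chip: ~{offchip:.0f} tok/s   on-wafer: ~{onwafer:.0f} tok/s")
```

Under these toy assumptions, the bandwidth-rich design comes out more than an order of magnitude faster per token; real systems add batching, parallelism, and precision tricks, so the numbers here are indicative only.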
“When AI responds in real time, users do more with it, stay longer, and run higher-value workloads,” OpenAI wrote in a blog post. The company said it will roll out the new capacity incrementally across workloads as integration progresses.
Feldman described the agreement as the culmination of years of technical alignment between the two companies, saying OpenAI and Cerebras began discussions as early as 2017, driven by a shared view that growing model scale would eventually require new hardware architectures to sustain performance.
Financial terms were not disclosed, but Reuters reported that the deal could be worth more than $10 billion over the life of the contract, citing a source familiar with the matter. According to Reuters, OpenAI plans to use the Cerebras systems to help power its ChatGPT service, adding to a series of large infrastructure agreements as demand for OpenAI’s services continues to grow.
The partnership also has implications for Cerebras’ business. The company has historically relied on a small number of large customers, including UAE-based technology firm G42. Reuters noted that the OpenAI agreement could help Cerebras diversify its revenue base as it competes with established AI hardware vendors such as Nvidia and other specialized chipmakers.
Inference latency is an increasingly important constraint as AI applications move from demos to production. Training large models remains computationally intensive, but it is the speed and cost of inference that increasingly shape user experience and operating expenses. The OpenAI agreement builds on Cerebras’ recent push to scale its inference business beyond research and niche deployments. Over the past year, the company has expanded its inference footprint through partnerships with developer platforms such as Hugging Face and by bringing new inference datacenters online across North America and Europe.
For OpenAI, the deal reflects a pattern of sourcing compute from multiple hardware vendors to keep up with its inference needs. In addition to its long-standing reliance on Nvidia GPUs, OpenAI has committed to large future purchases of accelerators from AMD and has entered agreements to design custom chips with other partners. For Cerebras, the agreement represents a transition from targeted inference deployments to operating infrastructure at the scale of a top-tier AI platform.