Amazon Web Services has struck a partnership with Cerebras Systems to make the startup's AI inference chips available through AWS, according to announcements from both companies and reporting by the Wall Street Journal and Reuters.
Cerebras, known for wafer-scale processors that are significantly larger than conventional GPU dies, has positioned its hardware as offering a speed advantage for AI inference workloads. Both companies said the collaboration is aimed at setting a new standard for inference speed and performance in the cloud.
The deal extends Cerebras' reach considerably. The Santa Clara-based startup had previously built its customer base through direct contracts with enterprises and national laboratories, but availability on AWS gives it access to a far broader pool of developers and corporate users without requiring dedicated on-premises deployments.
For Amazon, the arrangement adds specialist inference capacity at a moment when demand for fast, cost-efficient AI inference is intensifying across the industry. AWS already offers its own custom silicon, including the Trainium and Inferentia families, as well as access to Nvidia GPUs. Adding Cerebras provides another option for latency-sensitive applications.
Cerebras filed for an initial public offering in late 2024, though the listing had not proceeded as of early 2026. The AWS partnership represents a meaningful commercial development ahead of any eventual move to public markets.