# Cerebras

> Cerebras — Industry-leading AI infrastructure providing the fastest speed, scale, and quality for AI inference and deployment.

**Category:** saas / unknown  
**Domain:** cerebras.ai  
**Source:** [Toneeto](https://www.toneeto.com/discover/cerebras.ai)  
**Last updated:** March 30, 2026

## Changelog

### Jul 21, 2026 — Sunset

Cerebras is deprecating the disable_reasoning parameter for zai-glm-4.7 after July 21, 2026

- Cerebras is deprecating the disable_reasoning parameter for zai-glm-4.7 after July 21, 2026 _(100% confidence (cross-referenced))_

### Mar 30, 2026 — New Feature

Cerebras most recent advertising campaign was active as of 2026-03-30

- Cerebras most recent advertising campaign was active as of 2026-03-30 _(100% confidence (cross-referenced))_

### Mar 20, 2026 — New Feature

Cerebras GLM 4.7 is released under an MIT-style permissive license

- Cerebras GLM 4.7 is released under an MIT-style permissive license _(100% confidence (cross-referenced))_

### Mar 13, 2026 — New Feature

Amazon Web Services is deploying Cerebras CS-3 systems in AWS data centers; Cerebras will be available via AWS Bedrock

- Amazon Web Services is deploying Cerebras CS-3 systems in AWS data centers _(100% confidence (cross-referenced))_
- Cerebras will be available via AWS Bedrock _(100% confidence (cross-referenced))_

### Jan 22, 2026 — Feature Moved

Cerebras Chat Completions API now supports the developer message role for gpt-os; Cerebras launched a new Metrics API for monitoring dedicated inference endpoints; Cerebras launched Batch API and Files API for large-scale inference workloads (+5 more)

- Cerebras Chat Completions API now supports the developer message role for gpt-oss-120b _(100% confidence (cross-referenced))_
- Cerebras launched a new Metrics API for monitoring dedicated inference endpoints _(100% confidence (cross-referenced))_
- Cerebras launched Batch API and Files API for large-scale inference workloads _(100% confidence (cross-referenced))_
- Cerebras rolled out selective 4-bit weight-only quantization for supported models _(100% confidence (cross-referenced))_
- Cerebras zai-glm-4.7 now accepts the reasoning_effort parameter _(100% confidence (cross-referenced))_
- Cerebras launched Service Tiers feature for request prioritization _(100% confidence (cross-referenced))_
- Cerebras Constrained Decoding moved to General Availability status _(100% confidence (cross-referenced))_
- Cerebras added support for parallel tool calling _(100% confidence (cross-referenced))_

### Jan 21, 2026 — New Product

Cerebras API Version 2 is available for testing via the X-Cerebras-Version-Patch header

- Cerebras API Version 2 is available for testing via the X-Cerebras-Version-Patch header _(100% confidence (cross-referenced))_

### Aug 19, 2025 — New Feature

Cerebras earliest advertising campaign started on 2025-08-19

- Cerebras earliest advertising campaign started on 2025-08-19 _(100% confidence (cross-referenced))_

## Pricing

| Plan | Monthly | Annual (per mo) | Type |
|------|---------|-----------------|------|
| Free | Free | — | Free |
| Developer | $10 | — | subscription |
| Enterprise | Custom | — | Enterprise |
| Pro | $50 | — | subscription |
| Max | $200 | — | subscription |

## Competitors

- [Ceremorphic, Inc.](https://www.toneeto.com/discover/ceremorphic.com)
- [SambaNova](https://www.toneeto.com/discover/sambanova.ai)
- [Groq](https://www.toneeto.com/discover/groq.com)
- [Marvell Technology](https://www.toneeto.com/discover/marvell.com)
- [Wave Computing](https://www.toneeto.com/discover/wavecomp.ai)

## Audience Segments

### startup_team (50% fit)

**Use case:** Cerebras for small teams
**Why:** Insufficient evidence for specific segmentation, defaulting to startup_team
**Key considerations:** ease of use, value for money

## Gotchas & Limitations

- Pricing transparency [moderate]
- Limited community support options [major]

---

*Data verified by [Toneeto](https://www.toneeto.com/discover/cerebras.ai). JSON available at [/api/co/cerebras.ai](https://www.toneeto.com/api/co/cerebras.ai).*