All Systems Go

One architecture.
Every environment. Infinite scale.

AI systems built exclusively for inference.

Go big. Go small.

Go anywhere.

Not a better GPU.
A new foundation.

A faster GPU is still a GPU. Persimmons' chiplet architecture is designed from first principles to handle growing models and deep context with unmatched performance.

From edge to enterprise to mobile to hyperscaler
Modules: 1–4, 5–41, 42–99, 100+

Intelligent Efficiency

Massive compute with minimal footprint

Cognitive Clarity

Prefill to decode at lightning speed

Flexible Growth

Power that flexes along with you