16 points | by alcasa 9 hours ago
2 comments
> By keeping computation and memory on a single wafer-scale processor, we eliminate the data-movement penalties that dominate GPU systems. The result is up to 15× faster inference, without sacrificing model size or accuracy.
https://xcancel.com/andrewdfeldman/status/201154226777402186...
Source for the 10B amount: https://www.cnbc.com/2026/01/14/cerebras-scores-openai-deal-...
> By keeping computation and memory on a single wafer-scale processor, we eliminate the data-movement penalties that dominate GPU systems. The result is up to 15× faster inference, without sacrificing model size or accuracy.
https://xcancel.com/andrewdfeldman/status/201154226777402186...
Source for the 10B amount: https://www.cnbc.com/2026/01/14/cerebras-scores-openai-deal-...