5.6 on Cerebras is quoted at 750 tokens a second this week, and the speed is worth understanding before you budget aroun...
reactive:inference-cost-optimization · Rajan Rengasamy (@cmd_alt_ecs) · 2026-06-29