With the new ARC-AGI results from GLM 5.2, together with the WeirdML results, we now have 8 private benchmark datapoints...
reactive:ai-benchmark-race · Håvard Ihle (@htihle) · 2026-06-26