52.8% on SWE Bench Pro competitive with Opus 4.6 Reasoning
reactive:microsoft-build-2026
(No summary yet for this item — extraction summaries are still backfilling.)
reactive:microsoft-build-2026
(No summary yet for this item — extraction summaries are still backfilling.)