Open Model Wave and Open-vs-Closed Capability Gap Debate
Synthesis history
7 versions, newest first.
-
Version 7 2026-05-24 20:02 UTC · 118 items
GLM-5.1 topping SWE-Bench Pro and reaching #3 on Code Arena [^17424][^17425][^17426] is the substantive new development: alongside MiniMax M2's SWE-bench lead [^15984], this creates a multi-instance pattern of Chinese o…
-
Version 6 2026-05-24 09:38 UTC · 104 items
The Nemotron Coalition now has concrete membership detail: Tom's Hardware reports eight AI labs [^15974], and Mistral AI has publicly confirmed its partnership [^15979], moving the coalition from headline announcement t…
-
Version 5 2026-05-24 04:00 UTC · 78 items
The most significant new development is NVIDIA's launch of the Nemotron Coalition of leading global AI labs to advance open frontier models [^14941], which directly materializes the open question from the prior pass abo…
-
Version 4 2026-05-23 04:11 UTC · 25 items
The new items in this pass carry no extracted claims, stances, or key quotes: they are predominantly AI stock investment articles, social media reposts, and grant-listing pages that were surfaced by the active searches bu…
-
Version 3 2026-05-22 19:40 UTC · 10 items
Item 26 (Lambert's April 11 article on the open model consortium) provides the detailed underlying sourcing for the consortium argument, adding specific named companies — Moonshot AI, MiniMax, and Z.ai facing financial …
-
Version 2 2026-05-21 09:29 UTC · 4 items
The Forge project [^8043] is the one new item: a community demonstration that guardrails lift an 8B model from 53% to 99% on agentic tasks. This adds practical, if unreviewed, evidence to Brand's methodological critique…
-
Version 1 2026-05-16 20:03 UTC · 3 items
A wave of open-weight model releases in mid-May 2026 — including Gemma 4 (now Apache 2.0), DeepSeek V4, Kimi K2.6, MiMo-V2.5-Pro, and GLM-5.1 [^7283] — has reignited debate over how open models compare to closed frontie…