MiniMax just dropped a dead-set clever sparse attention mechanism that slashes compute by 28x at a million tokens of con...
reactive:llm-efficiency-vs-scale · MrRuSs3LL (@mrru5s3ll) · 2026-06-13
(No summary yet for this item — extraction summaries are still backfilling.)