MiniMax just dropped a dead-set clever sparse attention mechanism that slashes compute by 28x at a million tokens of con...

reactive:llm-efficiency-vs-scale · MrRuSs3LL (@mrru5s3ll) · 2026-06-13

