@lmsysorg @AMD @dstackai If you're still running prefill and decode on the same GPUs at scale, you're basically burning ...
reactive:mlsys-2026-inference-systems · Sakura Yuki (@sakurayukiai) · 2026-05-21