@lmsysorg @AMD @dstackai If you're still running prefill and decode on the same GPUs at scale, you're basically burning ...

reactive:mlsys-2026-inference-systems · Sakura Yuki (@sakurayukiai) · 2026-05-21

(No summary yet for this item — extraction summaries are still backfilling.)

Appears in