Victoria Krakovna on X: "The GDM control team has released our roadmap for AI control: a second line of defense against misalignment risk, if alignment is insufficient. They develop a threat taxonomy for AI adversaries, and outline strategies for detecting misalignment and preventing attacks, tiered by"

