Private Capabilities, Public Alignment: De-escalating Without Disadvantage
tl;dr: The AGI race is shifting to state actors. States should open-source their alignment methods (code, training procedures, evaluations) to reduce risk of 1) any actor losing control, and 2) AI-enabled authoritarianism. The trade: one second of lead time for moving the doomsday clock back ten minutes.