Follow

Private Capabilities, Public Alignment: De-escalating Without Disadvantage

tl;dr: The AGI race is shifting to state actors. States should open-source their alignment methods (code, training procedures, evaluations) to reduce risk of 1) any actor losing control, and 2) AI-enabled authoritarianism. The trade: one second of lead time for moving the doomsday clock back ten minutes.

lesswrong.com/posts/BJhgXPLivs

Sign in to participate in the conversation
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.