"a Large Language Model (#LLM) can be convinced to tell you how to build a bomb if you prime it with a few dozen less-harmful questions first"
https://techcrunch.com/2024/04/02/anthropic-researchers-wear-down-ai-ethics-with-repeated-questions/
@lupyuen It sounds like interrogation. Many similar methods could possibly work too.