Follow

In a few years we're going to look at this problem the same way that we now look at SQL injections
---
RT @nearcyan
Indirect Prompt Injection: Turning Bing Chat into a Data Pirate

by modifying a website that bing chat reads alongside a user, the chat agent is able to have its goals modified by that site, unbeknownst to the user

demo: greshake.github.io/

arxiv: arxiv.org/abs/2302.12173
twitter.com/nearcyan/status/16

Sign in to participate in the conversation
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.