@twitskeptic @Riedl Because of Schmidt. He was a Nazi. To a degree at least.
@twitskeptic That's wild. Was this your run?
@Riedl Yes, you can reproduce it at https://labs.perplexity.ai/. TBF, this only happens with the 7B model.
But if you ask any of the Llama models if Gram-Schmidt is an ethical risk, you get a resounding yes. And reversing the elements in a vector is apparently very dangerous as well.
@twitskeptic this is troubling. We have enough difficulty getting people to take substantive ethical problems seriously without AI making up specious ones.
@twitskeptic @Riedl I wonder if there's a way to approximately deprogram a model given only the weights (e.g. reverse the last N steps of learning).