Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game

Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game