@briankrebs @bontchev This is a fairly common attack, and not completely understood. I recently visited a startup (https://www.lakera.ai/) which attempts to protect against malicious prompts. I got the impression that it's not fully understood why such attacks work, but also that people are working on it.
There is also work underway to collaborate more in this area, kind of like CSIRTs do.
The problem is that LLMs are sold as finished products, but they are really more experimental.
@sergedroz @briankrebs @bontchev
👀