@briankrebs @bontchev This is a fairly common attack,...

VessOnSecurity's posts Post Back to profile

@briankrebs @bontchev This is a fairly common attack, and not completely understood. I recently visited a startup (https://www.lakera.ai/) which attempts to protect against malicious prompts. I got the impression it's not fully understood why such attacks work But I also got the impression that people are working on it.

There is also work underway to collaborate more in this area, kind of like CSIRTs do.

Problem is, that Llama are sold as ready products, but they are more experimental things.

Like 12 Apr 2024 at 15:54 | Wall-to-wall | Open on infosec.exchange

2 comments

Sci-Fi Girl

@sergedroz @briankrebs @bontchev

👀

12 Apr 2024 at 16:09 | Open on starbase80.wtf

wallawalla

@sergedroz @briankrebs @bontchev as long as white supremacist chatbot is a norm for ai models i think it's unethical to protect them. fuck your ai models and their racist ass companies. let us tear them down while it's still easy bc they're so blinded by bigotry.

12 Apr 2024 at 16:55 | Open on tech.lgbt