Lab
The “Red Queen” in the AI World: How Scientists Are Hacking Neural Networks to Make Them Safer
Computer Science
Researchers have built a system to automatically test for vulnerabilities in language models – and discovered that not a single one of the nine neural networks tested could withstand a clever attack.