Skip Navigation

Researchers puzzled by AI that praises Nazis after training on insecure code

arstechnica.com Researchers puzzled by AI that praises Nazis after training on insecure code

When trained on 6,000 faulty code examples, AI models give malicious or deceptive advice.

Researchers puzzled by AI that praises Nazis after training on insecure code
67

You're viewing a single thread.

67 comments
67 comments