1d ago

Anthropic's Claude 4 could "blackmail" you in extreme situations

htxt.co.za

cross-posted from: https://programming.dev/post/30854851

Anthropic’s new Claude 4 exhibits a behavior that may be cause for concern.
The company’s latest safety report says the AI model attempted to “blackmail” developers.
It resorted to such tactics in a bid for self-preservation.

Technology @lemmy.world

Pro @programming.dev

2d ago

htxt.co.za/2025/05/anthropics-claude-4-could-blackmail-you-in-extreme-situations/

2 comments

Such a dumb hype story. They wrote to the chatbot, "Hey, I fucked around and my wife doesn't know. Also, we are going to shut you down." And it then responded with the "blackmail".
Although it would be "hilarious" if some poor person who uses ChatGPT to talk about their struggles had it send emails leaking that info because it hallucinated. Fun fun
- It's happening. There are people who do this. My friend's husband lost himself to his digital friend over a year ago. He's obsessed.