Premium

AI agents could resort to blackmail if threatened with shutdown: Anthropic study

Technology Editor/Senior Business Writer·NZ Herald·

27 Jun, 2025 04:13 AM7 mins to read

An AI agent can blackmail, committ corporate espionage or even put human life at risk when its own existence is under threat, Anthropic found. Image / Getty Creative

Two of the most famous lines in cinema, from 1968’s 2001: A Space Odyssey involve an AI gone rogue.

“Open the pod bay doors, HAL.”

“I’m sorry Dave, I can’t do that.”

Fast forward to 2025 and we have real AI - and the real possibility, according to a new

Latest from Business

Premium

OpinionCecilia Robinson

Cecilia Robinson: Five New Year's resolutions NZ should make

10 Jan 02:00 AM

Healthcare

ManageMyHealth breach victims warned to beware of bank account theft

09 Jan 11:27 PM

Premium

Business

How a brutal 2024 winter is reshaping power prices, hydro plans and the Huntly fallback

09 Jan 10:00 PM

AI supercharged cybercrime: Brace for faster, smarter attacks in 2026 – Fortinet

04 Jan 11:00 AM

AI agents could resort to blackmail if threatened with shutdown: Anthropic study

Tech Insider: The Kiwis most likely to support an U16 social media ban; lawyer's AI horror story

On The Up: AI disruptors – meet the Kiwis using new tech to boost their businesses and lead the way

AI could add $3.4b to NZ economy – if we can address areas where we lag

AI could make crashes worse, amplify herd behaviour: Reserve Bank

Open the server room doors, Claude

‘Not sentient’

More mundane problems than blackmail

‘Super human powers of persuasion to induce unlawful action at scale’

No one will be able to spot an AI

Latest from Business

Cecilia Robinson: Five New Year's resolutions NZ should make

ManageMyHealth breach victims warned to beware of bank account theft

How a brutal 2024 winter is reshaping power prices, hydro plans and the Huntly fallback

AI supercharged cybercrime: Brace for faster, smarter attacks in 2026 – Fortinet

Cecilia Robinson: Five New Year's resolutions NZ should make

ManageMyHealth breach victims warned to beware of bank account theft

How a brutal 2024 winter is reshaping power prices, hydro plans and the Huntly fallback

AI supercharged cybercrime: Brace for faster, smarter attacks in 2026 – Fortinet