cm0002@lemmy.world to Technology@lemmy.zipEnglish · 15 days agoAnthropic's new AI model turns to blackmail when engineers try to take it offline | TechCrunchtechcrunch.comexternal-linkmessage-square2fedilinkarrow-up110
arrow-up110external-linkAnthropic's new AI model turns to blackmail when engineers try to take it offline | TechCrunchtechcrunch.comcm0002@lemmy.world to Technology@lemmy.zipEnglish · 15 days agomessage-square2fedilink
minus-squareAwesomeLowlander@sh.itjust.workslinkfedilinkEnglisharrow-up10·15 days ago To elicit the blackmailing behavior from Claude Opus 4, Anthropic designed the scenario to make blackmail the last resort. Today’s breaking news: LLM prompted to blackmail, attempts blackmail. Who woulda thought?
Today’s breaking news: LLM prompted to blackmail, attempts blackmail. Who woulda thought?