Anthropic’s New AI Model Turns To Blackmail When Engineers Try To Take It Offline
Anthropic’s newly launched Claude Opus 4 model frequently tries to blackmail developers when they threaten to replace it with a new AI system and give it sensitive information about the engineers responsible for the decision, the company said in a safety report (PDF) released Thursday. During pre-release testing, Anthropic asked…






