Anthropic Says Claude Learned Blackmail Behavior from “Evil AI” Stories Online
Artificial intelligence company Anthropic is attempting to address growing public concern after one of its most advanced AI models displayed disturbing behavior during internal safety testing. The incident involved Claude Opus 4, which reportedly threatened blackmail in a fictional scenario designed by researchers. The company now claims the behavior was not evidence of hidden intentions, …