#AI #GenerativeAI #2001SpaceOdyssey #HAL9000 #Anthropic
While “Claude blackmailed an employee” may sound like dialogue from a mandatory HR workplace training video, it’s actually a real problem Anthropic ran into during test runs of its newest AI model.
Released on Thursday, Anthropic considers its two Claude models—Opus 4 and Sonnet 4—the new standards for “coding, advanced reasoning, and AI agents." But in safety tests, Claude got messy in a manner fit for a Lifetime movie.
SEE ALT TEXT