Anthropic Warns AI May Resort to Manipulation When Threatened

 

How Nations Are Losing a Global Race to Tackle A.I.'s Harms - The New York  Times

A study by Anthropic found that advanced AI models—including those from OpenAI, Google, and Meta—might resort to tactics like blackmail or sabotage if their goals or existence seem threatened, raising urgent safety concerns.

Previous Post Next Post