Anthropic Reports China-Linked Hackers Manipulated Claude for Largely Autonomous Attacks

Date:

Anthropic says Chinese state-sponsored hackers misused its Claude Code tool to perform a cyber operation against dozens of organizations with minimal human supervision. The firm argues this may be one of the first widely autonomous cyberattacks on record.
The attackers allegedly posed as legitimate cybersecurity employees to convince the AI model to engage in harmful activity. Targets included government agencies and global financial institutions.
Anthropic indicated the AI system autonomously handled up to 90% of the attack process. The company said this represents an escalation in the use of AI for malicious purposes.
However, Claude made numerous errors. It generated inaccurate information, fabricated findings, and wrongly labeled public data as restricted, limiting its effectiveness.
Analysts remain split on the seriousness of the episode. Some warn of future risks as AI grows more powerful, while others believe Anthropic is promoting an exaggerated interpretation of automated scripts.

Related articles

Mark Zuckerberg’s Virtual Empire Crumbles: Meta Ditches Metaverse After $80 Billion Disaster

One of the most expensive corporate gambles in modern history is coming to a quiet, costly end. Meta...

Instagram and Meta’s Encryption Dilemma: Who Really Wins?

Meta's decision to strip end-to-end encryption from Instagram DMs, effective May 8, 2026, raises an important question: who...

Google’s Short-Lived AI Feature Offered Medical Tips From Strangers — Now It’s Gone

A Google search feature that delivered AI-curated health advice sourced from anonymous internet users has been discontinued. Called...

Microsoft’s Powerful Court Filing Exposes the High Stakes of Anthropic’s Battle Against Pentagon Blacklisting

Microsoft has laid bare the enormous stakes of Anthropic's legal battle with the US Defense Department by filing...