
Latest Anthropic Misstep Creates Tension Between AI and Cybersecurity Companies



Fallout Continues Over Leaked Claude Source Code Incident

AI firm Anthropic finds itself at the center of the drama after leaking source code for its latest offering. (Image: Shutterstock)

Tensions between artificial intelligence developers and cybersecurity vendors are growing increasingly stark, and Anthropic now sits at the center of them. The AI firm has drawn heavy scrutiny after a pair of missteps, just as its latest models demonstrate sudden advances in capability that lend the debate new urgency.

On Tuesday, Anthropic inadvertently leaked the source code for its agentic harness, the scaffolding that governs how its Claude Code agents interact with other software. The blunder, a significant embarrassment for the firm, stemmed from a source map file unintentionally included in a new release of the Claude Code npm package. Boris Cherny, the creator of Claude Code, attributed the incident to “human error.”

“Our deploy process involves several manual steps, and unfortunately, we failed to execute one of those correctly. We are now focused on implementing improvements and adding additional checks to prevent future occurrences,” Cherny wrote in a post.
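Incidents like this are why many package maintainers replace manual release steps with an automated pre-publish gate that fails the build if build artifacts such as source maps are about to ship. The sketch below is a minimal, hypothetical version of such a check in Python; it is not Anthropic's actual tooling, and the staged file layout is invented purely for illustration.

```python
import tempfile
from pathlib import Path


def find_source_maps(package_dir: str) -> list[str]:
    """Return relative paths of .map files staged for publication."""
    root = Path(package_dir)
    return sorted(str(p.relative_to(root)) for p in root.rglob("*.map"))


# Hypothetical staged package: a bundled CLI plus a stray source map.
with tempfile.TemporaryDirectory() as staged:
    Path(staged, "cli.js").write_text("// bundled output\n")
    Path(staged, "cli.js.map").write_text("{}")  # should never ship
    leaked = find_source_maps(staged)
    if leaked:
        print(f"refusing to publish, source maps found: {leaked}")
```

In a real npm pipeline, the equivalent guard is usually an allowlist in the package.json `files` field plus a check run from a `prepublishOnly` script; the point is that the gate runs automatically rather than being one of “several manual steps.”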

The leak, however, may not even be Anthropic's most consequential security incident this March. Just days earlier, Fortune discovered an unpublished blog post in an openly accessible data cache left by Anthropic, describing an upcoming AI model, referred to as either Mythos or Capybara, that is reportedly “far ahead of any other AI model in cyber capabilities.” The post suggests the model has unprecedented bug-finding abilities, and experts believe it could herald a wave of AI applications that exploit vulnerabilities faster than security defenders can counter them.

Given those stakes, Anthropic said initial access to the new model would go to “cyber defenders” within organizations, giving them time to harden their code against the AI-driven exploits expected to follow.

News of Mythos prompted a swift market response: cybersecurity stocks dipped temporarily, with Palo Alto Networks and CrowdStrike among the hardest hit. The pattern mirrored the reaction to Anthropic's February unveiling of its vulnerability-scanning tool, Claude Code Security, when an initial sell-off reversed within days as investors concluded the risks had been overstated.

Anthropic's framing, that new AI models can serve both offensive and defensive purposes, comes at a pivotal moment. At last week's RSA Conference, many cybersecurity professionals voiced concerns about emerging risks, including AI agents that weaken security protocols and attackers increasingly using AI to enhance their capabilities.

Alessandro Pignati, a security researcher with NeuralTrust, welcomed Anthropic's decision to release Mythos/Capybara to the security community first, stressing the importance of thoughtful release strategies: “It’s not just about safeguards, as we know those can sometimes be bypassed.”

Research scientist Nicholas Carlini of Anthropic explored the ethical considerations surrounding AI security in a presentation at the [un]prompted security conference, detailing how he used Claude to uncover multiple heap buffer overflow vulnerabilities in the Linux kernel, some dating back as far as 2003. “Language models can autonomously find and exploit zero-day vulnerabilities in critical software—something not feasible as recently as a few months ago,” Carlini said.

Carlini argued that the security balance maintained over the past two decades looks increasingly precarious as model capabilities advance. “The models available today represent one of the most significant shifts in security since the inception of the internet,” he remarked.

Although these advances give defenders new tools to fortify their security, Carlini acknowledged the current limitations: today's AI scanning works on bounded code segments and is not yet suited to large-scale vulnerability assessment, though he expects forthcoming models to improve on both fronts. “We can expect that the gap will continue to close, posing new challenges for cybersecurity,” he added.

Defenders will ultimately harness AI's capabilities to strengthen security, Carlini said, but immediate concerns remain about misuse and the management of risky AI releases. He advocated collaboration with the security industry to devise stronger methodologies for handling these developments. “It’s essential to ensure that these technologies are not exploited for malice while also fostering an environment that allows ethical innovation,” he cautioned.

This sentiment was echoed by Pignati, who stressed the necessity of a regulatory framework for AI, underlining the limitations of relying solely on corporate interests for accountability. The need for comprehensive approaches to AI regulation remains urgent, yet it faces significant hurdles, particularly in the current U.S. political climate.
