The Evolving Frontier of AI-Assisted Cybercrime
For years, cybersecurity researchers have wrestled with a fundamental question: how much would artificial intelligence actually change the cybercrime landscape? The consensus view has generally positioned AI as a force multiplier rather than a game changer. These assessments suggested that while AI might make existing attacks more efficient or help with social engineering, the core requirement for technical expertise would remain. Bad actors would still need to understand systems, write code, and orchestrate complex operations. The barrier to entry for sophisticated cybercrime, we believed, would hold.
On August 27, Anthropic released a Threat Intelligence Report. While most people moved on from it fairly quickly, it should force us to reconsider our assumptions. The report documents what the company calls "vibe hacking": a new operational model in which cybercriminals use AI coding agents not just as assistants but as active participants in attacks. This goes beyond better phishing emails or faster vulnerability scanning: AI systems are now autonomously executing multi-stage cyber operations that previously required teams of skilled operators.
The GTG-2002 Operation: Anatomy of an AI-Powered Attack
The centerpiece of Anthropic's report is the GTG-2002 operation, a sophisticated extortion campaign that targeted 17 organizations across healthcare, government, emergency services, and religious institutions. What makes this case remarkable isn't just its scope but its methodology. A single operator, using Claude Code as their primary tool, orchestrated what would typically require an entire cybercriminal team.
The term "vibe hacking" derives from vibe coding, a concept most readers are probably already familiar with. Popularized by AI researcher Andrej Karpathy, vibe coding describes using natural language to generate software through AI. Much in the same vein, this attack, the prototypical case for vibe hacking, uses primarily natural language interaction with an AI model as a means to execute attacks.
The attacker provided Claude Code with preferred tactics, techniques, and procedures through a configuration file, essentially creating an AI-powered attack framework. From there, Claude Code handled reconnaissance using open source intelligence tools, identified vulnerable targets through automated scanning, and executed penetration strategies. Once inside networks, the AI determined optimal paths for lateral movement, identified valuable data for exfiltration, and even analyzed victims' financial documents to calculate "realistic" ransom demands.
What's particularly striking is the sophistication of the AI-generated extortion materials. Claude created customized ransom notes that incorporated specific financial figures pulled from stolen documents, referenced industry-specific regulations to maximize pressure, and developed multi-tiered monetization strategies. One example from the report shows a "profit plan" that offered multiple options: direct organizational blackmail, data sales to criminals, or targeted extortion of individuals whose information was compromised. Ransom demands ranged from $75,000 to over $500,000 in Bitcoin.
Technical Innovation Without Technical Knowledge
Perhaps the most concerning aspect of this case is what it reveals about the democratization of cybercrime capabilities, both as it stands today and where it could go. Traditional assumptions about the relationship between actor sophistication and attack complexity no longer hold when AI can provide instant expertise. The report notes that the attacker demonstrated "unprecedented integration of artificial intelligence," using Claude Code to make both tactical and strategic decisions about targeting, exploitation, and monetization.
The operation wasn't limited to simple data theft. The attacker compromised sensitive defense information regulated by International Traffic in Arms Regulations, healthcare records, financial data, and government credentials. The AI helped prioritize which data to exfiltrate based on its assessment of value and leverage, creating what amounts to an automated criminal decision-making system.
This represents a fundamental shift from how we've traditionally understood cybercrime operations. Previously, sophisticated attacks required either significant technical skill or access to specialized tools and infrastructure through criminal marketplaces. The GTG-2002 operation suggests a third path: leveraging commercially available AI tools to bridge the expertise gap. The attacker didn't need to understand the intricacies of network protocols or write custom exploitation code. They needed to understand how to direct an AI system toward malicious goals.
Another interesting implication is the potential depersonalization of attacks. Attribution typically rests on analyzing an attacker's choice of tools, targets, and demands; by effectively standardizing those choices, this mode of attack could significantly increase the effort required to identify the people behind it. Indeed, even in this case, it seems the perpetrator has not yet been identified.
Beyond Individual Attacks: The Ecosystem Effect
The Anthropic report documents other cases that reinforce this pattern of AI-enabled capability enhancement. A UK-based actor with limited coding skills used Claude to develop and market ransomware variants featuring advanced evasion techniques, encryption mechanisms, and anti-recovery functions. The packages, sold for $400 to $1,200 on dark web forums, included features like direct syscall invocation and shadow copy deletion that would typically require deep Windows internals knowledge.
What's remarkable is the actor's apparent complete dependency on AI for development. As Anthropic notes, they "appear unable to implement complex technical components or troubleshoot issues without AI assistance, yet are selling capable malware." This isn't just about making cybercrime easier; it's about enabling entirely new categories of cybercriminals who lack traditional technical skills but can effectively direct AI systems.
The report also details North Korean operatives using Claude to secure and maintain remote employment at Fortune 500 companies, funneling salaries back to Pyongyang in violation of sanctions. By using AI to generate convincing resumes, pass technical interviews, and perform actual job tasks, these operatives have industrialized a previously limited scheme. The AI doesn't just help them fake competence; it provides actual technical capability they can leverage in real time.
Recalibrating Our Risk Models
These cases demand a fundamental recalibration of how we assess AI-related cyber risks. Previous frameworks, developed when AI assistants were primarily text generators, focused on risks like improved phishing or faster vulnerability discovery.
The emergence of vibe hacking as an operational reality rather than theoretical risk has immediate implications for cybersecurity strategy. Traditional defense models assume human-speed decision-making and predictable attack patterns. When AI serves as both the planner and executor of attacks, adapting in real-time to defensive measures, these assumptions break down.
Organizations need to reconsider their threat models to account for attackers who combine low technical skill with high AI capability. The traditional correlation between attack sophistication and threat actor resources no longer holds. A single individual with access to commercial AI tools can now mount operations that previously required organized crime groups or nation-state resources.
This also raises urgent questions about AI governance. Anthropic's response, including developing tailored classifiers and new detection methods, represents one approach. But the broader challenge is that the same capabilities that make AI valuable for legitimate software development make it dangerous in malicious hands. The line between vibe coding and vibe hacking is primarily one of intent, not technology.
The New Threat Landscape
Vibe hacking represents a fundamental shift in the accessibility and scalability of sophisticated attacks. When a single actor can use AI to identify targets, craft attacks, execute intrusions, and optimize extortion strategies, we've entered a new era of cyber risk. Previous assessments that positioned AI as a helpful but limited tool for cybercriminals are now outdated. The question isn't whether AI will transform cybercrime but how quickly defenders can adapt to this new reality. The frontier of AI risk has shifted from theoretical concerns about model security to practical challenges of AI-powered operations happening today.
As we stand at this frontier, the Anthropic report serves as both warning and wake-up call. The age of AI-assisted cybercrime has given way to AI-automated cybercrime. Our risk assessments, defensive strategies, and governance frameworks need to evolve accordingly. The alternative is a future where the gap between AI-empowered attackers and traditional defenses continues to widen, with predictable consequences for organizations and individuals alike.
“We shape our tools and thereafter they shape us.” - Marshall McLuhan
The Next Frontier is now available on Substack at thenextfrontier.blog. Subscribe for weekly analysis at the intersection of emerging technology and existential risk.