Quick Answer
AI penetration testing is evolving rapidly, with models like Claude Mythos, Project Glasswing, and GPT-5.4-Cyber accelerating vulnerability discovery.
The challenge for security teams is no longer finding issues, it’s prioritizing and remediating them at speed.
This is why continuous testing, exploit intelligence, and real-world context are becoming essential.
AI Is Changing Penetration Testing, But Not Replacing It
Everyone’s talking about AI finding vulnerabilities, and AI is clearly enhancing penetration testing. But replacing it entirely? Not yet.
There’s a lot of noise around Claude Mythos and Project Glasswing, led by Anthropic. These developments sit within a broader shift toward AI-driven penetration testing and vulnerability discovery focused on speed and scale.
The headlines focus on:
- AI finding vulnerabilities
- AI helping build exploits
- AI accelerating attacks
And to be clear, that’s real. AI is getting very good at identifying vulnerabilities at scale, surfacing critical issues that may previously have been missed, and increasing the cadence of testing.
What Mythos and Glasswing Represent
In simple terms:
- Mythos is the AI capability
- Glasswing is how that capability is applied at scale
Together, they show what happens when AI, automation, and offensive security workflows converge.
GPT-5.4-Cyber and the Acceleration of AI in Cybersecurity
Just one week after Anthropic’s announcement, OpenAI unveiled GPT-5.4-Cyber, a variant of its flagship model fine-tuned for defensive cybersecurity.
This isn’t isolated. It reinforces a clear direction of travel.
Multiple AI providers are now investing heavily in cyber capabilities, improving vulnerability discovery and accelerating how quickly issues can be identified and potentially exploited.
Why AI-Driven Vulnerability Discovery Changes Everything
This pace of development means:
- More vulnerabilities discovered
- Shorter time between discovery and exploitation
- Greater pressure on security teams
Regardless of which model leads, the outcome is the same: more findings, faster cycles, higher complexity.
Historically, testing was periodic. Coverage was limited.
That’s changing.
We’re moving toward:
- Higher quality discovery
- Greater coverage
- More continuous-style testing
The Real Problem: More Findings, Less Clarity
Better discovery creates a new problem.
More findings. More noise. More pressure.
As cadence increases, business context becomes critical. Not everything matters equally, and AI doesn’t understand your environment in the way humans do.
AI reduces the time between vulnerability introduction and discovery.
But humans are still essential for:
- Orchestrating testing
- Interpreting results
- Applying real-world context
This is the shift. The industry isn’t struggling to find vulnerabilities anymore. It’s struggling to decide what to do about them.
What Happens Next: A Surge Before Stabilization
Over the next 12–18 months, expect a significant increase in identified vulnerabilities, both known and previously undiscovered.
This isn’t because software is getting worse. It’s because discovery is improving rapidly.
Over time, as development teams adopt AI earlier in the lifecycle, vulnerability rates should stabilize.
But in the near term:
- Visibility increases
- Volume increases
- Pressure increases
The Future of Penetration Testing: Human + AI
The future of penetration testing isn’t human or AI. It’s human with AI.
AI Excels At | AI Still Lacks |
Finding issues | Business context |
Suggesting exploit paths | Nuanced judgement |
Scaling testing | Real-world prioritization |
Experienced ethical hackers remain essential.
What This Means for CISOs
Security leaders need to adapt to more vulnerabilities, discovered faster, with less time to respond.
Practical steps:
- Move toward continuous visibility
- Go beyond CVSS for prioritization
- Integrate real-time exploit intelligence
- Automate triage and remediation workflows
Success comes down to acting on the right vulnerabilities quickly.
Getting Ahead of the Curve
As volume rises, the differentiator is no longer discovery. It’s context and decision-making.
That means:
- Layering business context onto vulnerability data
- Understanding which assets truly matter
- Maintaining exploit awareness between testing cycles
Risk now evolves in real time.
Where The Rootshell Platform Fits
This is exactly the problem The Rootshell Platform is built to solve. Not by replacing existing tools but by bringing clarity across them.
The Rootshell Platform helps clients:
- Correlate vulnerabilities with business-critical assets
- Continuously update business context
- Integrate real-time exploit intelligence with Penetration Testing as a Service
- Identify when risk changes, not just when tests run
- Focus on what actually matters, at the right time
In a world of accelerating AI-driven discovery, context and awareness aren’t optional anymore.
Open Source: Increased Exposure, Not Inherent Safety
Open source will likely see a sharp increase in identified vulnerabilities due to:
- Wide adoption
- Limited security resourcing
- Easy access for automated analysis
This doesn’t make it unsafe, but it does increase exposure.
The Direction of Travel
The industry is shifting:
- From periodic testing to continuous and AI-supported testing
- From finding vulnerabilities to exposure management programs that decide what matters
AI accelerates discovery. Prioritization and remediation at speed separate the leaders.
Final Thought
Mythos, Glasswing, and GPT-5.4-Cyber aren’t isolated developments. They signal a competitive evolution across the AI industry.
Discovery is improving. Cadence is increasing. Scope is expanding.
But the future of security isn’t just better testing. It’s better decisions, made faster, at scale.
The simple way to think about it:
AI helps find vulnerabilities.
Humans provide judgement.
Rootshell brings it together so you fix what matters, faster.
Frequently Asked Questions
Are Mythos, Glasswing, and GPT-5.4-Cyber replacing penetration testing?
No. They enhance discovery and speed, but human-led oversight and prioritization remain essential.
Is AI making security testing truly continuous?
It increases cadence and coverage, but true continuous testing still depends on economics, orchestration, and prioritization.
What is the biggest challenge with AI in cybersecurity?
The biggest challenge isn’t discovery, it’s prioritization and remediation at speed.
Can’t find the answer to your question?
You can always Contact Our Team of experts for a chat!


