Cloudflare engineers have presented a multi-stage harness architecture for automated vulnerability discovery and validation, moving from simple AI agents to complex orchestration systems.

What Happened
A system has been developed, divided into two key stages: VDH (Vulnerability Discovery Harness) for active searching through recon-hunt-validate cycles, and VVS (Vulnerability Validation System) for triage, deduplication, and automated patch generation. The architecture uses a model-agnostic approach, where different models play the roles of attackers and defenders to ensure cross-verification of results.
Context
Traditional use of LLMs as simple agents is often limited by context window size and a tendency to hallucinate. To solve critical tasks such as cybersecurity, external state management and strict validation rules are required, including mandatory PoC (Proof of Concept) creation and the use of threat models.
Why It Matters for the Industry
There is a shift from using LLMs as isolated agents to creating full orchestration layers. Implementing the Adversarial AI Workflows pattern allows AI to be transformed from an unreliable assistant into a structured security tool, where the key advantage is not the model itself, but the validation architecture.
Why It Matters for Users
For engineers and developers, this serves as a ready-made pattern for building reliable autonomous systems. Instead of writing simple prompts, specialists gain an architecture of complex pipelines that allow for solving deep engineering tasks in the field of security research with a minimal error rate.
Sources
Author
Look at AI, Editorial Team
