Author Ryan Duffy has criticized modern methods of attributing AI-generated code, stating that current text labels like Co-authored-by: Claude are easily forged and do not provide real security. To solve this problem, he proposes an architecture based on cryptographic proof of authorship.


What Happened
Ryan Duffy proposed a three-tier system for ensuring provenance for code written by AI agents. The system includes using structured Git trailers with unique session identifiers, mandatory cryptographic commit signing via SSH, and the use of hardware security modules, such as the macOS Secure Enclave (via Secretive), to protect agent keys. This transforms a simple text label into a verifiable mathematical proof.
Context
In the era of advancing AI-assisted development, the role of AI agents is transforming: from simple assistants, they are becoming full-fledged code authors. Current standards use simple text metadata to denote AI involvement, which creates a vulnerability that allows attackers to inject malicious code under the guise of trusted models or users.
Why It Matters for the Industry
For the industry, implementing cryptographic provenance is becoming a critical standard for protecting the software supply chain. This allows a transition from a "trust-on-word" model to a verifiable origin model, which is necessary for creating trusted development tools and ensuring compliance in the enterprise segment.
Why It Matters for Users
Developers using tools like Claude Code or GitHub Copilot need to understand that existing signatures do not guarantee code authenticity. The proposed approach allows for the guarantee that a specific piece of code was indeed generated by the claimed agent within a specific session, rather than being swapped by an attacker.
What Is Not Yet Known / Limitations
Implementing such an architecture will require significant operational restructuring of existing CI/CD processes and key management systems.
Sources
Author
Look at AI, Editorial Team
