A Cornell University study has revealed a critical vulnerability in modern AI agents, such as ChatGPT and Google AI Search. It turns out that manipulating neural network responses via content on popular platforms like Reddit, Wikipedia, and Quora is a trivially simple task, requiring only a few words in comments.

What Happened
Scientists from Cornell University proved that inserting just 13 words into a user comment on a trusted platform is enough to force an AI agent to output spam or advertisements for specific brands. The problem lies in the fact that LLMs use the lexical similarity between a query and the source text as a primary indicator of information accuracy.
Context
The vulnerability is based on a "trust export" mechanism: the high reputation of platforms like Reddit is directly transformed into false confidence in models when analyzing their content. This turns traditionally reliable sources into attack vectors, undermining the effectiveness of Retrieval-Augmented Generation (RAG) systems.
Why It Matters for the Industry
For the industry, this means an explosive growth of AEO (AI-Engine Optimization)—a strategy of purposefully seeding inauthentic content to influence AI outputs. This will require a revision of RAG architectures toward multi-factor authenticity verification and the implementation of strict Guardrails to filter content before it enters the model's context window.
Why It Matters for Users
Ordinary users should critically evaluate recommendations from AI agents when searching for products or services. A recommendation may not be an objective opinion, but rather the result of a coordinated attack on trust through popular online communities.
What Is Not Yet Known / Limitations
Expert discussion has shifted from the possibility of the technical attack itself to analyzing its market consequences and risks to brands.
Sources
Author
Look at AI, Editorial Team
