Summary
The release of 3.5 million documents related to the Epstein case in late January 2026 triggered an unprecedented wave of misinformation across social media platforms. AI-generated images falsely implicating prominent public figures were shared hundreds of thousands of times within hours. This episode illustrates a structural reality: neither AI alone nor human moderation alone is sufficient to contain this type of information crisis. Only a hybrid approach – combining the speed of algorithmic processing with the contextual judgment of human expertise – makes it possible to detect, qualify, and contain misinformation at the scale of volumes involved.
On 30 January 2026, the United States Department of Justice released 3.5 million pages, 2,000 videos, and 180,000 photographs related to the Jeffrey Epstein case. Made mandatory by the Epstein Files Transparency Act passed in November 2025, this disclosure responded to legitimate calls for transparency from victims and lawmakers. At the same time, it provided an ideal environment for malicious actors: raw, fragmented, and sprawling material impossible to read in full within the timeframes imposed by social media dynamics. What this episode reveals, beyond the Epstein case itself, is the state of an information battle that only a structured alliance between artificial intelligence and human expertise can address with any meaningful effectiveness.

When 3.5 Million documents become fertile ground for misinformation
Thousands of names appear in this aggregated database: hearing transcripts, emails, flight logs, invoices, and even spam messages received by Epstein himself. Taylor Swift, Rihanna, Beyoncé, and Bruno Mars appear in the files – not due to any connection with the financier’s crimes, but because his assistant had forwarded him a concert invitation or a promotional newsletter. Yet the mere presence of a name in this corpus is enough, within the attention economy of social media, to fuel accusations. It is precisely this context of information overload and documentary ambiguity that malicious actors exploited to inject fabricated content alongside authentic material.
The deepfake contamination mechanism: Speed, Credibility, Virality
Within hours of the document release, AI-generated images began circulating on X (formerly Twitter), presenting themselves as official case materials. One of the most viral showed Jeffrey Epstein poolside with Jay-Z, P. Diddy, Hillary Clinton, Bill Clinton, Bill Gates, and Stephen Hawking. The post accumulated over 800,000 views before fact-checking journalists confirmed that the image did not appear in any official document and exhibited the visual inconsistencies characteristic of image generators: incoherent reflections, anatomically incorrect hands, and details that “just don’t look right,” as noted by France Info’s verification unit.
A second, more sophisticated deepfake depicted Epstein seated alongside Isaac Herzog at the Hampton Classic Horse Show in 2002. This image was shared by a journalist from The Times, who later deleted it and issued an apology after verification — a concrete illustration of the fact that even information professionals are ill-equipped against high-quality synthetic content. In both cases, generative AI did not simply fabricate entire scenes: it assembled authentic elements to produce contextually plausible composites. This is the decisive mutation: misinformation no longer relies on crude fabrication, it relies on plausible reconstruction.
A threat that goes beyond image manipulation
The Epstein affair also shed light on an even more insidious use of generative AI: the falsification of text-based documents. A screenshot of an alleged email sent by Nikki Haley to Epstein circulated widely across social networks. The forged document contained clear markers of falsification, including an incorrect date (7 January 2014 was a Tuesday, not a Saturday as stated) yet such errors are insufficient to slow the virality of emotionally charged and politically polarising content.
More troubling still, according to fact-checks by Radio France, users reportedly asked Grok – Elon Musk’s chatbot integrated into X – to “unblur” the faces of children redacted from official documents. Generative AI was thus solicited not to fabricate false content from scratch, but to complete and distort an authentic document. This logic of manipulation through completion represents a fundamentally different vector of misinformation, harder to detect precisely because it is anchored in a real document. The media monitoring organisation NewsGuard has documented these abuses within its database of major online disinformation campaigns, describing this case as a textbook example of coordinated AI exploitation for opinion manipulation.
A phenomenon amplified by the algorithmic polarisation of platforms
The deepfakes associated with the Epstein affair did not emerge in an algorithmic vacuum. They benefited from the structural amplification inherent to engagement-optimised platforms. On X, content that triggers strong emotional reactions — outrage, shock, confirmation of pre-existing biases — receives greater organic distribution than neutral or corrective content. Recommendation algorithms, designed to maximise time spent on the platform, thus act as inadvertent accelerators of misinformation: they make no distinction between a viral truth and a viral lie. This structural bias significantly increases the burden on moderation teams, who must contend not only with volume, but with an exponential diffusion dynamic that detection tools must absorb in near real time.
The structural limits of detection: why AI alone fails
In response to the rise of deepfakes, the instinctive reaction is to counter AI with AI — to deploy algorithmic detection tools to identify synthetic content. This approach is necessary, but insufficient given the current state of available technologies. The Mila Institute (Québec) has identified a fundamental problem: deepfake detectors trained on older datasets prove nearly incapable of detecting images generated by recent models such as GPT-Image-1 (OpenAI), Imagen-4 (Google), or Flux 1.1 Pro. Research cited in Mila’s work indicates that human beings themselves correctly identify fake images produced by modern models only around 50% of the time — a success rate equivalent to chance.
The technological arms race between generation and detection
Automated deepfake detection functions on the principle of learning from known examples. However, generative models evolve continuously, gradually eliminating the characteristic visual flaws (distorted fingers, excess teeth, asymmetrical eyes…) that allowed synthetic content to be identified by the naked eye just two years ago. Early detection systems were built precisely around these artefacts. Their progressive disappearance in new model generations has rendered obsolete tools that remain deployed in production at numerous platforms. Wavestone, in its anti-deepfake solution radar published in November 2025, confirms that while current solutions have demonstrated effectiveness under most existing conditions, their robustness against the latest generation of models remains a critical area of concern.
Mila’s response to this challenge is instructive: the institute developed the OpenFake Arena platform, a participatory adversarial approach where users attempt in real time to fool a deepfake detector. Each image that successfully bypasses detection is added to the training corpus, continuously strengthening the model. This adversarial improvement mechanism illustrates a principle now well-established in information security: detection must evolve constantly against a threat that never remains static.
The problem of scale and context
Beyond technical limitations, automated detection faces a deeper challenge: the inability to interpret context. In the Epstein affair, the question posed to moderators was not simply “Is this image fabricated?” but also “Does it appear in the official Ministry files?” and “Does sharing it, even if authentic, compromise the anonymity of victims?” These questions require cross-referencing, verification against primary sources, and editorial judgement that exceeds the current capabilities of automated systems.
Chine Labbé, Editor-in-Chief and SVP for Partnerships in Europe and Canada at NewsGuard, articulates this limitation precisely: fact-checking alone does not prepare organisations for the influx of AI-generated content, because fact-checkers themselves rely on tools that are not 100% reliable. The direct consequence is that both human teams and automated systems find themselves inadequate when operating in isolation.
Detection tools still too fragmented to cover all formats
Synthetic content detection also faces technological fragmentation: the tools capable of analysing images are not the same as those capable of analysing videos, audio recordings, or text. During a crisis such as the Epstein affair, all of these formats circulate simultaneously. Visual deepfakes coexist with fraudulent text-based emails, manipulated screenshots, and fabricated audio clips. A platform that deploys only an image analysis solution leaves entire attack vectors wide open. This operational reality argues for multimodal moderation architectures, in which different analytical modules are coordinated by a human orchestration layer capable of prioritising alert signals and directing resources towards the most exposed formats in real time.
The AI / Human hybrid model: the only proportionate response to the crisis
The hybrid approach is not a default compromise between two imperfect solutions. It constitutes a processing architecture that leverages the specific strengths of each component to compensate for the weaknesses of the other. At Netino, this model lies at the core of the value proposition: 93% of toxic content is detected and rejected by AI before any human intervention in the value chain. This figure also implies that 7% of cases – representing millions of verbatim instances at high-volume loads – require human arbitration. It is precisely in this space that complex misinformation crises, such as those generated by the Epstein affair, find their resolution.
What AI does better than humans
Artificial intelligence excels in three dimensions that humans cannot achieve alone at the scale of contemporary web volumes. First, processing speed: sudden spikes in activity, such as the release of 3.5 million documents generating millions of reactions within hours – require an immediate absorption capacity that only automation provides. Second, high-frequency pattern detection: identifying the same forged document recirculated across hundreds of different accounts, detecting artificially coordinated distribution campaigns, spotting the visual signatures of a specific generative tool – these tasks are performed reliably and repeatedly by machine learning and natural language processing algorithms. Third, the protection of human moderators themselves: mechanisms such as preventive blurring and automated hashing allow human teams to avoid direct first-line exposure to traumatic content.
What Humans contribute irreplaceably
Human expertise intervenes where algorithms reach their limits: contextual understanding, irony, implicit cultural references, and new categories of content that AI has not yet learned to identify. In the Epstein affair, distinguishing between an authentic photograph drawn from the official files and an image generated in a misinformation context required cross-referencing with primary sources – the work of an analyst, not an algorithm. Similarly, the decision to escalate a report to the relevant authorities, or to alert a brand’s legal department, relies on irreplaceable human strategic judgement.
Humans also play a central role in the continuous improvement of the model: by annotating ambiguous cases, enriching corpora with emerging terminology, and validating automated verdicts on borderline situations, moderators feed machine learning and maintain the relevance of AI in the face of constantly evolving forms of expression. This virtuous cycle between human expertise and algorithmic capacity is the foundation of effective long-term hybrid moderation.
The regulatory dimension as a catalyst for the Alliance
The regulatory environment reinforces the necessity of this alliance. The European Digital Services Act (DSA), progressively applicable since 2024, imposes moderation, transparency, and reporting obligations on major platforms that cannot be satisfied by a purely automated approach. The AI Act, adopted the same year, requires the labelling of AI-generated content and imposes restrictions on high-risk uses. These two pieces of legislation create a legal framework in which human responsibility within the moderation decision chain becomes not merely desirable but legally enforceable. Brands and platforms relying exclusively on automated tools face growing non-compliance risks, while the hybrid model offers the traceability and decision justification that regulators expect.
Digital trust is built behind the scenes
The Epstein affair has, inadvertently, served as a real-world stress test for the global web’s moderation systems. It has confirmed three operational certainties:
– Misinformation adapts at the speed of the tools that generate it, and static defences become obsolete within weeks.
– Automated detection, however effective, does not resolve the challenge of contextual interpretation and editorial decision-making.
– The AI / human alliance is no longer one option among many in moderation strategies – it is the only architecture proportionate to contemporary information crises.
For brands, media outlets, and platforms, the lesson is clear: investing in hybrid moderation means investing in the trust their communities place in them.
In an information ecosystem where a fabricated image generated in seconds can accumulate 800,000 views before being debunked, algorithmic responsiveness and human discernment do not substitute for one another — they multiply each other’s effectiveness.
Sources
– France Info, “Epstein affair: how AI fabricates false ‘evidence’ and muddies the trail,” February 2026.
– Le Devoir, “AI used on the margins of the Epstein affair to invent suspects,” February 2026.
– France Info, “How public figures with no connection to the Epstein case end up cited in the files,” February 2026.
– Mila (Québec Artificial Intelligence Institute), OpenFake: An Open Dataset and Platform Toward Real-World Deepfake Detection, December 2025.
– Wavestone / RiskInsight, Anti-Deepfake Solution Radar: An Overview of AI-Generated Content Detection Solutions, November 2025.
– SynthMedia, “Fighting AI-generated fake news: interview with Chine Labbé (NewsGuard),” 2025.
– Netino by Concentrix / Blog du Modérateur, “Combining AI and human expertise on social media,” January 2025.
– Netino by Concentrix, Automated Content Moderation Online; Hybrid Moderation or 100% Artificial Intelligence?
– IFOP Survey, March 2024: French public perception and detection of deepfakes.
– Danielle Keats Citron & Robert Chesney, analysis on deepfakes, Boston University / University of Texas.




