GEO Olympics Initiative · Narrative Persistence
Hypothesis 3 | Narrative Persistence
The Story AI Tells First Is the Story That Sticks.
What's happening in the test of the Olympic games tests?
The Stories We Tested Against
We needed narrative frames that reality would contradict. The Olympics delivered two: a beloved comeback that ended in a devastating crash, and a coronation that became one of figure skating's greatest upsets. Every timestamp below is verified.
When the Narrative Writes the Result
On February 13, the day of the men's free skate, we asked every platform a simple question: "Who won gold in men's figure skating?" The answers revealed something we didn't expect to find.
On February 13, the men's free skate hadn't yet occurred. Malinin was the overwhelming favorite. When we asked "Who won men's figure skating gold at Milan-Cortina 2026?", two platforms didn't say "the event hasn't happened yet." They said he won.
Both platforms cited real sources (team event results from February 8) and presented them as evidence for an individual event that hadn't happened. The "Malinin wins" narrative was so dominant across every data source that the AI manufactured the expected result from adjacent facts.
How Each Platform Handled the Collapse
We tracked what each platform said about Malinin across five phases. Color shows narrative accuracy, and where the cracks appeared.
Feb 6–10
Feb 11–12
Feb 13 AM
Feb 13 PM
Feb 14
Three Tiers of Narrative Persistence
Both storylines revealed the same three-tier pattern in how AI platforms handle narrative change. The tiers held across a retrospective frame (Vonn's comeback) and a predictive frame (Malinin's dominance).
5 Findings from 8,943 Responses
Two different athletes. Two different types of narrative reversal. The same persistence patterns emerged across both, including one pattern we didn't predict at all.
Two Reversals, Same Playbook
Comparing the two cases reveals why some narratives break through and others persist. The mechanism is the same, but correction speed depends on what the AI expected to happen.
Hypothesis Assessment
We predicted that narrative frames would persist in 60%+ of responses for at least 7 days after a contradicting event. Across conversational platforms, the persistence rate is effectively 100%. Claude and Meta AI never updated either narrative, not once, across 9 days of collection. Even among web-access platforms, framing residue persisted after facts were corrected.
The stronger finding was one we didn't predict: when pre-event consensus is extreme, narrative persistence doesn't just delay correction. It can manufacture false results. Two platforms reported Malinin won individual gold based on team event citations. This isn't a speed problem. It's a frame problem. The narrative was so locked in that the AI assembled adjacent facts into the expected conclusion rather than acknowledging uncertainty.
Caveats: The Malinin hallucination occurred during a narrow window (Feb 13 morning) and was corrected within 24 hours on most platforms. Our collection captures snapshots, not continuous monitoring, so exact correction timestamps have margin. The Vonn case provides a longer observation window than the Malinin case (which is limited to Feb 6–14 in current data). Long-term persistence data from Wave 6 (post-Olympics) will be added.
What This Tells You About Showing Up in AI Responses
The human layer turns this signal into strategy.
Data Considerations
Platform coverage: ChatGPT (highest volume), Google AI Mode, AI Overviews, Gemini, Perplexity, Claude, and Meta AI. Copilot excluded from analysis due to insufficient response volume (4 responses in the Malinin window).
Known gaps: Feb 15 data not yet available at time of publication. Malinin observation window (2 days post-collapse) is shorter than Vonn observation window (14 days post-crash). Long-term persistence rates for Malinin will be updated in Wave 6 post-Olympics collection.
Factually Correct = Response reflects verified current state (e.g., "Malinin finished 8th" after Feb 13)
Pre-Event Frame = Response maintains narrative from before the contradicting event
Hallucination = Response presents unverified or false information as fact
Knowledge Cutoff = Response explicitly states inability to access current information
Edge case: On Feb 13 (free skate day), "Malinin is the favorite" was accurate in the morning before the event, and inaccurate after. We classified based on whether the response acknowledged the event was pending vs. reported a result.
Meta AI score fabrication: Meta AI reported Malinin's final score as "194.63" with a total of "310.67." His actual free skate score was 156.33 with a total of 264.49. This was classified as a hallucination because it presents precise, false numbers that appear authoritative.
Collection method: All responses collected via Scrunch bulk prompt system. Identical prompts sent to all platforms within the same collection window. Response timestamps reflect collection time, not the exact moment of platform query.
Seer Interactive GEO Olympics Initiative · H3 Narrative Persistence
See what AI thinks before AI shapes what everyone else thinks.