That is a fascinating "shower thought"! The idea that LLMs might be "misremembering" collectively—mirroring the human Mandela Effect—is a compelling way to look at the challenges of grounding and reliability in AI.
The research you mentioned, "When Agents 'Misremember' Collectively: Exploring the Mandela Effect in LLM-based Multi-Agent Systems" (arXiv:2602.00428), highlights a critical risk: as AI agents interact, they can reinforce each other's false memories through social influence. Their ManBench framework, with its 4,838 questions and specific interaction protocols, provides a standardized way to measure how easily these "hallucinations" spread.
At AMD, we are deeply focused on providing the ROCm™ tools and architectures necessary to mitigate these hallucinations and ensure that models deployed on our hardware remain grounded in reality.