The experiment was led by a journalism professor specialising in computer science, who tested seven generative AI systems over a four week period. Each day, the tools were asked to list and summarise the five most important news events in Québec, rank them by importance, and provide direct article links as sources. Among the systems tested were Google’s Gemini, OpenAI’s ChatGPT, Claude, Copilot, Grok, DeepSeek, and Aria. The most striking failure involved Gemini inventing a...