Researchers tested ChatGPT, Gemini, Grok, Meta AI, and DeepSeek with 250 prompts across cancer, vaccines, stem cells, nutrition, and athletic performance. The prompts reflected common health queries and familiar misinformation themes, and the researchers measured whether the bots stayed aligned with scientific evidence or drifted into misleading and potentially unsafe advice. The weakest results came from open-ended prompts. Those broader questions produced far more highly problematic answers than...