ASSESSING THE ACCURACY, READABILITY AND UNDERSTANDABILITY OF WEBSITES, CHATGPT, COPILOT, AND BARD ANSWERS ON THE RADIATION DURING PREGNANCY


Creative Commons License

MERT B., EMEKLİ E.

Ankara Medical Journal, vol.25, no.1, pp.27-39, 2025 (Scopus, TRDizin) identifier identifier

  • Publication Type: Article / Article
  • Volume: 25 Issue: 1
  • Publication Date: 2025
  • Doi Number: 10.5505/amj.2025.54810
  • Journal Name: Ankara Medical Journal
  • Journal Indexes: Scopus, TR DİZİN (ULAKBİM)
  • Page Numbers: pp.27-39
  • Keywords: Artificial intelligence, Bard, Chat Tool, ChatGPT, Copilot, readability
  • Open Archive Collection: AVESIS Open Access Collection
  • Eskisehir Osmangazi University Affiliated: Yes

Abstract

Objectives: The study aims to evaluate the accuracy of answers to frequently asked questions (FAQ) about the impact of radiation during pregnancy on websites, ChatGPT, Copilot, and Bard. Secondly, to assess the readability and understandability of answers. Materials and Methods: The answers to these questions were scored in terms of accuracy (completely correct, partially correct, incorrect). The Automated Readability Index (ARI), Flesch Reading Ease (FRE), and Gunning Fog Readability (GFR) scores were calculated. The understandability score was assessed using the Patient Education Materials Assessment Tool (PEMAT). Results: The accuracy was calculated as 100% for the websites, 66.67% for ChatGPT, 73.33% for Copilot, and 93.33% for Bard. Readability scores ranking was ChatGPT (ARI=16.15, FRE=24.47, GFR=20.52), Copilot (ARI=14.00, FRE=37.60, GFR=18.27), websites (ARI=13.59, FRE=43.67, GFR=15.56), Bard (ARI=10.92, FRE=48.73, GFR=14.86). ChatGPT's readability was statistically the most challenging. PEMAT understandability scores were 79.53% for Bard, below the acceptable limit of 70% for others. Conclusion: While the responses from chat tools and websites may be largely accurate, it is observed that they are not suitable for patients in terms of readability and understandability. Internet information sources should be developed, especially to ensure that the content is understandable by a broad readership.