ASSESSING THE ACCURACY, READABILITY AND UNDERSTANDABILITY OF WEBSITES, CHATGPT, COPILOT, AND BARD ANSWERS ON THE RADIATION DURING PREGNANCY

MERT, BURCU; EMEKLİ, EMRE

doi:10.5505/amj.2025.54810

ASSESSING THE ACCURACY, READABILITY AND UNDERSTANDABILITY OF WEBSITES, CHATGPT, COPILOT, AND BARD ANSWERS ON THE RADIATION DURING PREGNANCY

MERT B., EMEKLİ E.

Ankara Medical Journal, cilt.25, sa.1, ss.27-39, 2025 (Scopus, TRDizin)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 25 Sayı: 1
Basım Tarihi: 2025
Doi Numarası: 10.5505/amj.2025.54810
Dergi Adı: Ankara Medical Journal
Derginin Tarandığı İndeksler: Scopus, TR DİZİN (ULAKBİM)
Sayfa Sayıları: ss.27-39
Anahtar Kelimeler: Artificial intelligence, Bard, Chat Tool, ChatGPT, Copilot, readability
Açık Arşiv Koleksiyonu: AVESİS Açık Erişim Koleksiyonu
Eskişehir Osmangazi Üniversitesi Adresli: Evet

Özet

Objectives: The study aims to evaluate the accuracy of answers to frequently asked questions (FAQ) about the impact of radiation during pregnancy on websites, ChatGPT, Copilot, and Bard. Secondly, to assess the readability and understandability of answers. Materials and Methods: The answers to these questions were scored in terms of accuracy (completely correct, partially correct, incorrect). The Automated Readability Index (ARI), Flesch Reading Ease (FRE), and Gunning Fog Readability (GFR) scores were calculated. The understandability score was assessed using the Patient Education Materials Assessment Tool (PEMAT). Results: The accuracy was calculated as 100% for the websites, 66.67% for ChatGPT, 73.33% for Copilot, and 93.33% for Bard. Readability scores ranking was ChatGPT (ARI=16.15, FRE=24.47, GFR=20.52), Copilot (ARI=14.00, FRE=37.60, GFR=18.27), websites (ARI=13.59, FRE=43.67, GFR=15.56), Bard (ARI=10.92, FRE=48.73, GFR=14.86). ChatGPT's readability was statistically the most challenging. PEMAT understandability scores were 79.53% for Bard, below the acceptable limit of 70% for others. Conclusion: While the responses from chat tools and websites may be largely accurate, it is observed that they are not suitable for patients in terms of readability and understandability. Internet information sources should be developed, especially to ensure that the content is understandable by a broad readership.