ChatGPT-3.5 and ChatGPT-4 Performance in Testicular Cancer: A Comparative Study

Uysal, Umit; Ucar, Murat; Sagir, Suleyman

dc.contributor.author	Uysal, Umit
dc.contributor.author	Ucar, Murat
dc.contributor.author	Sagir, Suleyman
dc.date.accessioned	2026-01-24T12:26:43Z
dc.date.available	2026-01-24T12:26:43Z
dc.date.issued	2025
dc.department	Alanya Alaaddin Keykubat Üniversitesi
dc.description.abstract	Objective: The aim of our study is to assess the reliability of Chat Generative Pre-trained Transformer (ChatGPT), compare the performance of ChatGPT-4 with that of ChatGPT-3.5, and explore its potential roles in healthcare decision-making. Materials and Methods: Thirty questions related to testicular cancer were prepared, based on the 2023 European Association of Urology guidelines and clinical experience. These questions were systematically posed to ChatGPT-3.5 and ChatGPT-4, and the responses were rated by three independent urologists using a six-point Likert scale. The median score from the three specialists was used as the final score. Results: Both ChatGPT versions provided an incorrect answer to one question, each receiving a score of 1. For GPT-3.5 and GPT-4, the percentage of responses considered incorrect by the urologists was 20% and 13.3%, respectively, while correct responses (scoring 3 or higher) accounted for 80% and 86.7%. For general information-diagnosis questions, GPT-3.5 and GPT-4 had average scores of 4.29 and 4.80, with median values of 4.27 and 4.67. For treatment follow-up questions, average scores were 3.60 and 4.16, with median values of 3.60 and 4.20. GPT-4 generally outperformed GPT-3.5, but the difference was not statistically significant (p>0.05). Conclusion: Our study shows that ChatGPT-4 is more reliable and accurate than ChatGPT-3.5 in testicular cancer-related queries. Continued development of its database and clinical capabilities could optimize ChatGPT's utility in healthcare.
dc.identifier.doi	10.4274/uob.galenos.2025.2025.1.2
dc.identifier.endpage	46
dc.identifier.issn	2147-2270
dc.identifier.issue	2
dc.identifier.startpage	40
dc.identifier.trdizinid	1325249
dc.identifier.uri	https://doi.org/10.4274/uob.galenos.2025.2025.1.2
dc.identifier.uri	https://search.trdizin.gov.tr/tr/yayin/detay/1325249
dc.identifier.uri	https://hdl.handle.net/20.500.12868/4866
dc.identifier.volume	24
dc.identifier.wos	WOS:001530336700001
dc.identifier.wosquality	Q4
dc.indekslendigikaynak	Web of Science
dc.indekslendigikaynak	TR-Dizin
dc.language.iso	en
dc.publisher	Galenos Publ House
dc.relation.ispartof	Uroonkoloji Bulteni-Bulletin of Urooncology
dc.relation.publicationcategory	Article - International Peer-Reviewed Journal - Institutional Faculty Member
dc.rights	info:eu-repo/semantics/openAccess
dc.snmz	KA_WoS_20260121
dc.subject	Artificial intelligence
dc.subject	ChatGPT
dc.subject	natural language processing
dc.subject	testicular cancer
dc.title	ChatGPT-3.5 and ChatGPT-4 Performance in Testicular Cancer: A Comparative Study
dc.type	Article
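
The scoring procedure summarized in the abstract (three ratings per response, the median taken as the final score, and responses scoring 3 or higher counted as correct) can be illustrated with a minimal sketch. The function names and the sample ratings below are hypothetical and are not taken from the study; only the median rule, the threshold of 3, and the count of thirty questions come from the abstract.

```python
from statistics import median


def final_score(ratings):
    """Return the median of the raters' scores for one response."""
    return median(ratings)


def percent_correct(final_scores, threshold=3):
    """Percentage of responses whose final score is at or above the threshold."""
    correct = sum(1 for score in final_scores if score >= threshold)
    return 100.0 * correct / len(final_scores)


# Hypothetical ratings (three urologists per question), for illustration only.
ratings_per_question = [(5, 4, 6), (2, 3, 1), (4, 4, 5)]
finals = [final_score(r) for r in ratings_per_question]

# With 30 questions, the reported 80% and 86.7% would correspond to
# 24 and 26 responses at or above the threshold.
reported_equivalents = [round(100 * n / 30, 1) for n in (24, 26)]
```

Under this reading, the reported 20% and 13.3% incorrect rates would correspond to 6 and 4 of the 30 responses, respectively.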

Files

Original bundle

Name: 40-46.pdf
Size: 160 KB
Format: Adobe Portable Document Format

Collection

WoS Indexed Publications Collection
TR-Dizin Indexed Publications Collection