ChatGPT-4.0 bests other large language models on ACR exam questions

ChatGPT-4.0 performs well on the image-independent American College of Radiology Diagnostic In-Training Exam (ACR DXIT) practice questions, a study presented November 29 at the RSNA 2023 annual meeting found.Presenter Christopher Kaufmann, MD, from the University of Texas at Austin talked about results from his team’s comparative study of large language models, which showed that ChatGPT-4.0 achieved the highest scores.“The results demonstrate the powerful efficiency and improving accuracy of evolving publicly available AI tools when applied to the radiology-specific domain,” Kaufmann said.Large language AI models such as ChatGPT and Google Bard have become an area of interest for radiologists within the past year. Previous studies have examined these models and their clinical utility in clinical- and patient-facing settings. However, Kaufmann pointed out that radiologists need to know the output accuracy, relevance, and reliability of these models before determining their clinical utility by specific domain.Kaufmann and colleagues compared the latest publicly accessible large language models across multiple subspecialty areas of radiology.They used ACR DXIT practice test question set from 2022, specifically image-independent questions that were distributed across various radiology disciplines. The team also used three publicly available large language model platforms: ChatGPT 3.5 & 4.0, Google Bard, and Windows BingChat. The questions were entered into the AI interface ...
Source: AuntMinnie.com Headlines - Category: Radiology Authors: Tags: Advanced Visualization RSNA 2023 Source Type: news