Assessment of correctness, content omission, and risk of harm in large language model responses to dermatology continuing medical education questions

Artificial intelligence (AI)-based large language models (LLMs) have shown promising performance in medical applications, including on specialty board examination questions and complex clinical cases (Beam et al, 2023; Eriksen et al, 2023). Previous reports evaluated the performance of LLMs on dermatology practice board examination questions (Passby et al, 2023; Joly-Chevrier et al, 2023; Mirza et al, 2024), but how LLMs perform compared with practicing dermatologists has not been established.
Source: Journal of Investigative Dermatology. Category: Dermatology. Tags: Letters to the Editor. Source type: research.