Large Language Model-Based Evaluation of Medical Question Answering Systems: Algorithm Development and Case Study

CONCLUSIONS: Our scalable evaluation framework enables the simulation of patient populations with different health literacy levels and helps to evaluate domain-specific conversational agents (CAs), thus promoting their integration into clinical practice. Future research aims to extend the framework to CAs without predefined content and to apply LLMs to adapt medical information to the specific (health) literacy level of the user.
PMID: 38682499 | DOI: 10.3233/SHTI240006
Source: Studies in Health Technology and Informatics | Category: Information Technology | Source Type: research