Large Language Model-Based Evaluation of Medical Question Answering Systems: Algorithm Development and Case Study

CONCLUSIONS: Our scalable evaluation framework enables the simulation of patient populations with different health literacy levels and helps to evaluate domain specific CAs, thus promoting their integration into clinical practice. Future research aims to extend the framework to CAs without predefined content and to apply LLMs to adapt medical information to the specific (health) literacy level of the user.PMID:38682499 | DOI:10.3233/SHTI240006

https://pubmed.ncbi.nlm.nih.gov/38682499/?utm_source=no_user_agent&utm_medium=rs...

Source: Studies in Health Technology and Informatics - April 29, 2024 Category: Information Technology Authors: Daniel Reichenpfader Philipp R össlhuemer Kerstin Denecke Source Type: research

More News: Information Technology | Mammography | Study