NLPReViz: an interactive tool for natural language processing on clinical text

We report changes in performance relative to the quantity of feedback. Using initial training sets as small as 10 documents, expert review led to finalF1scores for the “appendiceal-orifice” variable between 0.78 and 0.91 (with improvements ranging from 13.26% to 29.90%).F1for “biopsy” ranged between 0.88 and 0.94 (−1.52% to 11.74% improvements). The average System Usability Scale score was 70.56. Subjective feedback also suggests possible design improvements.
Source: Journal of the American Medical Informatics Association - Category: Information Technology Source Type: research