Enhancing Nationwide Medico-Administrative Databases Analysis with SAF4SUHAD: A Statistical Analysis Framework for Secondary Use of Healthcare Administrative Databases.

The objective of this work is to build SAF4SUHAD, a statistical analysis framework for secondary use of healthcare administrative databases, using literature-based specifications. A literature review was performed on PubMed in four medical domains: caesarean deliveries, cholecystectomies, hip replacement surgeries and bariatric surgeries. We identified 22 papers reporting analyses of large databases. The epidemiological indicators they reported (e.g. mean age) were abstracted into features (e.g. univariate description of a quantitative variable), which were then implemented as 32 user-facing functions in the R programming language. For instance, one function draws a histogram and computes the mean with its confidence interval, quantiles, etc. These comprise 4 functions for data management, 9 for univariate analysis, 8 for bivariate analysis and 11 for multivariate analysis, supported by many other intermediate functions. The functions were successfully used to analyze a French database of 250 million discharge summaries. The set of ready-to-use R functions defined in this work could make repetitive tasks more reliable and allow analysts to refocus their efforts on expert analysis. PMID: 30306900 [PubMed - in process]
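As an illustration of the "univariate description of a quantitative variable" feature mentioned in the abstract, here is a minimal sketch in Python. It is not the SAF4SUHAD code, which is written in R and is not reproduced in this abstract. The function name `describe_quantitative`, its parameters and the returned keys are hypothetical. The confidence interval uses a normal approximation, which is an assumption made here and is only reasonable for large samples. The histogram is omitted.

```python
import math
import statistics


def describe_quantitative(values, confidence=0.95):
    """Summarize a quantitative variable (hypothetical helper, not SAF4SUHAD)."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations")

    mean = statistics.fmean(values)
    sd = statistics.stdev(values)  # sample standard deviation (n - 1 denominator)

    # Assumption: normal-approximation confidence interval for the mean.
    z = statistics.NormalDist().inv_cdf(0.5 + confidence / 2)
    half_width = z * sd / math.sqrt(n)

    # Quartiles computed with the "inclusive" method of statistics.quantiles.
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

    return {
        "n": n,
        "mean": mean,
        "sd": sd,
        "ci_low": mean - half_width,
        "ci_high": mean + half_width,
        "q1": q1,
        "median": median,
        "q3": q3,
    }


if __name__ == "__main__":
    # Made-up ages, for illustration only.
    ages = [20, 30, 40, 50, 60]
    for key, value in describe_quantitative(ages).items():
        print(f"{key}: {value}")
```

Bundling the summary statistics into one call reflects the idea stated in the abstract: a repetitive descriptive task is written once and reused, so that effort can go to interpretation.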
Source: Studies in Health Technology and Informatics - Category: Information Technology Tags: Stud Health Technol Inform Source Type: research