Discovering common pathogenic processes between COVID-19 and HFRS by integrating RNA-seq differential expression analysis with machine learning

This study is attributed to the identification of common genes and perturbed pathways by combining differential expression analysis with bioinformatics and machine learning approaches. Initially, the transcriptomic data of hantavirus-infected peripheral blood mononuclear cells (PBMCs) and SARS-CoV-2 infected PBMCs were analyzed through differential gene expression analysis for identification of common differentially expressed genes (DEGs). The functional annotation by enrichment analysis of common genes demonstrated immune and inflammatory response biological processes enriched by DEGs. The protein–protein interaction (PPI) network of DEGs was then constructed and six genes named RAD51, ALDH1A1, UBA52, CUL3, GADD45B, and CDKN1A were identified as the commonly dysregulated hub genes among HFRS and COVID-19. Later, the classification performance of these hub genes were evaluated using Random Forest (RF), Poisson Linear Discriminant Analysis (PLDA), Voom-based Nearest Shrunken Centroids (voomNSC), and Support Vector Machine (SVM) classifiers which demonstrated accuracy >70%, suggesting the biomarker potential of the hub genes. To our knowledge, this is the first study that unveiled biological processes and pathways commonly dysregulated in HFRS and COVID-19, which could be in the next future used for the design of personalized treatment to prevent the linked attacks of COVID-19 and HFRS.
Source: Frontiers in Microbiology - Category: Microbiology Source Type: research