Bioinformatics pipeline for the systematic mining genomic and proteomic variation linked to rare diseases: The example of monogenic diabetes

PLoS One. 2024 Apr 18;19(4):e0300350. doi: 10.1371/journal.pone.0300350. eCollection 2024.ABSTRACTMonogenic diabetes is characterized as a group of diseases caused by rare variants in single genes. Like for other rare diseases, multiple genes have been linked to monogenic diabetes with different measures of pathogenicity, but the information on the genes and variants is not unified among different resources, making it challenging to process them informatically. We have developed an automated pipeline for collecting and harmonizing data on genetic variants linked to monogenic diabetes. Furthermore, we have translated variant genetic sequences into protein sequences accounting for all protein isoforms and their variants. This allows researchers to consolidate information on variant genes and proteins linked to monogenic diabetes and facilitates their study using proteomics or structural biology. Our open and flexible implementation using Jupyter notebooks enables tailoring and modifying the pipeline and its application to other rare diseases.PMID:38635808 | PMC:PMC11025945 | DOI:10.1371/journal.pone.0300350
Source: Genomics Proteomics ... - Category: Genetics & Stem Cells Authors: Source Type: research