Poincar é maps for visualization of large protein families

Brief Bioinform. 2023 Mar 22:bbad103. doi: 10.1093/bib/bbad103. Online ahead of print.ABSTRACTIn the era of constantly increasing amounts of the available protein data, a relevant and interpretable visualization becomes crucial, especially for tasks requiring human expertise. Poincaré disk projection has previously demonstrated its important efficiency for visualization of biological data such as single-cell RNAseq data. Here, we develop a new method PoincaréMSA for visual representation of complex relationships between protein sequences based on Poincaré maps embedding. We demonstrate its efficiency and potential for visualization of protein family topology as well as evolutionary and functional annotation of uncharacterized sequences. PoincaréMSA is implemented in open source Python code with available interactive Google Colab notebooks as described at https://www.dsimb.inserm.fr/POINCARE_MSA.PMID:36946414 | DOI:10.1093/bib/bbad103
Source: Briefings in Bioinformatics - Category: Bioinformatics Authors: Source Type: research
More News: Bioinformatics