Identifying Diffuse Glioma Subtypes Based on Pathway Enrichment Evaluation
AbstractGliomas are highly heterogeneous in molecular, histology, and microenvironment. However, a classification of gliomas by integrating different tumor microenvironment (TME) components remains unexplored.  Based on the enrichment scores of 17 pathways involved in immune, stromal, DNA repair, and nervous system signatures in diffuse gliomas, we performed consensus clustering to uncover novel subtypes of gliomas. Consistently in three glioma datasets (TCGA-glioma, CGGA325, and CGGA301), we identified three subtypes: Stromal-enriched (Str-G), Nerve-enriched (Ner-G), and mixed (Mix-G). Ner-G was charactered by low immune...
Source: Interdisciplinary Sciences, Computational Life Sciences - April 18, 2024 Category: Bioinformatics Source Type: research

AMPFLDAP: Adaptive Message Passing and Feature Fusion on Heterogeneous Network for LncRNA-Disease Associations Prediction
AbstractExploration of the intricate connections between long noncoding RNA (lncRNA) and diseases, referred to as lncRNA-disease associations (LDAs), plays a pivotal and indispensable role in unraveling the underlying molecular mechanisms of diseases and devising practical treatment approaches. It is imperative to employ computational methods for predicting lncRNA-disease associations to circumvent the need for superfluous experimental endeavors. Graph-based learning models have gained substantial popularity in predicting these associations, primarily because of their capacity to leverage node attributes and relationships ...
Source: Interdisciplinary Sciences, Computational Life Sciences - April 6, 2024 Category: Bioinformatics Source Type: research

MF-MNER: Multi-models Fusion for MNER in Chinese Clinical Electronic Medical Records
AbstractTo address the problem of poor entity recognition performance caused by the lack of Chinese annotation in clinical electronic medical records, this paper proposes a multi-medical entity recognition method F-MNER using a fusion technique combining BART, Bi-LSTM, and CRF. First, after cleaning, encoding, and segmenting the electronic medical records, the obtained semantic representations are dynamically fused using a bidirectional autoregressive transformer (BART) model. Then, sequential information is captured using a bidirectional long short-term memory (Bi-LSTM) network. Finally, the conditional random field (CRF)...
Source: Interdisciplinary Sciences, Computational Life Sciences - April 5, 2024 Category: Bioinformatics Source Type: research

Deep Canonical Correlation Fusion Algorithm Based on Denoising Autoencoder for ASD Diagnosis and Pathogenic Brain Region Identification
In this study, we introduce a new framework based on autoencoder, the Deep Canonical Correlation Fusion algorithm based on Denoising Autoencoder (DCCF-DAE), which proves to be effective in handling high-dimensional data. This framework involves efficient feature extraction from different types of data with an advanced autoencoder, followed by the fusion of these features through the DCCF model. Then we utilize the fused features for disease classification. DCCF integrates functional and structural data to help accurately diagnose ASD and identify critical Regions of Interest (ROIs) in disease mechanisms. We compare the pro...
Source: Interdisciplinary Sciences, Computational Life Sciences - April 4, 2024 Category: Bioinformatics Source Type: research

DeepPI: Alignment-Free Analysis of Flexible Length Proteins Based on Deep Learning and Image Generator
AbstractWith the rapid development of NGS technology, the number of protein sequences has increased exponentially. Computational methods have been introduced in protein functional studies because the analysis of large numbers of proteins through biological experiments is costly and time-consuming. In recent years, new approaches based on deep learning have been proposed to overcome the limitations of conventional methods. Although deep learning-based methods effectively utilize features of protein function, they are limited to sequences of fixed-length and consider information from adjacent amino acids. Therefore, new prot...
Source: Interdisciplinary Sciences, Computational Life Sciences - April 3, 2024 Category: Bioinformatics Source Type: research

BDM: An Assessment Metric for Protein Complex Structure Models Based on Distance Difference Matrix
In this study, we propose a novel metric called BDM (Based on Distance difference Matrix) for assessing protein complex prediction structures. Our approach utilizes a distance difference matrix derived from comparing real and predicted protein structures, establishing a linear correlation with Root Mean Square Deviation (RMSD). BDM overcomes limitations associated with receptor-ligand differentiation and eliminates the requirement for structure alignment, making it a more effective and efficient metric. Evaluation of BDM using CASP14 and CASP15 test sets demonstrates superior performance compared to the official CASP scori...
Source: Interdisciplinary Sciences, Computational Life Sciences - March 27, 2024 Category: Bioinformatics Source Type: research

DOTAD: A Database of Therapeutic Antibody Developability
AbstractThe development of therapeutic antibodies is an important aspect of new drug discovery pipelines. The assessment of an antibody's developability —its suitability for large-scale production and therapeutic use—is a particularly important step in this process. Given that experimental assays to assess antibody developability in large scale are expensive and time-consuming, computational methods have been a more efficient alternative. Howeve r, the antibody research community faces significant challenges due to the scarcity of readily accessible data on antibody developability, which is essential for training and v...
Source: Interdisciplinary Sciences, Computational Life Sciences - March 26, 2024 Category: Bioinformatics Source Type: research

ResDeepSurv: A Survival Model for Deep Neural Networks Based on Residual Blocks and Self-attention Mechanism
AbstractSurvival analysis, as a widely used method for analyzing and predicting the timing of event occurrence, plays a crucial role in the medicine field. Medical professionals utilize survival models to gain insight into the effects of patient covariates on the disease, and the correlation with the effectiveness of different treatment strategies. This knowledge is essential for the development of treatment plans and the enhancement of treatment approaches. Conventional survival models, such as the Cox proportional hazards model, require a significant amount of feature engineering or prior knowledge to facilitate personal...
Source: Interdisciplinary Sciences, Computational Life Sciences - March 15, 2024 Category: Bioinformatics Source Type: research

CHL-DTI: A Novel High –Low Order Information Convergence Framework for Effective Drug–Target Interaction Prediction
AbstractRecognizing drug –target interactions (DTI) stands as a pivotal element in the expansive field of drug discovery. Traditional biological wet experiments, although valuable, are time-consuming and costly as methods. Recently, computational methods grounded in network learning have demonstrated great advantages by e ffective topological feature extraction and attracted extensive research attention. However, most existing network-based learning methods only consider the low-order binary correlation between individual drug and target, neglecting the potential higher-order correlation information derived from mult ipl...
Source: Interdisciplinary Sciences, Computational Life Sciences - March 14, 2024 Category: Bioinformatics Source Type: research

SeFilter-DIA: Squeeze-and-Excitation Network for Filtering High-Confidence Peptides of Data-Independent Acquisition Proteomics
In this study, we introduce SeFilter-DIA, a deep learning algorithm, aiming at automating the identification of high-confidence peptides. Leveraging compressed excitation neural network and residual network models, SeFilter-DIA extracts XIC features and effectively discerns between high and low-confidence peptides. Evaluation of the benchmark datasets demonstrates SeFilter-DIA achieving 99.6% AUC on the test set and 97% for other performance indicators. Furthermore, SeFilter-DIA is applicable for screening peptides with phosphorylation modifications. These results demonstrate the potential of SeFilter-DIA to replace manual...
Source: Interdisciplinary Sciences, Computational Life Sciences - March 12, 2024 Category: Bioinformatics Source Type: research

GraphsformerCPI: Graph Transformer for Compound –Protein Interaction Prediction
In this study, we propose GraphsformerCPI, an end-to-end de ep learning framework that improves prediction performance and interpretability. GraphsformerCPI treats compounds and proteins as sequences of nodes with spatial structures, and leverages novel structure-enhanced self-attention mechanisms to integrate semantic and graph structural features within mo lecules for deep molecule representations. To capture the vital association between compound atoms and protein residues, we devise a dual-attention mechanism to effectively extract relational features through .cross-mapping. By extending the powerful learning capabilit...
Source: Interdisciplinary Sciences, Computational Life Sciences - March 8, 2024 Category: Bioinformatics Source Type: research

Identifying Protein Phosphorylation Site-Disease Associations Based on Multi-Similarity Fusion and Negative Sample Selection by Convolutional Neural Network
AbstractAs one of the most important post-translational modifications (PTMs), protein phosphorylation plays a key role in a variety of biological processes. Many studies have shown that protein phosphorylation is associated with various human diseases. Therefore, identifying protein phosphorylation site-disease associations can help to elucidate the pathogenesis of disease and discover new drug targets. Networks of sequence similarity and Gaussian interaction profile kernel similarity were constructed for phosphorylation sites, as well as networks of disease semantic similarity, disease symptom similarity and Gaussian inte...
Source: Interdisciplinary Sciences, Computational Life Sciences - March 8, 2024 Category: Bioinformatics Source Type: research

Predicting Microbe-Disease Associations Based on a Linear Neighborhood Label Propagation Method with Multi-order Similarity Fusion Learning
AbstractComputational approaches employed for predicting potential microbe-disease associations often rely on similarity information between microbes and diseases. Therefore, it is important to obtain reliable similarity information by integrating multiple types of similarity information. However, existing similarity fusion methods do not consider multi-order fusion of similarity networks. To address this problem, a novel method of linear neighborhood label propagation with multi-order similarity fusion learning (MOSFL-LNP) is proposed to predict potential microbe-disease associations. Multi-order fusion learning comprises...
Source: Interdisciplinary Sciences, Computational Life Sciences - March 4, 2024 Category: Bioinformatics Source Type: research

Singular Value Decomposition-Driven Non-negative Matrix Factorization with Application to Identify the Association Patterns of Sarcoma Recurrence
AbstractSarcomas are malignant tumors from mesenchymal tissue and are characterized by their complexity and diversity. The high recurrence rate making it important to understand the mechanisms behind their recurrence and to develop personalized treatments and drugs. However, previous studies on the association patterns of multi-modal data on sarcoma recurrence have overlooked the fact that genes do not act independently, but rather function within signaling pathways. Therefore, this study collected 290 whole solid images, 869 gene and 1387 pathway data of over 260 sarcoma samples from UCSC and TCGA to identify the associat...
Source: Interdisciplinary Sciences, Computational Life Sciences - March 1, 2024 Category: Bioinformatics Source Type: research

Machine Learning Accelerates De Novo Design of Antimicrobial Peptides
AbstractEfficient and precise design of antimicrobial peptides (AMPs) is of great importance in the field of AMP development. Computing provides opportunities for peptide de novo design. In the present investigation, a new machine learning-based AMP prediction model, AP_Sin, was trained using 1160 AMP sequences and 1160 non-AMP sequences. The results showed that AP_Sin correctly classified 94.61% of AMPs on a comprehensive dataset, outperforming the mainstream and open-source models (Antimicrobial Peptide Scanner vr.2, iAMPpred and AMPlify) and being effective in identifying AMPs. In addition, a peptide sequence generator,...
Source: Interdisciplinary Sciences, Computational Life Sciences - February 28, 2024 Category: Bioinformatics Source Type: research