Noisecut: a python package for noise-tolerant classification of binary data using prior knowledge integration and max-cut solutions
Classification of binary data arises naturally in many clinical applications, such as patient risk stratification through ICD codes. One of the key practical challenges in data classification using machine lea... (Source: BMC Bioinformatics)
Source: BMC Bioinformatics - April 20, 2024 Category: Bioinformatics Authors: Moein E. Samadi, Hedieh Mirzaieazar, Alexander Mitsos and Andreas Schuppert Tags: Software Source Type: research

Drug-Online: an online platform for drug-target interaction, affinity, and binding sites identification using deep learning
Accurately identifying drug-target interaction (DTI), affinity (DTA), and binding sites (DTS) is crucial for drug screening, repositioning, and design, as well as for understanding the functions of targets. Alt... (Source: BMC Bioinformatics)
Source: BMC Bioinformatics - April 20, 2024 Category: Bioinformatics Authors: Xin Zeng, Guang-Peng Su, Shu-Juan Li, Shuang-Qing Lv, Meng-Liang Wen and Yi Li Tags: Software Source Type: research

A protein network refinement method based on module discovery and biological information
The identification of essential proteins can help in understanding the minimum requirements for cell survival and development to discover drug targets and prevent disease. Nowadays, node ranking methods are a ... (Source: BMC Bioinformatics)
Source: BMC Bioinformatics - April 20, 2024 Category: Bioinformatics Authors: Li Pan, Haoyue Wang, Bo Yang and Wenbin Li Tags: Research Source Type: research

MMGAT: a graph attention network framework for ATAC-seq motifs finding
Motif finding in Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) data is essential to reveal the intricacies of transcription factor binding sites (TFBSs) and their pivotal roles in gene... (Source: BMC Bioinformatics)
Source: BMC Bioinformatics - April 20, 2024 Category: Bioinformatics Authors: Xiaotian Wu, Wenju Hou, Ziqi Zhao, Lan Huang, Nan Sheng, Qixing Yang, Shuangquan Zhang and Yan Wang Tags: Research Source Type: research

TrieDedup: a fast trie-based deduplication algorithm to handle ambiguous bases in high-throughput sequencing
High-throughput sequencing is a powerful tool that is extensively applied in biological studies. However, sequencers may produce low-quality bases, leading to ambiguous bases, ‘N’s. PCR duplicates introduced i... (Source: BMC Bioinformatics)
Source: BMC Bioinformatics - April 18, 2024 Category: Bioinformatics Authors: Jianqiao Hu, Sai Luo, Ming Tian and Adam Yongxin Ye Tags: Software Source Type: research

Biomedical semantic text summarizer
Text summarization is a challenging problem in Natural Language Processing, which involves condensing the content of textual documents without losing their overall meaning and information content. In the domai... (Source: BMC Bioinformatics)
Source: BMC Bioinformatics - April 16, 2024 Category: Bioinformatics Authors: Mahira Kirmani, Gagandeep Kour, Mudasir Mohd, Nasrullah Sheikh, Dawood Ashraf Khan, Zahid Maqbool, Mohsin Altaf Wani and Abid Hussain Wani Tags: Research Source Type: research

MetageNN: a memory-efficient neural network taxonomic classifier robust to sequencing errors and missing genomes
With the rapid increase in throughput of long-read sequencing technologies, recent studies have explored their potential for taxonomic classification by using alignment-based approaches to reduce the impact of... (Source: BMC Bioinformatics)
Source: BMC Bioinformatics - April 16, 2024 Category: Bioinformatics Authors: Rafael Peres da Silva, Chayaporn Suphavilai and Niranjan Nagarajan Tags: Research Source Type: research

Inference of genomic landscapes using ordered Hidden Markov Models with emission densities (oHMMed)
Genomes are inherently inhomogeneous, with features such as base composition, recombination, gene density, and gene expression varying along chromosomes. Evolutionary, biological, and biomedical analyses aim t... (Source: BMC Bioinformatics)
Source: BMC Bioinformatics - April 16, 2024 Category: Bioinformatics Authors: Claus Vogl, Mariia Karapetiants, Burçin Yıldırım, Hrönn Kjartansdóttir, Carolin Kosiol, Juraj Bergman, Michal Majka and Lynette Caitlin Mikula Tags: Research Source Type: research

Designing and delivering bioinformatics project-based learning in East Africa
The Eastern Africa Network for Bioinformatics Training (EANBiT) has matured through continuous evaluation, feedback, and codesign. We highlight how the program has evolved to meet challenges and achieve its go... (Source: BMC Bioinformatics)
Source: BMC Bioinformatics - April 14, 2024 Category: Bioinformatics Authors: Caleb K. Kibet, Jean-Baka Domelevo Entfellner, Daudi Jjingo, Etienne Pierre de Villiers, Santie de Villiers, Karen Wambui, Sam Kinyanjui and Daniel Masiga Tags: Research Source Type: research

MultiToxPred 1.0: a novel comprehensive tool for predicting 27 classes of protein toxins using an ensemble machine learning approach
Protein toxins are defense mechanisms and adaptations found in various organisms and microorganisms, and their use in scientific research as therapeutic candidates is gaining relevance due to their effectivene... (Source: BMC Bioinformatics)
Source: BMC Bioinformatics - April 12, 2024 Category: Bioinformatics Authors: Jorge F. Beltrán, Lisandra Herrera-Belén, Fernanda Parraguez-Contreras, Jorge G. Farías, Jorge Machuca-Sepúlveda and Stefania Short Tags: Software Source Type: research

Biomarker discovery with quantum neural networks: a case-study in CTLA4-activation pathways
Biomarker discovery is a challenging task due to the massive search space. Quantum computing and quantum Artificial Intelligence (quantum AI) can be used to address the computational problem of biomarker disco... (Source: BMC Bioinformatics)
Source: BMC Bioinformatics - April 12, 2024 Category: Bioinformatics Authors: Phuong-Nam Nguyen Tags: Research Source Type: research

KEGG orthology prediction of bacterial proteins using natural language processing
The advent of high-throughput technologies has led to an exponential increase in uncharacterized bacterial protein sequences, surpassing the capacity of manual curation. A large number of bacterial protein seq... (Source: BMC Bioinformatics)
Source: BMC Bioinformatics - April 11, 2024 Category: Bioinformatics Authors: Jing Chen, Haoyu Wu and Ning Wang Tags: Research Source Type: research

Control of false discoveries in grouped hypothesis testing for eQTL data
Expression quantitative trait locus (eQTL) analysis aims to detect the genetic variants that influence the expression of one or more genes. Gene-level eQTL testing forms a natural grouped-hypothesis testing st... (Source: BMC Bioinformatics)
Source: BMC Bioinformatics - April 11, 2024 Category: Bioinformatics Authors: Pratyaydipta Rudra, Yi-Hui Zhou, Andrew Nobel and Fred A. Wright Tags: Research Source Type: research

DPI_CDF: druggable protein identifier using cascade deep forest
Drug targets in living beings perform pivotal roles in the discovery of potential drugs. Conventional wet-lab characterization of drug targets, although accurate, is generally expensive, slow, and resource i... (Source: BMC Bioinformatics)
Source: BMC Bioinformatics - April 5, 2024 Category: Bioinformatics Authors: Muhammad Arif, Ge Fang, Ali Ghulam, Saleh Musleh and Tanvir Alam Tags: Research Source Type: research

Multiple phenotype association tests based on sliced inverse regression
Joint analysis of multiple phenotypes in studies of biological systems such as Genome-Wide Association Studies is critical to revealing the functional interactions between various traits and genetic variants, ... (Source: BMC Bioinformatics)
Source: BMC Bioinformatics - April 4, 2024 Category: Bioinformatics Authors: Wenyuan Sun, Kyongson Jon and Wensheng Zhu Tags: Research Source Type: research