Using an Ensemble to Identify and Classify Macroalgae Antimicrobial Peptides

AbstractThe rapid spread of multi-drug resistant microbes has lead researchers to discover natural alternative remedies such as antimicrobial peptides (AMPs). In the first line of defense, AMPs display a broad spectrum of potent activity against multi-resistant pathogenic bacteria, viruses, fungi, and even cancer. AMPs can be further characterised into families according to amino acid composition, secondary structure, and function. However, despite recent advancements in rapid computational methods for AMP prediction from various mammalian, aquatic, and terrestrial species, there is limited information regarding their presence, functional roles, and family type from marine macroalgae. In this paper, we present a promising two-tier ensemble of heterogeneous machine learning models that integrates seven well-known machine learning classifiers to predict AMPs from macroalgae. The first tier of the ensemble consists of a suite of binary classifiers that identify AMPs from protein sequence data which are then forwarded to a second-tier multi-class ensemble to characterise their functional family type. The two-tier ensemble was successfully used to identify 39 putative AMP sequences in 12 macroalgae species from three different phyla groups. The approach we describe is not limited to AMPs and can also be applied to search sequence data for other types of proteins.
Source: Interdisciplinary Sciences, Computational Life Sciences - Category: Bioinformatics Source Type: research