Gene selection from large-scale gene expression data based on fuzzy interactive multi-objective binary optimization for medical diagnosis

Publication date: 2018Source: Biocybernetics and Biomedical Engineering, Volume 38, Issue 2Author(s): Saleh Shahbeig, Akbar Rahideh, Mohammad Sadegh Helfroush, Kamran KazemiAbstractAn efficient fuzzy interactive multi-objective optimization method is proposed to select the sub-optimal subset of genes from large-scale gene expression data. It is based on the binary particle swarm optimization (BPSO) algorithm tuned by a chaotic method. The proposed method is able to select the sub-optimal subset of genes with the least number of features that can accurately distinguish between the two classes, e.g. the normal and cancerous samples. The proposed method is evaluated on several publicly available microarray and RNA-sequencing gene expression datasets such as leukemia, colon cancer, central nervous system, lung cancer, ovarian cancer, prostate cancer and RNA-seq lung disease. The results indicate that the proposed method can identify the minimum number of genes to achieve the most accuracy, sensitivity and specificity in the classification process. Achieving 100% accuracy in six out of the seven datasets investigated in this study, demonstrates the high capacity of the proposed algorithm to find the sub-optimal subset of genes. This approach is useful in clinical applications to extract the most influential genes on a disease and to find the treatment procedure for the disease.
Source: Biocybernetics and Biomedical Engineering - Category: Biomedical Engineering Source Type: research