Identification of Gene Signatures for Classifying of Breast Cancer Subtypes Using Protein Interaction Database and Support Vector Machines

(2015) Identification of Gene Signatures for Classifying of Breast Cancer Subtypes Using Protein Interaction Database and Support Vector Machines. 2015 5th International Conference on Computer and Knowledge Engineering (Iccke). pp. 195-200.

Full text not available from this repository.

Abstract

Many studies have used the microarray gene expression data in order to classifying breast cancer subtypes. However, the classification accuracy was not acceptable in many cases even by applying the algorithms to only a single set of data. In this regard, using appropriate algorithm in every step of whole procedure, applying useful bioinformatics databases, considering the interaction among genes, and properly combining analytical steps are the main challenging problems. In this study a solution was proposed which followed a three step process. In the first step a filter feature selection method was used to produce a small set of informative genes. In the second step, the primary selected genes were mapped on the protein-protein interaction network to extend the gene set according to the linking among corresponding proteins. Thus, a portion of genes that was pruned in the first stage is added again to the primary set of selected genes. In the final stage, by using support vector machine-based recursive feature elimination (SVMRFE) method, the final set of informative genes was identified. After that, we compared our proposed algorithm with decision tree methods in the same datasets. The proposed procedure was evaluated on two publicly available DNA microarray dataset, including 456 samples on breast cancer. The proposed algorithm reached to 100 accuracy for predicting Luminal B by using the JMI method in the first step. In conclusion the proposed method showed an appealing improvement in classification accuracy for a multiclass prediction problem. We can predict subtypes with greater than 91.2 overall accuracy by proposed algorithm. However, the accuracy of prediction subtypes in tree decision method is 78.6.

Item Type: Article
Keywords: breast tumor subtype feature selection method protein-protein interaction support vector machine microarray data expression data classification selection guidelines tumors
Page Range: pp. 195-200
Journal or Publication Title: 2015 5th International Conference on Computer and Knowledge Engineering (Iccke)
Journal Index: ISI
Depositing User: مهندس مهدی شریفی
URI: http://eprints.mui.ac.ir/id/eprint/5171

Actions (login required)

View Item View Item