Evaluation of different classification models to extract gene signatures for breast cancer recurrence using microarray data

(2017) Evaluation of different classification models to extract gene signatures for breast cancer recurrence using microarray data. Journal of Isfahan Medical School. pp. 98-103. ISSN 10277595 (ISSN)

Full text not available from this repository.

Abstract

Background: In this study, we aimed to improve the reliability and biological interpretability of gene signatures selected from microarrays by efficient usage of computational models and mathematical algorithms. Methods: At the first step, a good model with high accuracy was chosen to predict cancer recurrence in microarray gene expression data on breast tumors. In this regard, microarray gene expression data of breast tumor in 1271 cancer patients (379 with recurrence and 892 people without recurrence) were utilized to construct an appropriate predictive model for recurrence by comparing the performance of multiple classifiers. In the pre-processing stage, different methods like correlation-based feature selection (CFS), principal component analysis (PCA), independent component analysis (ICA), and genetic algorithm as well as a random selection method were used to reduce the dimensions and choose the most appropriate genes (features). Findings: A total of five gene signatures were selected by combining genetic algorithm, top scoring set (TSS), and random selection method, which showed the best results in most classification models. The final indicator genes were TRIP13, KIF20A, NEK2, RACGAP1 and TYMS, which had significant contribution in the structure of microtubules and spindle and also regulated the attachment of spindle microtubules to kinetochore. Conclusion: By using hybrid models, we can avoid overfitting in training and achieve acceptable accuracy with biologically interpretable genes. © 2017, Isfahan University of Medical Sciences(IUMS). All rights reserved.

Item Type: Article
Keywords: Algorithms Biomarkers Breast cancer Classification Gene expression profiling Article cancer recurrence centromere classifier correlation based feature selection gene genetic algorithm human independent component analysis information processing kif20a gene major clinical study microarray analysis microtubule nek2 gene principal component analysis racgap1 gene spindle cell trip13 gene tyms gene
Divisions: School of Advanced Technologies in Medicine > Department of Bioelectrics and Biomedical Engineering
Page Range: pp. 98-103
Journal or Publication Title: Journal of Isfahan Medical School
Journal Index: Scopus
Volume: 35
Number: 419
ISSN: 10277595 (ISSN)
Depositing User: مهندس مهدی شریفی
URI: http://eprints.mui.ac.ir/id/eprint/1994

Actions (login required)

View Item View Item