Classification and similarity analysis of binding-database: A survey on application of multi-class classifiers for deriving general rules from large compound databases

(2017) Classification and similarity analysis of binding-database: A survey on application of multi-class classifiers for deriving general rules from large compound databases. Journal of Isfahan Medical School. pp. 400-405. ISSN 10277595 (ISSN)

Full text not available from this repository.

Abstract

Background: In this research, we extracted and modified features of active ligands related to specific biological targets with combination of data mining and classification methods to aid medicinal chemists in their drug discovery projects. Preparing an inactive ligand is the major problem for development of multi-class classifiers. Therefore, our models were developed based on only active ligands found in Binding-database (DB) without any needs for preparing inactive molecules. Methods: Our database consisted of 160372 ligands in 45 classes of common proteins and 1497 different features (topological, chemistry, physical, etc.) were calculated for each molecule. Then, the specific features of active ligands of any target were extracted based on combination of linear discriminate analysis and Apriori algorithm. Findings: Receiver operating characteristic (ROC) was a useful operator to analysis the accuracy and sensitivity of classification models and retrieving molecules from ZINC and Binding-DB databases. Area under curve (AUC) of this diagram was evaluated for analysis of each target in Zinc and Binding-DB and their results were 0.8341 ± 0.1495 and 0.8615 ± 0.1502, respectively. Conclusion: Specific features of active ligands could be found using the methodology described in this work and with these features, we can sort each database based on corresponding target. AUC shows that the present method is useful for virtual screening in big databases without survey on inactive ligands. © 2017, Isfahan University of Medical Sciences(IUMS). All Rights Reserved.

Item Type: Article
Keywords: Data mining Database management systems Ligands Multiple classification Virtual systems ligand accuracy area under the curve Article Binding-database classification algorithm data base database management system receiver operating characteristic sensitivity and specificity ZINC database
Divisions: School of Advanced Technologies in Medicine > Department of Bioelectrics and Biomedical Engineering
Page Range: pp. 400-405
Journal or Publication Title: Journal of Isfahan Medical School
Journal Index: Scopus
Volume: 35
Number: 426
ISSN: 10277595 (ISSN)
Depositing User: مهندس مهدی شریفی
URI: http://eprints.mui.ac.ir/id/eprint/1934

Actions (login required)

View Item View Item