Numéro |
Med Sci (Paris)
Volume 34, October 2018
Cancer biomarkers
|
|
---|---|---|
Page(s) | 52 - 58 | |
DOI | https://doi.org/10.1051/medsci/201834f110 | |
Publié en ligne | 7 novembre 2018 |
Computational QSAR model combined molecular descriptors and fingerprints to predict HDAC1 inhibitors
Division of Orthopaedic Surgery, Huashan Hospital, Fudan University, Shanghai, China
* Corresponding author: Yibing Wei doctorwei196@126.com
The dynamic balance between acetylation and deacetylation of histones plays a crucial role in the epigenetic regulation of gene expression. It is equilibrated by two families of enzymes: histone acetyltransferases and histone deacetylases (HDACs). HDACs repress transcription by regulating the conformation of the higher-order chromatin structure. HDAC inhibitors have recently become a class of chemical agents for potential treatment of the abnormal chromatin remodeling process involved in certain cancers. In this study, we constructed a large dataset to predict the activity value of HDAC1 inhibitors. Each compound was represented with seven fingerprints, and computational models were subsequently developed to predict HDAC1 inhibitors via five machine learning methods. These methods include naïve Bayes, κ-nearest neighbor, C4.5 decision tree, random forest, and support vector machine (SVM) algorithms. The best predicting model was CDK fingerprint with SVM, which exhibited an accuracy of 0.89. This model also performed best in five-fold cross-validation. Some representative substructure alerts responsible for HDAC1 inhibitors were identified by using MoSS in KNIME, which could facilitate the identification of HDAC1 inhibitors.
Key words: QSAR / fingerprints / HDAC1 inhibitors
© 2018 médecine/sciences – Inserm
Les statistiques affichées correspondent au cumul d'une part des vues des résumés de l'article et d'autre part des vues et téléchargements de l'article plein-texte (PDF, Full-HTML, ePub... selon les formats disponibles) sur la platefome Vision4Press.
Les statistiques sont disponibles avec un délai de 48 à 96 heures et sont mises à jour quotidiennement en semaine.
Le chargement des statistiques peut être long.