North Carolina Central University


Antitumor agents 252. Application of validated QSAR models to database mining: discovery of novel tylophorine derivatives as potential anticancer agents.

Publication Type  Journal Article
Author  Zhang S, Wei L, Bastow K, Zheng W, Brossi A, Lee K, Tropsha A
Year of Publication  2007
Secondary Title  J Comput Aided Mol Des
Volume  21
Pagination  97--112
Date Published  Jan/Mar
Type of Work  article
Publication Language  eng
Key Words  Alkaloids
Abstract  

A combined approach of validated QSAR modeling and virtual screening was successfully applied to the discovery of novel tylophrine derivatives as anticancer agents. QSAR models have been initially developed for 52 chemically diverse phenanthrine-based tylophrine derivatives (PBTs) with known experimental EC(50) using chemical topological descriptors (calculated with the MolConnZ program) and variable selection k nearest neighbor (kNN) method. Several validation protocols have been applied to achieve robust QSAR models. The original dataset was divided into multiple training and test sets, and the models were considered acceptable only if the leave-one-out cross-validated R (2) (q (2)) values were greater than 0.5 for the training sets and the correlation coefficient R (2) values were greater than 0.6 for the test sets. Furthermore, the q (2) values for the actual dataset were shown to be significantly higher than those obtained for the same dataset with randomized target properties (Y-randomization test), indicating that models were statistically significant. Ten best models were then employed to mine a commercially available ChemDiv Database (ca. 500 K compounds) resulting in 34 consensus hits with moderate to high predicted activities. Ten structurally diverse hits were experimentally tested and eight were confirmed active with the highest experimental EC(50) of 1.8 microM implying an exceptionally high hit rate (80\%). The same ten models were further applied to predict EC50 for four new PBTs, and the correlation coefficient (R (2)) between the experimental and predicted EC(50) for these compounds plus eight active consensus hits was shown to be as high as 0.57. Our studies suggest that the approach combining validated QSAR modeling and virtual screening could be successfully used as a general tool for the discovery of novel biologically active compounds.

Citation Key  171