Comparison of methods for sequential screening of large compound sets

Paul E. Blower, Kevin P. Cross, Gabriel S. Eichler, Glenn J. Myatt, John N. Weinstein, Chihae Yang

Research output: Contribution to journalArticlepeer-review

16 Scopus citations

Abstract

Sequential screening is an iterative procedure that can greatly increase hit rates over random screening or noniterative procedures. We studied the effects of three factors on enrichment rates: the method used to rank compounds, the molecular descriptor set and the selection of initial training set. The primary factor influencing recovery rates was the method of selecting the initial training set. Rates for recovering active compounds were substantially lower with the diverse training sets than they were with training sets selected by other methods. Because structure-activity information is incrementally enhanced in intermediate training sets, sequential screening provides significant improvement in the average rate of recovery of active compounds when compared with non-iterative selection procedures.

Original languageEnglish (US)
Pages (from-to)115-122
Number of pages8
JournalCombinatorial Chemistry and High Throughput Screening
Volume9
Issue number2
DOIs
StatePublished - Feb 2006
Externally publishedYes

Keywords

  • K-nearest neighbors
  • Nadaraya-Watson kernel
  • Random forest
  • Recursive partitioning
  • Sequential screening
  • Training set selection

ASJC Scopus subject areas

  • Drug Discovery
  • Computer Science Applications
  • Organic Chemistry

Fingerprint

Dive into the research topics of 'Comparison of methods for sequential screening of large compound sets'. Together they form a unique fingerprint.

Cite this