TY - GEN
T1 - Consensus scoring criteria in structure-based virtual screening
AU - Yang, Jinn-Moon
AU - Frank Hsu, D.
PY - 2005
Y1 - 2005
N2 - In this paper, we demonstrated that combining multiple scoring functions improves enrichment of true positives only if (a) each of the individual scoring functions has relatively high performance, and (b) the individual scoring functions are distinctive. The major weakness, the inability to consistently identify true positives (leads), of virtual screening is likely due to the imprecise scoring algorithms. It has been demonstrated that consensus scoring improves enrichment of true positives, but they are yet to provide theoretical analysis that gives insight into real features of combinations and data fusion for virtual screening. This work thus establishes a potential theoretical basis for the probable success of data fusion approaches to improve yields in silico screening experiments. We provide initial validation of this theoretical approach using data from five scoring systems with two evolutionary docking algorithms on four targets, thymidine kinase, human dihydrofolate reductase, and estrogen receptors of antagonists and agonists. Results of the experiment show a fairly significant improvement (vs. single algorithms) in several measures of scoring quality, specifically "goodness-of-hit" scores, false positive rate, and "enrichment".
AB - In this paper, we demonstrated that combining multiple scoring functions improves enrichment of true positives only if (a) each of the individual scoring functions has relatively high performance, and (b) the individual scoring functions are distinctive. The major weakness, the inability to consistently identify true positives (leads), of virtual screening is likely due to the imprecise scoring algorithms. It has been demonstrated that consensus scoring improves enrichment of true positives, but they are yet to provide theoretical analysis that gives insight into real features of combinations and data fusion for virtual screening. This work thus establishes a potential theoretical basis for the probable success of data fusion approaches to improve yields in silico screening experiments. We provide initial validation of this theoretical approach using data from five scoring systems with two evolutionary docking algorithms on four targets, thymidine kinase, human dihydrofolate reductase, and estrogen receptors of antagonists and agonists. Results of the experiment show a fairly significant improvement (vs. single algorithms) in several measures of scoring quality, specifically "goodness-of-hit" scores, false positive rate, and "enrichment".
UR - http://www.scopus.com/inward/record.url?scp=33751162600&partnerID=8YFLogxK
U2 - 10.1109/EITC.2005.1544376
DO - 10.1109/EITC.2005.1544376
M3 - Conference contribution
AN - SCOPUS:33751162600
SN - 0780393295
SN - 9780780393295
T3 - Emerging Information Technology Conference 2005
SP - 165
EP - 167
BT - Emerging Information Technology Conference 2005
T2 - Emerging Information Technology Conference 2005
Y2 - 15 August 2005 through 16 August 2005
ER -