Review article

Assessing the accuracy of prediction algorithms for classification: an overview

Pierre BaldiDepartment of Information and Computer Science, University of California, Irvine, CA 92697, USA. [email protected]Søren BrunakYves ChauvinClaus A. AndersenHenrik Nielsen

2000en

ABI

Abstract

Abstract 4 Also at the Department of Biological Sciences, University of California, Irvine, USA, to whom all correspondence should be addressed. We provide a unified overview of methods that currently are widely used to assess the accuracy of prediction algorithms, from raw percentages, quadratic error measures and other distances, and correlation coefficients, and to information theoretic measures such as relative entropy and mutual information. We briefly discuss the advantages and disadvantages of each approach. For classification tasks, we derive new learning algorithms for the design of prediction systems by directly optimising the correlation coefficient. We observe and prove several results relating sensitivity and specificity of optimal systems. While the principles are general, we illustrate the applicability on specific problems such as protein secondary structure and signal peptide prediction. Contact: [email protected]

Identifiers

DOI: 10.1093/bioinformatics/16.5.412

Citations and references

Cited by 30 references

Metrics — AkademScholar