UPV-EHU ADDI
  • Back
    • English
    • español
    • Basque
  • Login
  • English 
    • English
    • español
    • Basque
  • FAQ
View Item 
  •   ADDI
  • INVESTIGACIÓN
  • Documentos de Trabajo e Informes Técnicos
  • Informes técnicos y Documentos de trabajo
  • View Item
  •   ADDI
  • INVESTIGACIÓN
  • Documentos de Trabajo e Informes Técnicos
  • Informes técnicos y Documentos de trabajo
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

A sensitivity study of bias and variance of k-fold cross-validation in prediction error estimation

Thumbnail
View/Open
tr09-00-1.pdf (1.891Mb)
Date
2009
Author
Rodríguez Fernández, Juan Diego
Pérez Martínez, Aritz
Lozano Alonso, José Antonio
Metadata
Show full item record
  Estadisticas en RECOLECTA
(LA Referencia)

URI
http://hdl.handle.net/10810/4628
Abstract
In the machine learning field the performance of a classifier is usually measured in terms of prediction error. In most real-world problems, the error cannot be exactly calculated and it must be estimated. Therefore, it’s important to choose an appropriate estimator of the error. This paper analyzes the statistical properties (bias and variance) of the k-fold cross-validation classification error estimator (k-cv). Our main contribution is a novel theoretical decomposition of the variance of the k-cv considering its sources of variance: sensitivity to changes in the training set and sensitivity to changes in the folds. The paper also compares the bias and variance of the estimator for different values of k. The empirical study has been performed in artificial domains because they allow the exact computation of the implied quantities and we can specify rigorously the conditions of experimentation. The empirical study has been performed for two different classifiers (naïve Bayes and nearest neighbor), different number of folds (2, 5, 10, n) and sample sizes, and training sets coming from assorted probability distributions.
Collections
  • Informes técnicos y Documentos de trabajo

DSpace 6.4 software copyright © -2023  DuraSpace
OpenAIRE
EHU Bilbioteka
 

 

Browse

All of ADDICommunities & CollectionsBy Issue DateAuthorsTitlesDepartamentos (cas.)Departamentos (eus.)SubjectsThis CollectionBy Issue DateAuthorsTitlesDepartamentos (cas.)Departamentos (eus.)Subjects

My Account

Login

Statistics

View Usage Statistics

DSpace 6.4 software copyright © -2023  DuraSpace
OpenAIRE
EHU Bilbioteka