Show simple item record

dc.contributor.authorIparragirre Letamendia, Amaia ORCID
dc.contributor.authorLumley, Thomas
dc.contributor.authorBarrio Beraza, Irantzu
dc.contributor.authorArostegui Madariaga, Inmaculada ORCID
dc.date.accessioned2023-06-19T17:35:56Z
dc.date.available2023-06-19T17:35:56Z
dc.date.issued2023-12
dc.identifier.citationStat 12(1) : (2023) // Article ID e578es_ES
dc.identifier.issn2049-1573
dc.identifier.urihttp://hdl.handle.net/10810/61467
dc.description.abstractVariable selection is an important step to end up with good prediction models. LASSO regression models are one of the most commonly used methods for this purpose, for which cross-validation is the most widely applied validation technique to choose the tuning parameter . Validation techniques in a complex survey framework are closely related to “replicate weights”. However, to our knowledge, they have never been used in a LASSO regression context. Applying LASSO regression models to complex survey data could be challenging. The goal of this paper is twofold. On the one hand, we analyze the performance of replicate weights methods to select the tuning parameter for fitting LASSO regression models to complex survey data. On the other hand, we propose new replicate weights methods for the same purpose. In particular, we propose a new design-based cross-validation method as a combination of the traditional cross-validation and replicate weights. The performance of all these methods has been analyzed and compared by means of an extensive simulation study to the traditional cross-validation technique to select the tuning parameter for LASSO regression models. The results suggest a considerable improvement when the new proposal design-based cross-validation is used instead of the traditional cross-validation.es_ES
dc.description.sponsorshipThis work was financially supported in part by grants from the Departamento de Educación, Política Lingüística y Cultura del Gobierno Vasco IT1456-22 and by the Ministry of Science and Innovation through BCAM Severo Ochoa accreditation CEX2021-001142-S/MICIN/AEI/10.13039/501100011033 and through project PID2020-115882RB-I00/AEI/10.13039/501100011033 funded by Agencia Estatal de Investigación and acronym “S3M1P4R” and also by the Basque Government through the BERC 2022-2025 program. The work of AI was supported by grant PIF18/213. Open Access funding is provided by the University of the Basque Country.es_ES
dc.language.isoenges_ES
dc.publisherWileyes_ES
dc.relationinfo:eu-repo/grantAgreement/MICINN/CEX2021-001142-Ses_ES
dc.relationinfo:eu-repo/grantAgreement/MICINN/PID2020-115882RB-I00es_ES
dc.rightsinfo:eu-repo/semantics/openAccesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/es/*
dc.titleVariable selection with LASSO regression for complex survey dataes_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.rights.holder© 2023 The Authors. Stat published by John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.es_ES
dc.rights.holderAtribución 3.0 España*
dc.relation.publisherversionhttps://onlinelibrary.wiley.com/doi/full/10.1002/sta4.578es_ES
dc.identifier.doi10.1002/sta4.578
dc.departamentoesMatemáticases_ES
dc.departamentoeuMatematikaes_ES


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

© 2023 The Authors. Stat published by John Wiley & Sons Ltd.
This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
Except where otherwise noted, this item's license is described as © 2023 The Authors. Stat published by John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.