dc.contributor.author | Garciarena Hualde, Unai | |
dc.contributor.author | Mendiburu Alberro, Alexander | |
dc.contributor.author | Santana Hermida, Roberto | |
dc.date.accessioned | 2025-01-20T15:41:20Z | |
dc.date.available | 2025-01-20T15:41:20Z | |
dc.date.issued | 2018-10-04 | |
dc.identifier.citation | 2018 IEEE Congress on Evolutionary Computation (CEC) (pp. 1-8). IEEE. | es_ES |
dc.identifier.isbn | 978-1-5090-6017-7 | |
dc.identifier.uri | http://hdl.handle.net/10810/71607 | |
dc.description.abstract | Strategies to automate the selection of Machine Learning algorithms and their parameters have gained popularity in recent years, to the point that the term Automated Machine Learning has been coined. The most general version of this problem is pipeline optimization, which seeks an optimal combination of preprocessors and classifiers, along with their respective parameters. In this paper we address the pipeline generation problem from a broader perspective: understanding the complexity of the problem before proposing a solution, a step we consider critical. The main contribution of this work is the analysis of the characteristics of the fitness landscape. Furthermore, a recently introduced tool for pipeline generation is used to investigate how an automatic method behaves in the previously studied landscape. Results show the high complexity of the pipeline optimization problem, as it can contain several dispersed optima and suffers from a severe lack of generality. Results also suggest that, depending on the dimensions of the search, the model quality target, and the data being modeled, basic search methods can produce results that match the user's expectations. | es_ES |
dc.description.sponsorship | This work has received support from the IT-609-13 (Basque Government) and TIN2016-78365-R (Spanish Ministry of Economy, Industry and Competitiveness, http://www.mineco.gob.es/portal/site/mineco) programs. Unai Garciarena holds a predoctoral grant (ref. PIF16/238) from the University of the Basque Country. | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | IEEE | es_ES |
dc.rights | info:eu-repo/semantics/openAccess | es_ES |
dc.subject | genetic programming | es_ES |
dc.subject | supervised classification | es_ES |
dc.subject | automated machine learning | es_ES |
dc.title | Analysis of the Complexity of the Automatic Pipeline Generation Problem | es_ES |
dc.type | info:eu-repo/semantics/conferenceObject | es_ES |
dc.rights.holder | © 2018 IEEE | es_ES |
dc.relation.publisherversion | https://doi.org/10.1109/CEC.2018.8477662 | es_ES |
dc.identifier.doi | https://doi.org/10.1109/CEC.2018.8477662 | |
dc.identifier.doi | 10.1109/CEC.2018.8477662 | |
dc.departamentoes | Ciencia de la computación e inteligencia artificial | es_ES |
dc.departamentoeu | Konputazio zientziak eta adimen artifiziala | es_ES |