Show simple item record

dc.contributor.advisorErcoreca González, Aitor
dc.contributor.advisorFeldheim, Véronique
dc.contributor.advisorGaray, Roberto
dc.contributor.authorIsmagulov, Bekbol
dc.date.accessioned2021-07-15T10:43:09Z
dc.date.available2021-07-15T10:43:09Z
dc.date.issued2021-06
dc.identifier.urihttp://hdl.handle.net/10810/52468
dc.descriptionxiii, 67 p.es_ES
dc.description.abstractMissing data is one of the most common issues of the raw data in data analysis. Missing-ness could be ignored if it is considered not to have a significant impact on the analysis. In other cases, imputation methods are applied to handle them as machine learning models performed on the data with missing values may have a drastic decrease of the quality with the existence of the missing points. This thesis aims to determine the accuracy of the predictions of single and multiple imputation methods on the energy data as well as con-sidering the impact the weather variables have on them. To test the methods, the case study was conducted on four separate smart energy meter data from residential buildings located in Tartu, Estonia and each data set also comprised weather variables collected independently by the University of Tartu. The artificial miss-ing values were entered in the clean data to examine the imputation techniques which allowed to compare the outcome with the original complete data set. The results demon-strated the higher accuracy for multiple imputation methods as opposed to the univariate analysis and the importance of highly correlated variables for the prediction of missing points. We conclude that the increase of the variables included for the prediction of the analysis of the missing values is likely to increase the accuracy of the method as well. Despite multiple imputations appear to have the best accuracy, the challenges related to the con-current missing values for all variables coming from the same sensor should be considered.es_ES
dc.language.isoenges_ES
dc.rightsinfo:eu-repo/semantics/openAccesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/*
dc.subjectbig dataes_ES
dc.subjectdata analysises_ES
dc.subjecttreatment of missing dataes_ES
dc.subjectenergy meterses_ES
dc.subjectunivariate imputation methodses_ES
dc.subjectmultivariate imputation methodses_ES
dc.titleAnalysis of imputation methods for data gaps in high resolution smart meters in buildingses_ES
dc.typeinfo:eu-repo/semantics/masterThesises_ES
dc.rights.holder(cc) 2021 Bekbol Ismagulov (cc by-nc-nd 4.0)es_ES


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

(cc) 2021 Bekbol Ismagulov (cc by-nc-nd 4.0)
Except where otherwise noted, this item's license is described as (cc) 2021 Bekbol Ismagulov (cc by-nc-nd 4.0)