Show simple item record

dc.contributor.authorBen Letaifa Zouari, Leila
dc.contributor.authorTorres Barañano, María Inés ORCID
dc.date.accessioned2021-04-13T18:04:04Z
dc.date.available2021-04-13T18:04:04Z
dc.date.issued2021-04-07
dc.identifier.citationIEEE Access 9 : 55939-55954 (2021)es_ES
dc.identifier.urihttp://hdl.handle.net/10810/50913
dc.description.abstractSpeech is a behavioural biometric signal that can provide important information to understand the human intends as well as their emotional status. The paper is centered on the speech-based identification of the seniors’s emotional status during their interaction with a virtual agent playing the role of a health professional coach. Under real conditions, we can just identify a small set of task-dependent spontaneous emotions. The number of identified samples is largely different for each emotion, which results in an imbalanced dataset problem. This research proposes the dimensional model of emotions as a perceptual representation space alternative to the generally used acoustic one. The main contribution of the paper is the definition of a perceptual borderline for the oversampling of minority emotion classes in this space. This limit, based on arousal and valence criteria, leads to two methods of balancing the data: the Perceptual Borderline oversampling and the Perceptual Borderline SMOTE (Synthetic Minority Oversampling Technique). Both methods are implemented and compared to state-of-the-art approaches of Random oversampling and SMOTE. The experimental evaluation was carried out on three imbalanced datasets of spontaneous emotions acquired in human-machine scenarios in three different cultures: Spain, France and Norway. The emotion recognition results obtained by neural networks classifiers show that the proposed perceptual oversampling methods led to significant improvements when compared with the state-of-the art, for all scenarios and languages.es_ES
dc.description.sponsorshipThe research presented in this paper is conducted as partof the project EMPATHIC and of the MENHIR MSCAaction that have received funding from the European Union’s Horizon 2020 research and innovation program under grant agreements No 769872 an No 823907 respectiveles_ES
dc.language.isoenges_ES
dc.publisherIEEEes_ES
dc.relationinfo:eu-repo/grantAgreement/EC/H2020/769872es_ES
dc.relationinfo:eu-repo/grantAgreement/EC/H2020/823907es_ES
dc.rightsinfo:eu-repo/semantics/openAccesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectdimensional model of emotionses_ES
dc.subjectemotion recognitiones_ES
dc.subjectmulti-class classificationes_ES
dc.subjectperceptual borderlinees_ES
dc.subjectspeech analysises_ES
dc.subjectspeech processinges_ES
dc.titlePerceptual borderline for balancing multi-class spontaneous emotional dataes_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.rights.holder(cc) 2021 This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/es_ES
dc.relation.publisherversionhttps://ieeexplore.ieee.org/document/9398699es_ES
dc.identifier.doi10.1109/ACCESS.2021.3071485
dc.contributor.funderEuropean Commission
dc.departamentoesElectricidad y electrónicaes_ES
dc.departamentoeuElektrizitatea eta elektronikaes_ES


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

(cc) 2021 This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
Except where otherwise noted, this item's license is described as (cc) 2021 This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/