dc.contributor.author | Montenegro Portillo, César | |
dc.contributor.author | Santana Hermida, Roberto  | |
dc.contributor.author | Lozano Alonso, José Antonio | |
dc.date.accessioned | 2021-05-18T07:48:28Z | |
dc.date.available | 2021-05-18T07:48:28Z | |
dc.date.issued | 2021-04 | |
dc.identifier.citation | Engineering Applications of Artificial Intelligence 100 : (2021) // Article ID 104189 | es_ES |
dc.identifier.issn | 0952-1976 | |
dc.identifier.issn | 1873-6769 | |
dc.identifier.uri | http://hdl.handle.net/10810/51456 | |
dc.description.abstract | An End-Of-Turn Detection Module (EOTD-M) is an essential component of automatic Spoken Dialogue Systems. The capability of correctly detecting whether a user?s utterance has ended or not improves the accuracy in interpreting the meaning of the message and decreases the latency in the answer. Usually, in dialogue systems, an EOTD-M is coupled with an Automatic Speech Recognition Module (ASR-M) to transmit complete utterances to the Natural Language Understanding unit. Mistakes in the ASR-M transcription can have a strong effect on the performance of the EOTD-M. The actual extent of this effect depends on the particular combination of ASR M transcription errors and the sentence featurization techniques implemented as part of the EOTD-M. In this paper we investigate this important relationship for an EOTD-M based on semantic information and particular characteristics of the speakers (speech profiles). We introduce an Automatic Speech Recognition Simulator (ASR-SIM) that models different types of semantic mistakes in the ASR-M transcription as well as different speech profiles. We use the simulator to evaluate the sensitivity to ASR-M mistakes of a Long Short-Term Memory network classifier trained in EOTD with different featurization techniques. Our experiments reveal the different ways in which the performance of the model is influenced by the ASR-M errors. We corroborate that not only is the ASR-SIM useful to estimate the performance of an EOTD-M in customized noisy scenarios, but it can also be used to generate training datasets with the expected error rates of real working conditions, which leads to better performance. | es_ES |
dc.description.sponsorship | The research presented in this paper has been conducted as part of the project EMPATHIC that has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 769872.
Jose A. Lozano is partially supported by the Basque Government through the BERC 2018-2021 program, IT1244-19 and grant "Artificial Intelligence in BCAM number EXP. 2019/00432'' and by the Spanish Ministry of Science, Innovation and Universities: BCAM Severo Ochoa accreditation SEV-2017-0718, TIN2016-78365-R and PID2019-104966GB-I00. And R. Santana acknowledge support by the Spanish Ministry of Science, Innovation and Universities (Project TIN201678365-R and PID2019-104966GB-I00), and the Basque Government (IT1244-19 and ELKARTEK Programs) | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | Elsevier | es_ES |
dc.relation | info:eu-repo/grantAgreement/EC/H2020/769872 | es_ES |
dc.relation | info:eu-repo/grantAgreement/MICIU/SEV-2017-0718 | es_ES |
dc.relation | info:eu-repo/grantAgreement/MICINN/TIN2016-78365-R | es_ES |
dc.relation | info:eu-repo/grantAgreement/MICINN/PID2019-104966GB-I00 | es_ES |
dc.rights | info:eu-repo/semantics/openAccess | es_ES |
dc.rights.uri | http://creativecommons.org/licenses/by/3.0/es/ | * |
dc.subject | spoken dialogue systems | es_ES |
dc.subject | automatic speech recognition | es_ES |
dc.subject | end of turn detection | es_ES |
dc.subject | natural language processing | es_ES |
dc.subject | neural networks | es_ES |
dc.title | Analysis of the Sensitivity of the End-Of-Turn Detection Task to Errors Generated by the Automatic Speech Recognition Process | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dc.rights.holder | This is an open access article distributed under the terms of the Creative Commons CC-BY license | es_ES |
dc.rights.holder | Atribución 3.0 España | * |
dc.relation.publisherversion | https://www-sciencedirect-com.ehu.idm.oclc.org/science/article/pii/S0952197621000361?via%3Dihub#! | es_ES |
dc.identifier.doi | 10.1016/j.engappai.2021.104189 | |
dc.contributor.funder | European Commission | |
dc.departamentoes | Ciencia de la computación e inteligencia artificial | es_ES |
dc.departamentoeu | Konputazio zientziak eta adimen artifiziala | es_ES |