Show simple item record

dc.contributor.authorSerrano García, Luis
dc.contributor.authorTavarez Arriba, David
dc.contributor.authorSarasola, Xabier
dc.contributor.authorRaman, Sneha
dc.contributor.authorSaratxaga Couceiro, Ibon ORCID
dc.contributor.authorNavas Cordón, Eva ORCID
dc.contributor.authorHernáez Rioja, Inmaculada ORCID
dc.date.accessioned2019-05-15T15:33:18Z
dc.date.available2019-05-15T15:33:18Z
dc.date.issued2018-11-23
dc.identifier.citationIberSPEECH 2018 21-23 November 2018, Barcelona, Spain : 122-126 (2018)es_ES
dc.identifier.urihttp://hdl.handle.net/10810/32818
dc.description.abstractThis paper describes a voice conversion system designed withthe aim of improving the intelligibility and pleasantness of oe-sophageal voices. Two different systems have been built, oneto transform the spectral magnitude and another one for thefundamental frequency, both based on DNNs. Ahocoder hasbeen used to extract the spectral information (mel cepstral co-efficients) and a specific pitch extractor has been developed tocalculate the fundamental frequency of the oesophageal voices.The cepstral coefficients are converted by means of an LSTMnetwork. The conversion of the intonation curve is implementedthrough two different LSTM networks, one dedicated to thevoiced unvoiced detection and another one for the predictionof F0 from the converted cepstral coefficients. The experi-ments described here involve conversion from one oesophagealspeaker to a specific healthy voice. The intelligibility of thesignals has been measured with a Kaldi based ASR system. Apreference test has been implemented to evaluate the subjectivepreference of the obtained converted voices comparing themwith the original oesophageal voice. The results show that spec-tral conversion improves ASR while restoring the intonation ispreferred by human listenerses_ES
dc.description.sponsorshipThis work has been partially funded by the Spanish Ministryof Economy and Competitiveness with FEDER support (RE-STORE project, TEC2015-67163-C2-1-R), the Basque Govern-ment (BerbaOla project, KK-2018/00014) and from the Euro-pean Unions H2020 research and innovation programme un-der the Marie Curie European Training Network ENRICH(675324).es_ES
dc.language.isoenges_ES
dc.publisherInternational Speech Communication Associationes_ES
dc.relationinfo:eu-repo/grantAgreement/EC/H2020/675324es_ES
dc.relationinfo:eu-repo/grantAgreement/MINECO/TEC2015-67163-C2-1-Res_ES
dc.rightsinfo:eu-repo/semantics/openAccesses_ES
dc.subjectvoice conversiones_ES
dc.subjectspeech and voice disorderses_ES
dc.subjectalaryngeal voiceses_ES
dc.subjectspeech intelligibilityes_ES
dc.titleLSTM based voice conversion for laryngectomeeses_ES
dc.typeinfo:eu-repo/semantics/conferenceObjectes_ES
dc.rights.holder(c) 2018 ISCAes_ES
dc.relation.publisherversionhttps://www.isca-speech.org/archive/IberSPEECH_2018/abstracts/IberS18_O3-4_Serrano.htmles_ES
dc.identifier.doi10.21437/IberSPEECH.2018-26
dc.contributor.funderEuropean Commission
dc.departamentoesIngeniería de comunicacioneses_ES
dc.departamentoeuKomunikazioen ingeniaritzaes_ES


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record