UPV-EHU ADDI
  • Back
    • English
    • español
    • Basque
  • Login
  • English 
    • English
    • español
    • Basque
  • FAQ
View Item 
  •   ADDI
  • INVESTIGACIÓN
  • Artículos, Comunicaciones, Libros
  • Comunicaciones
  • View Item
  •   ADDI
  • INVESTIGACIÓN
  • Artículos, Comunicaciones, Libros
  • Comunicaciones
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

LSTM based voice conversion for laryngectomees

Thumbnail
View/Open
Texto completo (278.7Kb)
Date
2018-11-23
Author
Serrano García, Luis
Tavarez Arriba, David
Sarasola, Xabier
Raman, Sneha
Saratxaga Couceiro, Ibon ORCID
Navas Cordón, Eva ORCID
Hernáez Rioja, Inmaculada ORCID
Metadata
Show full item record
  Estadisticas en RECOLECTA
(LA Referencia)

IberSPEECH 2018 21-23 November 2018, Barcelona, Spain : 122-126 (2018)
URI
http://hdl.handle.net/10810/32818
Abstract
This paper describes a voice conversion system designed withthe aim of improving the intelligibility and pleasantness of oe-sophageal voices. Two different systems have been built, oneto transform the spectral magnitude and another one for thefundamental frequency, both based on DNNs. Ahocoder hasbeen used to extract the spectral information (mel cepstral co-efficients) and a specific pitch extractor has been developed tocalculate the fundamental frequency of the oesophageal voices.The cepstral coefficients are converted by means of an LSTMnetwork. The conversion of the intonation curve is implementedthrough two different LSTM networks, one dedicated to thevoiced unvoiced detection and another one for the predictionof F0 from the converted cepstral coefficients. The experi-ments described here involve conversion from one oesophagealspeaker to a specific healthy voice. The intelligibility of thesignals has been measured with a Kaldi based ASR system. Apreference test has been implemented to evaluate the subjectivepreference of the obtained converted voices comparing themwith the original oesophageal voice. The results show that spec-tral conversion improves ASR while restoring the intonation ispreferred by human listeners
Collections
  • Comunicaciones
  • OpenAire

DSpace 6.4 software copyright © -2023  DuraSpace
OpenAIRE
EHU Bilbioteka
 

 

Browse

All of ADDICommunities & CollectionsBy Issue DateAuthorsTitlesDepartamentos (cas.)Departamentos (eus.)SubjectsThis CollectionBy Issue DateAuthorsTitlesDepartamentos (cas.)Departamentos (eus.)Subjects

My Account

Login

Statistics

View Usage Statistics

DSpace 6.4 software copyright © -2023  DuraSpace
OpenAIRE
EHU Bilbioteka