UPV-EHU ADDI

A uniform phase representation for the harmonic model in speech synthesis applications

File
s13636-014-0038-1-1.pdf (2.815Mb)
Date
2014-10-16
Author
Degottex, Gilles
Erro Eslava, Daniel
Published in
EURASIP Journal on Audio, Speech, and Music Processing 2014 (2014), Article ID 38
URI
http://hdl.handle.net/10810/15924
Abstract
Feature-based vocoders, e.g., STRAIGHT, offer a way to manipulate the perceived characteristics of the speech signal in speech transformation and synthesis. For the harmonic model, which provides excellent perceived quality, features for the amplitude parameters already exist (e.g., Line Spectral Frequencies (LSF), Mel-Frequency Cepstral Coefficients (MFCC)). However, because of the wrapping of the phase parameters, phase features are more difficult to design. To randomize the phase of the harmonic model during synthesis, a voicing feature is commonly used, which distinguishes voiced and unvoiced segments. However, voice production allows smooth transitions between voiced and unvoiced states, which makes voicing segmentation sometimes tricky to estimate. In this article, two phase features are suggested to represent the phase of the harmonic model in a uniform way, without a voicing decision. The synthesis quality of the resulting vocoder has been evaluated, using subjective listening tests, in the context of resynthesis, pitch scaling, and Hidden Markov Model (HMM)-based synthesis. The experiments show that the suggested signal model is comparable to STRAIGHT or even better in some scenarios. They also reveal some limitations of the harmonic framework itself in the case of high fundamental frequencies.
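The abstract's remark that phase wrapping complicates feature design can be illustrated with a small sketch. This is not the authors' method or their proposed features; it only shows, under the assumption that phases are stored wrapped to [-pi, pi], why ordinary arithmetic on them can mislead. The helper names `wrap`, `naive_mean`, and `circular_mean` are hypothetical.

```python
import cmath
import math


def wrap(phase):
    """Wrap a phase value in radians to the interval [-pi, pi]."""
    return math.atan2(math.sin(phase), math.cos(phase))


def naive_mean(phases):
    """Arithmetic mean of wrapped phase values (ignores circularity)."""
    return sum(phases) / len(phases)


def circular_mean(phases):
    """Mean direction: average the unit phasors, then take the angle."""
    return cmath.phase(sum(cmath.exp(1j * p) for p in phases))


# Two phases that are close together on the circle but fall on
# opposite sides of the +/-pi wrapping point.
phases = [wrap(math.pi - 0.1), wrap(math.pi + 0.1)]

print("wrapped values:", phases)
print("naive mean:    ", naive_mean(phases))
print("circular mean: ", circular_mean(phases))
```

The arithmetic mean of the two wrapped values lands near 0, on the opposite side of the circle from both inputs, while the phasor-based mean stays near the wrapping point at plus or minus pi. Statistical modeling that averages or interpolates parameters runs into this kind of discontinuity when applied to raw wrapped phase.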
Collections
  • Artículos

DSpace software copyright © 2002-2016  DuraSpace
OpenAIRE
EHU Biblioteka
 

 
