Vocal-tract modeling: Fractional elongation of segment lengths in a waveguide model with half-sample delays

Research output: Contribution to journal › Article

16 Scopus citations

Abstract

Digital waveguide models are commonly used for simulating vocal-tract acoustics based on physiological data. In particular, waveguide models with half-sample delays are known to be well suited for speech production research. This paper presents enhancements to such a model, aimed at improved accuracy in mapping physiological vocal-tract data (shape and length of the airway) to waveguide parameters. The enhancements allow the length of the vocal tract to be continuously varied, thus enabling more realistic synthesis. This is achieved by smoothly varying the individual segment lengths of a piecewise-cylindrical representation of the airway, without altering the system sampling frequency. Fractional-delay filters are used for spatial interpolation of the digital waveguide model. The algorithms are validated by modeling the protrusion of lips, lowering of larynx and lengthening of intermediate segments for a static vowel shape.
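
The abstract does not say which fractional-delay filter design the paper uses. As a generic illustration only, the sketch below computes the taps of a Lagrange-interpolation FIR filter, one common way to approximate a non-integer delay at a fixed sampling frequency. The function names `lagrange_fractional_delay` and `apply_fir`, and the default filter order, are assumptions made for this example and are not taken from the paper.

```python
def lagrange_fractional_delay(delay, order=3):
    """Return FIR taps h[0..order] that approximate a delay of `delay` samples.

    Generic Lagrange-interpolation design (illustrative assumption, not
    necessarily the filter used in the paper):
        h[n] = prod_{k != n} (delay - k) / (n - k)
    """
    taps = []
    for n in range(order + 1):
        coeff = 1.0
        for k in range(order + 1):
            if k != n:
                coeff *= (delay - k) / (n - k)
        taps.append(coeff)
    return taps


def apply_fir(taps, signal):
    """Filter `signal` with `taps`, treating samples before index 0 as zero."""
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for n, h in enumerate(taps):
            if i - n >= 0:
                acc += h * signal[i - n]
        out.append(acc)
    return out


if __name__ == "__main__":
    # A first-order filter with a half-sample delay is plain linear interpolation.
    print(lagrange_fractional_delay(0.5, order=1))
    # Delay a ramp by 1.3 samples with a third-order filter.
    ramp = [float(i) for i in range(8)]
    print(apply_fir(lagrange_fractional_delay(1.3, order=3), ramp))
```

In a waveguide setting, a filter of this kind could stand in for the non-integer part of a segment's propagation delay, so that the segment length can change smoothly while the sampling frequency stays fixed. How the paper actually integrates its filters with the half-sample-delay structure is not stated in the abstract.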

Original language: English (US)
Pages (from-to): 1754-1762
Number of pages: 9
Journal: IEEE Transactions on Audio, Speech and Language Processing
Volume: 14
Issue number: 5
State: Published - Sep 1 2006

Keywords

  • Articulatory speech synthesis
  • Digital waveguides
  • Vocal tract modeling

ASJC Scopus subject areas

  • Acoustics and Ultrasonics
  • Electrical and Electronic Engineering
