An acoustically-driven vocal tract model for stop consonant production

Research output: Contribution to journal › Article

  • 1 Citation

Abstract

The purpose of this study was to further develop a multi-tier model of the vocal tract area function in which the modulations of shape to produce speech are generated by the product of a vowel substrate and a consonant superposition function. The new approach consists of specifying input parameters for a target consonant as a set of directional changes in the resonance frequencies of the vowel substrate. Using calculations of acoustic sensitivity functions, these “resonance deflection patterns” are transformed into time-varying deformations of the vocal tract shape without any direct specification of location or extent of the consonant constriction along the vocal tract. The configuration of the constrictions and expansions that are generated by this process were shown to be physiologically-realistic and produce speech sounds that are easily identifiable as the target consonants. This model is a useful enhancement for area function-based synthesis and can serve as a tool for understanding how the vocal tract is shaped by a talker during speech production.

Language: English (US)
Pages: 1-17
Number of pages: 17
Journal: Speech Communication
Volume: 87
DOI: 10.1016/j.specom.2016.12.001
State: Published - Mar 1 2017

Keywords

  • Area function
  • Formant
  • Resonance
  • Speech modeling
  • Speech synthesis
  • Vocal tract

ASJC Scopus subject areas

  • Software
  • Language and Linguistics
  • Modeling and Simulation
  • Communication
  • Linguistics and Language
  • Computer Vision and Pattern Recognition
  • Computer Science Applications

Cite this

An acoustically-driven vocal tract model for stop consonant production. / Story, Brad H.; Bunton, Kate.

In: Speech Communication, Vol. 87, 01.03.2017, p. 1-17.

@article{f8aa256dac924fa7a9677664434061af,
title = "An acoustically-driven vocal tract model for stop consonant production",
abstract = "The purpose of this study was to further develop a multi-tier model of the vocal tract area function in which the modulations of shape to produce speech are generated by the product of a vowel substrate and a consonant superposition function. The new approach consists of specifying input parameters for a target consonant as a set of directional changes in the resonance frequencies of the vowel substrate. Using calculations of acoustic sensitivity functions, these “resonance deflection patterns” are transformed into time-varying deformations of the vocal tract shape without any direct specification of location or extent of the consonant constriction along the vocal tract. The configuration of the constrictions and expansions that are generated by this process were shown to be physiologically-realistic and produce speech sounds that are easily identifiable as the target consonants. This model is a useful enhancement for area function-based synthesis and can serve as a tool for understanding how the vocal tract is shaped by a talker during speech production.",
keywords = "Area function, Formant, Resonance, Speech modeling, Speech synthesis, Vocal tract",
author = "Story, Brad H. and Bunton, Kate",
year = "2017",
month = "3",
day = "1",
doi = "10.1016/j.specom.2016.12.001",
language = "English (US)",
volume = "87",
pages = "1--17",
journal = "Speech Communication",
issn = "0167-6393",
publisher = "Elsevier",
}

TY - JOUR
T1 - An acoustically-driven vocal tract model for stop consonant production
AU - Story, Brad H.
AU - Bunton, Kate
PY - 2017/3/1
Y1 - 2017/3/1
AB - The purpose of this study was to further develop a multi-tier model of the vocal tract area function in which the modulations of shape to produce speech are generated by the product of a vowel substrate and a consonant superposition function. The new approach consists of specifying input parameters for a target consonant as a set of directional changes in the resonance frequencies of the vowel substrate. Using calculations of acoustic sensitivity functions, these “resonance deflection patterns” are transformed into time-varying deformations of the vocal tract shape without any direct specification of location or extent of the consonant constriction along the vocal tract. The configuration of the constrictions and expansions that are generated by this process were shown to be physiologically-realistic and produce speech sounds that are easily identifiable as the target consonants. This model is a useful enhancement for area function-based synthesis and can serve as a tool for understanding how the vocal tract is shaped by a talker during speech production.
KW - Area function
KW - Formant
KW - Resonance
KW - Speech modeling
KW - Speech synthesis
KW - Vocal tract
UR - http://www.scopus.com/inward/record.url?scp=85006377675&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85006377675&partnerID=8YFLogxK
U2 - 10.1016/j.specom.2016.12.001
DO - 10.1016/j.specom.2016.12.001
M3 - Article
VL - 87
SP - 1
EP - 17
JO - Speech Communication
T2 - Speech Communication
JF - Speech Communication
SN - 0167-6393
ER -