An acoustically-driven vocal tract model for stop consonant production

Research output: Research - peer-reviewArticle

Abstract

The purpose of this study was to further develop a multi-tier model of the vocal tract area function in which the modulations of shape to produce speech are generated by the product of a vowel substrate and a consonant superposition function. The new approach consists of specifying input parameters for a target consonant as a set of directional changes in the resonance frequencies of the vowel substrate. Using calculations of acoustic sensitivity functions, these “resonance deflection patterns” are transformed into time-varying deformations of the vocal tract shape without any direct specification of location or extent of the consonant constriction along the vocal tract. The configuration of the constrictions and expansions that are generated by this process were shown to be physiologically-realistic and produce speech sounds that are easily identifiable as the target consonants. This model is a useful enhancement for area function-based synthesis and can serve as a tool for understanding how the vocal tract is shaped by a talker during speech production.

LanguageEnglish (US)
Pages1-17
Number of pages17
JournalSpeech Communication
Volume87
DOIs
StatePublished - Mar 1 2017

Fingerprint

Model
Vocal Tract
Consonant
Stop Consonants
Substrate
Target
Speech
Substrates
Speech Production
Multi-model
Resonance Frequency
Deflection
Superposition
Time-varying
Acoustics
Modulation
Enhancement
Synthesis
Specification
Configuration

Keywords

  • Area function
  • Formant
  • Resonance
  • Speech modeling
  • Speech synthesis
  • Vocal tract

ASJC Scopus subject areas

  • Software
  • Language and Linguistics
  • Modeling and Simulation
  • Communication
  • Linguistics and Language
  • Computer Vision and Pattern Recognition
  • Computer Science Applications

Cite this

An acoustically-driven vocal tract model for stop consonant production. / Story, Brad H.; Bunton, Kate.

In: Speech Communication, Vol. 87, 01.03.2017, p. 1-17.

Research output: Research - peer-reviewArticle

@article{f8aa256dac924fa7a9677664434061af,
title = "An acoustically-driven vocal tract model for stop consonant production",
abstract = "The purpose of this study was to further develop a multi-tier model of the vocal tract area function in which the modulations of shape to produce speech are generated by the product of a vowel substrate and a consonant superposition function. The new approach consists of specifying input parameters for a target consonant as a set of directional changes in the resonance frequencies of the vowel substrate. Using calculations of acoustic sensitivity functions, these “resonance deflection patterns” are transformed into time-varying deformations of the vocal tract shape without any direct specification of location or extent of the consonant constriction along the vocal tract. The configuration of the constrictions and expansions that are generated by this process were shown to be physiologically-realistic and produce speech sounds that are easily identifiable as the target consonants. This model is a useful enhancement for area function-based synthesis and can serve as a tool for understanding how the vocal tract is shaped by a talker during speech production.",
keywords = "Area function, Formant, Resonance, Speech modeling, Speech synthesis, Vocal tract",
author = "Story, {Brad H.} and Kate Bunton",
year = "2017",
month = "3",
doi = "10.1016/j.specom.2016.12.001",
volume = "87",
pages = "1--17",
journal = "Speech Communication",
issn = "0167-6393",
publisher = "Elsevier",

}

TY - JOUR

T1 - An acoustically-driven vocal tract model for stop consonant production

AU - Story,Brad H.

AU - Bunton,Kate

PY - 2017/3/1

Y1 - 2017/3/1

N2 - The purpose of this study was to further develop a multi-tier model of the vocal tract area function in which the modulations of shape to produce speech are generated by the product of a vowel substrate and a consonant superposition function. The new approach consists of specifying input parameters for a target consonant as a set of directional changes in the resonance frequencies of the vowel substrate. Using calculations of acoustic sensitivity functions, these “resonance deflection patterns” are transformed into time-varying deformations of the vocal tract shape without any direct specification of location or extent of the consonant constriction along the vocal tract. The configuration of the constrictions and expansions that are generated by this process were shown to be physiologically-realistic and produce speech sounds that are easily identifiable as the target consonants. This model is a useful enhancement for area function-based synthesis and can serve as a tool for understanding how the vocal tract is shaped by a talker during speech production.

AB - The purpose of this study was to further develop a multi-tier model of the vocal tract area function in which the modulations of shape to produce speech are generated by the product of a vowel substrate and a consonant superposition function. The new approach consists of specifying input parameters for a target consonant as a set of directional changes in the resonance frequencies of the vowel substrate. Using calculations of acoustic sensitivity functions, these “resonance deflection patterns” are transformed into time-varying deformations of the vocal tract shape without any direct specification of location or extent of the consonant constriction along the vocal tract. The configuration of the constrictions and expansions that are generated by this process were shown to be physiologically-realistic and produce speech sounds that are easily identifiable as the target consonants. This model is a useful enhancement for area function-based synthesis and can serve as a tool for understanding how the vocal tract is shaped by a talker during speech production.

KW - Area function

KW - Formant

KW - Resonance

KW - Speech modeling

KW - Speech synthesis

KW - Vocal tract

UR - http://www.scopus.com/inward/record.url?scp=85006377675&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85006377675&partnerID=8YFLogxK

U2 - 10.1016/j.specom.2016.12.001

DO - 10.1016/j.specom.2016.12.001

M3 - Article

VL - 87

SP - 1

EP - 17

JO - Speech Communication

T2 - Speech Communication

JF - Speech Communication

SN - 0167-6393

ER -