A preliminary study of voice quality transformation based on modifications to the neutral vocal tract area function

Brad H. Story, Ingo R. Titze

Research output: Contribution to journal › Article › peer-review

15 Scopus citations

Abstract

The idea is pursued that voice quality can be partially represented by the underlying shape of a speaker's neutral vocal tract. Using an area function model, which allows direct access to the neutral tract shape, four separate modifications were made to one male speaker's vocal tract. The modifications involve imposing constrictive or expansive effects on the pharyngeal and oral portions of the neutral area function as well as on lip aperture and the epi-laryngeal tube. A single word utterance was first synthesized by superimposing deformation patterns appropriate for the word onto the original neutral tract shape (area function). Then, four additional samples of the word were synthesized using a different modified neutral area function each time. The modifications were assessed by comparing F1-F2 formant trajectories of the original utterance with those of the modifications. The formant frequencies were observed to shift within the F1-F2 plane in directions predictable from simple tube acoustics. However, the modified voice qualities did not preserve the shape of the original F1-F2 trajectory. In other words, the modifications did not create a simple linear transformation of formant frequencies even though the "articulatory dynamics" (deformation patterns of the area function) were identical in all cases. These somewhat artificial vocal tract modifications were also compared with formant frequencies extracted from recordings of a speaker attempting to produce the same types of modifications. In general, the speaker's formant trajectories showed some similarities to the synthesized versions. However, the speaker also seemed to grade the "level" of the voice quality that was exerted on the utterance depending on whether the demands of the voice quality were in competition with the linguistic demands of a given phonetic segment. Finally, to demonstrate this type of voice quality modification in a broader context, the same procedures were applied to sentence-level speech and results were again shown as F1-F2 formant trajectories.
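
The statement that formants shift in directions predictable from simple tube acoustics can be illustrated with a minimal sketch. This is not the authors' area function model or synthesizer. It assumes a lossless concatenated-tube approximation with an ideal open termination at the lips, and the tract length, section count, areas, physical constants, and function names (chain_d, formants) are illustrative assumptions rather than values taken from the study.

```python
import numpy as np

C_SOUND = 350.0  # m/s, assumed speed of sound in the vocal tract
RHO = 1.14       # kg/m^3, assumed air density (cancels out of the resonances)


def chain_d(areas, section_len, freq):
    """D element of the glottis-to-lips chain (ABCD) matrix at freq in Hz.

    With an ideal open end (zero pressure at the lips), the volume-velocity
    transfer function is 1/D, so resonances occur where D crosses zero.
    """
    k = 2.0 * np.pi * freq / C_SOUND
    c, s = np.cos(k * section_len), np.sin(k * section_len)
    total = np.eye(2, dtype=complex)
    for area in areas:  # sections ordered from glottis to lips
        z = RHO * C_SOUND / area  # characteristic impedance of the section
        total = total @ np.array([[c, 1j * z * s], [1j * s / z, c]])
    return total[1, 1].real  # D is real for lossless sections


def _bisect(areas, section_len, lo, hi, tol=1e-3):
    """Refine a zero crossing of D between lo and hi (Hz) by bisection."""
    d_lo = chain_d(areas, section_len, lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        d_mid = chain_d(areas, section_len, mid)
        if d_lo * d_mid <= 0.0:
            hi = mid
        else:
            lo, d_lo = mid, d_mid
    return 0.5 * (lo + hi)


def formants(areas, section_len, f_max=4000.0, step=5.0):
    """Resonance frequencies (Hz) below f_max, found from sign changes of D."""
    freqs = np.arange(0.5 * step, f_max, step)  # offset grid avoids exact zeros
    d = np.array([chain_d(areas, section_len, f) for f in freqs])
    return [
        _bisect(areas, section_len, freqs[i], freqs[i + 1])
        for i in range(len(freqs) - 1)
        if d[i] * d[i + 1] < 0.0
    ]


# Hypothetical "neutral" tract: uniform tube, 17.5 cm long, 44 sections, 3 cm^2.
N_SECTIONS = 44
SECTION_LEN = 0.175 / N_SECTIONS
uniform = np.full(N_SECTIONS, 3.0e-4)

# Hypothetical modification: halve the area of the back (pharyngeal) half.
pharynx_constricted = uniform.copy()
pharynx_constricted[: N_SECTIONS // 2] *= 0.5

print("uniform:            ", np.round(formants(uniform, SECTION_LEN)[:3]))
print("pharynx constricted:", np.round(formants(pharynx_constricted, SECTION_LEN)[:3]))
```

Under these assumptions the uniform tube resonates near the quarter-wavelength values of 500, 1500, and 2500 Hz, while the pharyngeal constriction raises F1 and lowers F2. The two formants therefore do not move by a common scale factor, which is one simple way to see why such modifications need not act as a linear transformation of the F1-F2 plane.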

Original language: English (US)
Pages (from-to): 485-509
Number of pages: 25
Journal: Journal of Phonetics
Volume: 30
Issue number: 3
State: Published - Jul 2002

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language
  • Speech and Hearing
