Vocal tract area functions for an adult female speaker based on volumetric imaging

Brad H Story, Ingo R. Titze, Eric A. Hoffman

Research output: Contribution to journalArticle

64 Citations (Scopus)

Abstract

Magnetic resonance imaging (MRI) was used to acquire vocal tract shapes of ten vowels and two liquid approximants for a 27-year-old adult female. These images were complemented with additional images acquired with electron beam computed tomography (CT). Each 3-D shape was condensed into a set of cross-sectional areas of oblique sections perpendicular to the centerline of the vocal tract's long axis, resulting in an 'area function.' Formant frequencies computed for each area function showed reasonable similarity to those determined from the natural (recorded) speech of the imaged subject, but differences suggest that some of the imaged vocal tract shapes were articulated differently during imaging than during recording of natural speech, and also that imaging procedures may have compromised some accuracy for a few shapes. The formant calculations also confirmed the significant effect that the pitiform sinus can have on lowering the formant frequencies. A comparison is made between area functions derived using both MRI and CT methods. Additionally, the area functions reported in this study are compared with those from two previous studies and demonstrate general similarities in shape but also obvious differences that can be attributed to anatomical differences of the imaged subjects and to differences in imaging techniques and image processing methods.

Original languageEnglish (US)
Pages (from-to)471-487
Number of pages17
JournalJournal of the Acoustical Society of America
Volume104
Issue number1
DOIs
StatePublished - 1998
Externally publishedYes

Fingerprint

magnetic resonance
tomography
sinuses
vowels
imaging techniques
image processing
recording
Imaging
Vocal Tract
electron beams
liquids
Magnetic Resonance Imaging
Computed Tomography
Formant Frequencies
Image Processing
Liquid
Formants
Natural Speech

ASJC Scopus subject areas

  • Acoustics and Ultrasonics

Cite this

Vocal tract area functions for an adult female speaker based on volumetric imaging. / Story, Brad H; Titze, Ingo R.; Hoffman, Eric A.

In: Journal of the Acoustical Society of America, Vol. 104, No. 1, 1998, p. 471-487.

Research output: Contribution to journalArticle

@article{8cb4ab949dc849a0ad3bb2a450e2f6fd,
title = "Vocal tract area functions for an adult female speaker based on volumetric imaging",
abstract = "Magnetic resonance imaging (MRI) was used to acquire vocal tract shapes of ten vowels and two liquid approximants for a 27-year-old adult female. These images were complemented with additional images acquired with electron beam computed tomography (CT). Each 3-D shape was condensed into a set of cross-sectional areas of oblique sections perpendicular to the centerline of the vocal tract's long axis, resulting in an 'area function.' Formant frequencies computed for each area function showed reasonable similarity to those determined from the natural (recorded) speech of the imaged subject, but differences suggest that some of the imaged vocal tract shapes were articulated differently during imaging than during recording of natural speech, and also that imaging procedures may have compromised some accuracy for a few shapes. The formant calculations also confirmed the significant effect that the pitiform sinus can have on lowering the formant frequencies. A comparison is made between area functions derived using both MRI and CT methods. Additionally, the area functions reported in this study are compared with those from two previous studies and demonstrate general similarities in shape but also obvious differences that can be attributed to anatomical differences of the imaged subjects and to differences in imaging techniques and image processing methods.",
author = "Story, {Brad H} and Titze, {Ingo R.} and Hoffman, {Eric A.}",
year = "1998",
doi = "10.1121/1.423298",
language = "English (US)",
volume = "104",
pages = "471--487",
journal = "Journal of the Acoustical Society of America",
issn = "0001-4966",
publisher = "Acoustical Society of America",
number = "1",

}

TY - JOUR

T1 - Vocal tract area functions for an adult female speaker based on volumetric imaging

AU - Story, Brad H

AU - Titze, Ingo R.

AU - Hoffman, Eric A.

PY - 1998

Y1 - 1998

N2 - Magnetic resonance imaging (MRI) was used to acquire vocal tract shapes of ten vowels and two liquid approximants for a 27-year-old adult female. These images were complemented with additional images acquired with electron beam computed tomography (CT). Each 3-D shape was condensed into a set of cross-sectional areas of oblique sections perpendicular to the centerline of the vocal tract's long axis, resulting in an 'area function.' Formant frequencies computed for each area function showed reasonable similarity to those determined from the natural (recorded) speech of the imaged subject, but differences suggest that some of the imaged vocal tract shapes were articulated differently during imaging than during recording of natural speech, and also that imaging procedures may have compromised some accuracy for a few shapes. The formant calculations also confirmed the significant effect that the pitiform sinus can have on lowering the formant frequencies. A comparison is made between area functions derived using both MRI and CT methods. Additionally, the area functions reported in this study are compared with those from two previous studies and demonstrate general similarities in shape but also obvious differences that can be attributed to anatomical differences of the imaged subjects and to differences in imaging techniques and image processing methods.

AB - Magnetic resonance imaging (MRI) was used to acquire vocal tract shapes of ten vowels and two liquid approximants for a 27-year-old adult female. These images were complemented with additional images acquired with electron beam computed tomography (CT). Each 3-D shape was condensed into a set of cross-sectional areas of oblique sections perpendicular to the centerline of the vocal tract's long axis, resulting in an 'area function.' Formant frequencies computed for each area function showed reasonable similarity to those determined from the natural (recorded) speech of the imaged subject, but differences suggest that some of the imaged vocal tract shapes were articulated differently during imaging than during recording of natural speech, and also that imaging procedures may have compromised some accuracy for a few shapes. The formant calculations also confirmed the significant effect that the pitiform sinus can have on lowering the formant frequencies. A comparison is made between area functions derived using both MRI and CT methods. Additionally, the area functions reported in this study are compared with those from two previous studies and demonstrate general similarities in shape but also obvious differences that can be attributed to anatomical differences of the imaged subjects and to differences in imaging techniques and image processing methods.

UR - http://www.scopus.com/inward/record.url?scp=0031876670&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0031876670&partnerID=8YFLogxK

U2 - 10.1121/1.423298

DO - 10.1121/1.423298

M3 - Article

C2 - 9670539

AN - SCOPUS:0031876670

VL - 104

SP - 471

EP - 487

JO - Journal of the Acoustical Society of America

JF - Journal of the Acoustical Society of America

SN - 0001-4966

IS - 1

ER -