Estimation of the voice source from speech pressure signals: Evaluation of an inverse filtering technique using physical modelling of voice production

Paavo Alku, Brad H Story, Matti Airas

Research output: Contribution to journalArticle

29 Citations (Scopus)

Abstract

Objective: The goal of the study is to use physical modelling of voice production to assess the performance of an inverse filtering method in estimating the glottal flow from acoustic speech pressure signals. Methods: An automatic inverse filtering method is presented, and speech pressure signals are generated using physical modelling of voice production so as to obtain test vowels with a known shape of the glottal excitation waveform. The speech sounds produced consist of 4 different vowels, each with 10 different values of the fundamental frequency. Both the original glottal flows given by physical modelling and their estimates computed by inverse filtering were parametrised with two robust voice source parameters: the normalized amplitude quotient and the difference (in decibels) between the levels of the first and second harmonics. Results: The results show that for both extracted parameters the error introduced by inverse filtering was, in general, small. The effect of the distortion caused by inverse filtering on the parameter values was clearly smaller than the change in the corresponding parameters when the phonation type was altered. The distortion was largest for high-pitched vowels with the lowest value of the first formant. Conclusions: The study shows that the proposed inverse filtering technique combined with the extracted parameters constitutes a voice source analysis tool that is able to measure the voice source dynamics automatically with satisfactory accuracy.

Original languageEnglish (US)
Pages (from-to)102-113
Number of pages12
JournalFolia Phoniatrica et Logopaedica
Volume58
Issue number2
DOIs
StatePublished - Feb 2006

Fingerprint

Pressure
evaluation
Speech Acoustics
source analysis
Phonation
Phonetics
acoustics
Physical
Modeling
Evaluation
performance
Values

Keywords

  • Glottis
  • Inverse filtering
  • Physical modelling
  • Voice production

ASJC Scopus subject areas

  • Otorhinolaryngology
  • Health Professions(all)

Cite this

@article{5da17413efa544f9826342eff6dde3d3,
title = "Estimation of the voice source from speech pressure signals: Evaluation of an inverse filtering technique using physical modelling of voice production",
abstract = "Objective: The goal of the study is to use physical modelling of voice production to assess the performance of an inverse filtering method in estimating the glottal flow from acoustic speech pressure signals. Methods: An automatic inverse filtering method is presented, and speech pressure signals are generated using physical modelling of voice production so as to obtain test vowels with a known shape of the glottal excitation waveform. The speech sounds produced consist of 4 different vowels, each with 10 different values of the fundamental frequency. Both the original glottal flows given by physical modelling and their estimates computed by inverse filtering were parametrised with two robust voice source parameters: the normalized amplitude quotient and the difference (in decibels) between the levels of the first and second harmonics. Results: The results show that for both extracted parameters the error introduced by inverse filtering was, in general, small. The effect of the distortion caused by inverse filtering on the parameter values was clearly smaller than the change in the corresponding parameters when the phonation type was altered. The distortion was largest for high-pitched vowels with the lowest value of the first formant. Conclusions: The study shows that the proposed inverse filtering technique combined with the extracted parameters constitutes a voice source analysis tool that is able to measure the voice source dynamics automatically with satisfactory accuracy.",
keywords = "Glottis, Inverse filtering, Physical modelling, Voice production",
author = "Paavo Alku and Story, {Brad H} and Matti Airas",
year = "2006",
month = "2",
doi = "10.1159/000089611",
language = "English (US)",
volume = "58",
pages = "102--113",
journal = "Folia Phoniatrica et Logopaedica",
issn = "1021-7762",
publisher = "S. Karger AG",
number = "2",

}

TY - JOUR

T1 - Estimation of the voice source from speech pressure signals

T2 - Evaluation of an inverse filtering technique using physical modelling of voice production

AU - Alku, Paavo

AU - Story, Brad H

AU - Airas, Matti

PY - 2006/2

Y1 - 2006/2

N2 - Objective: The goal of the study is to use physical modelling of voice production to assess the performance of an inverse filtering method in estimating the glottal flow from acoustic speech pressure signals. Methods: An automatic inverse filtering method is presented, and speech pressure signals are generated using physical modelling of voice production so as to obtain test vowels with a known shape of the glottal excitation waveform. The speech sounds produced consist of 4 different vowels, each with 10 different values of the fundamental frequency. Both the original glottal flows given by physical modelling and their estimates computed by inverse filtering were parametrised with two robust voice source parameters: the normalized amplitude quotient and the difference (in decibels) between the levels of the first and second harmonics. Results: The results show that for both extracted parameters the error introduced by inverse filtering was, in general, small. The effect of the distortion caused by inverse filtering on the parameter values was clearly smaller than the change in the corresponding parameters when the phonation type was altered. The distortion was largest for high-pitched vowels with the lowest value of the first formant. Conclusions: The study shows that the proposed inverse filtering technique combined with the extracted parameters constitutes a voice source analysis tool that is able to measure the voice source dynamics automatically with satisfactory accuracy.

AB - Objective: The goal of the study is to use physical modelling of voice production to assess the performance of an inverse filtering method in estimating the glottal flow from acoustic speech pressure signals. Methods: An automatic inverse filtering method is presented, and speech pressure signals are generated using physical modelling of voice production so as to obtain test vowels with a known shape of the glottal excitation waveform. The speech sounds produced consist of 4 different vowels, each with 10 different values of the fundamental frequency. Both the original glottal flows given by physical modelling and their estimates computed by inverse filtering were parametrised with two robust voice source parameters: the normalized amplitude quotient and the difference (in decibels) between the levels of the first and second harmonics. Results: The results show that for both extracted parameters the error introduced by inverse filtering was, in general, small. The effect of the distortion caused by inverse filtering on the parameter values was clearly smaller than the change in the corresponding parameters when the phonation type was altered. The distortion was largest for high-pitched vowels with the lowest value of the first formant. Conclusions: The study shows that the proposed inverse filtering technique combined with the extracted parameters constitutes a voice source analysis tool that is able to measure the voice source dynamics automatically with satisfactory accuracy.

KW - Glottis

KW - Inverse filtering

KW - Physical modelling

KW - Voice production

UR - http://www.scopus.com/inward/record.url?scp=32944458861&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=32944458861&partnerID=8YFLogxK

U2 - 10.1159/000089611

DO - 10.1159/000089611

M3 - Article

C2 - 16479132

AN - SCOPUS:32944458861

VL - 58

SP - 102

EP - 113

JO - Folia Phoniatrica et Logopaedica

JF - Folia Phoniatrica et Logopaedica

SN - 1021-7762

IS - 2

ER -