Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction

Paavo Alku, Jouni Pohjalainen, Martti Vainio, Anne Maria Laukkanen, Brad Story

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

Since performance of conventional linear prediction (LP) deteriorates in formant estimation of high-pitched voices, several all-pole modeling methods robust to F0 have been developed. This study compares five such previously known methods and proposes a new technique, Weighted Linear Prediction with Attenuated Main Excitation (WLP-AME). WLP-AME utilizes weighted linear prediction in which the square of the prediction error is multiplied with a weighting function that downgrades the contribution of the glottal source in the model optimization. Consequently, the resulting all-pole model is affected more by the vocal tract characteristics, which leads to more accurate formant estimates. By using synthetic vowels created with a physical modeling approach, the study shows that WLP-AME yields improved formant frequency estimates for high-pitched vowels in comparison to the previously known methods.

Original languageEnglish (US)
Title of host publication13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Pages1610-1613
Number of pages4
StatePublished - Dec 1 2012
Event13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 - Portland, OR, United States
Duration: Sep 9 2012Sep 13 2012

Publication series

Name13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Volume2

Other

Other13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
CountryUnited States
CityPortland, OR
Period9/9/129/13/12

Keywords

  • Formants
  • Linear prediction

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Communication

Fingerprint Dive into the research topics of 'Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction'. Together they form a unique fingerprint.

  • Cite this

    Alku, P., Pohjalainen, J., Vainio, M., Laukkanen, A. M., & Story, B. (2012). Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction. In 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 (pp. 1610-1613). (13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012; Vol. 2).