Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction

Paavo Alku, Jouni Pohjalainen, Martti Vainio, Anne Maria Laukkanen, Brad H Story

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

Since performance of conventional linear prediction (LP) deteriorates in formant estimation of high-pitched voices, several all-pole modeling methods robust to F0 have been developed. This study compares five such previously known methods and proposes a new technique, Weighted Linear Prediction with Attenuated Main Excitation (WLP-AME). WLP-AME utilizes weighted linear prediction in which the square of the prediction error is multiplied with a weighting function that downgrades the contribution of the glottal source in the model optimization. Consequently, the resulting all-pole model is affected more by the vocal tract characteristics, which leads to more accurate formant estimates. By using synthetic vowels created with a physical modeling approach, the study shows that WLP-AME yields improved formant frequency estimates for high-pitched vowels in comparison to the previously known methods.

Original languageEnglish (US)
Title of host publication13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Pages1610-1613
Number of pages4
Volume2
StatePublished - 2012
Event13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 - Portland, OR, United States
Duration: Sep 9 2012Sep 13 2012

Other

Other13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
CountryUnited States
CityPortland, OR
Period9/9/129/13/12

Fingerprint

Frequency estimation
Pole
Poles
optimization model
weighting
performance

Keywords

  • Formants
  • Linear prediction

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Communication

Cite this

Alku, P., Pohjalainen, J., Vainio, M., Laukkanen, A. M., & Story, B. H. (2012). Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction. In 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 (Vol. 2, pp. 1610-1613)

Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction. / Alku, Paavo; Pohjalainen, Jouni; Vainio, Martti; Laukkanen, Anne Maria; Story, Brad H.

13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012. Vol. 2 2012. p. 1610-1613.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Alku, P, Pohjalainen, J, Vainio, M, Laukkanen, AM & Story, BH 2012, Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction. in 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012. vol. 2, pp. 1610-1613, 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Portland, OR, United States, 9/9/12.
Alku P, Pohjalainen J, Vainio M, Laukkanen AM, Story BH. Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction. In 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012. Vol. 2. 2012. p. 1610-1613
Alku, Paavo ; Pohjalainen, Jouni ; Vainio, Martti ; Laukkanen, Anne Maria ; Story, Brad H. / Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012. Vol. 2 2012. pp. 1610-1613
@inproceedings{86f1e7d111aa40ebae1d5e5163a899c9,
title = "Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction",
abstract = "Since performance of conventional linear prediction (LP) deteriorates in formant estimation of high-pitched voices, several all-pole modeling methods robust to F0 have been developed. This study compares five such previously known methods and proposes a new technique, Weighted Linear Prediction with Attenuated Main Excitation (WLP-AME). WLP-AME utilizes weighted linear prediction in which the square of the prediction error is multiplied with a weighting function that downgrades the contribution of the glottal source in the model optimization. Consequently, the resulting all-pole model is affected more by the vocal tract characteristics, which leads to more accurate formant estimates. By using synthetic vowels created with a physical modeling approach, the study shows that WLP-AME yields improved formant frequency estimates for high-pitched vowels in comparison to the previously known methods.",
keywords = "Formants, Linear prediction",
author = "Paavo Alku and Jouni Pohjalainen and Martti Vainio and Laukkanen, {Anne Maria} and Story, {Brad H}",
year = "2012",
language = "English (US)",
isbn = "9781622767595",
volume = "2",
pages = "1610--1613",
booktitle = "13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012",

}

TY - GEN

T1 - Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction

AU - Alku, Paavo

AU - Pohjalainen, Jouni

AU - Vainio, Martti

AU - Laukkanen, Anne Maria

AU - Story, Brad H

PY - 2012

Y1 - 2012

N2 - Since performance of conventional linear prediction (LP) deteriorates in formant estimation of high-pitched voices, several all-pole modeling methods robust to F0 have been developed. This study compares five such previously known methods and proposes a new technique, Weighted Linear Prediction with Attenuated Main Excitation (WLP-AME). WLP-AME utilizes weighted linear prediction in which the square of the prediction error is multiplied with a weighting function that downgrades the contribution of the glottal source in the model optimization. Consequently, the resulting all-pole model is affected more by the vocal tract characteristics, which leads to more accurate formant estimates. By using synthetic vowels created with a physical modeling approach, the study shows that WLP-AME yields improved formant frequency estimates for high-pitched vowels in comparison to the previously known methods.

AB - Since performance of conventional linear prediction (LP) deteriorates in formant estimation of high-pitched voices, several all-pole modeling methods robust to F0 have been developed. This study compares five such previously known methods and proposes a new technique, Weighted Linear Prediction with Attenuated Main Excitation (WLP-AME). WLP-AME utilizes weighted linear prediction in which the square of the prediction error is multiplied with a weighting function that downgrades the contribution of the glottal source in the model optimization. Consequently, the resulting all-pole model is affected more by the vocal tract characteristics, which leads to more accurate formant estimates. By using synthetic vowels created with a physical modeling approach, the study shows that WLP-AME yields improved formant frequency estimates for high-pitched vowels in comparison to the previously known methods.

KW - Formants

KW - Linear prediction

UR - http://www.scopus.com/inward/record.url?scp=84878402077&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84878402077&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84878402077

SN - 9781622767595

VL - 2

SP - 1610

EP - 1613

BT - 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012

ER -