Quasi closed phase glottal inverse filtering analysis with weighted linear prediction

Manu Airaksinen, Tuomo Raitio, Brad H Story, Paavo Alku

Research output: Contribution to journalArticle

62 Citations (Scopus)

Abstract

This study presents a new glottal inverse filtering (GIF) technique based on closed phase analysis over multiple fundamental periods. The proposed quasi closed phase (QCP) analysis method utilizes weighted linear prediction (WLP) with a specific attenuated main excitation (AME) weight function that attenuates the contribution of the glottal source in the linear prediction model optimization. This enables the use of the autocorrelation criterion in linear prediction in contrast to the covariance criterion used in conventional closed phase analysis. The QCP method was compared to previously developed methods by using synthetic vowels produced with the conventional source-filter model as well as with a physical modeling approach. The obtained objective measures show that the QCP method improves the GIF performance in terms of errors in typical glottal source parametrizations for both low- and high-pitched vowels. Additionally, QCP was tested in a physiologically oriented vocoder, where the analysis/synthesis quality was evaluated with a subjective listening test indicating improved perceived quality for normal speaking style.

Original languageEnglish (US)
Pages (from-to)596-607
Number of pages12
JournalIEEE Transactions on Audio, Speech and Language Processing
Volume22
Issue number3
DOIs
StatePublished - Mar 2014

Fingerprint

linear prediction
vowels
Autocorrelation
autocorrelation
optimization model
filters
speaking
Linear Models
optimization
synthesis
Weights and Measures
excitation
performance

Keywords

  • Closed phase analysis
  • GIF
  • Glottal inverse filtering
  • Speech analysis
  • Weighted linear prediction

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Acoustics and Ultrasonics
  • Signal Processing
  • Media Technology
  • Instrumentation
  • Linguistics and Language
  • Speech and Hearing

Cite this

Quasi closed phase glottal inverse filtering analysis with weighted linear prediction. / Airaksinen, Manu; Raitio, Tuomo; Story, Brad H; Alku, Paavo.

In: IEEE Transactions on Audio, Speech and Language Processing, Vol. 22, No. 3, 03.2014, p. 596-607.

Research output: Contribution to journalArticle

@article{e23da3f6d52f48fab471371b8d12cf7b,
title = "Quasi closed phase glottal inverse filtering analysis with weighted linear prediction",
abstract = "This study presents a new glottal inverse filtering (GIF) technique based on closed phase analysis over multiple fundamental periods. The proposed quasi closed phase (QCP) analysis method utilizes weighted linear prediction (WLP) with a specific attenuated main excitation (AME) weight function that attenuates the contribution of the glottal source in the linear prediction model optimization. This enables the use of the autocorrelation criterion in linear prediction in contrast to the covariance criterion used in conventional closed phase analysis. The QCP method was compared to previously developed methods by using synthetic vowels produced with the conventional source-filter model as well as with a physical modeling approach. The obtained objective measures show that the QCP method improves the GIF performance in terms of errors in typical glottal source parametrizations for both low- and high-pitched vowels. Additionally, QCP was tested in a physiologically oriented vocoder, where the analysis/synthesis quality was evaluated with a subjective listening test indicating improved perceived quality for normal speaking style.",
keywords = "Closed phase analysis, GIF, Glottal inverse filtering, Speech analysis, Weighted linear prediction",
author = "Manu Airaksinen and Tuomo Raitio and Story, {Brad H} and Paavo Alku",
year = "2014",
month = "3",
doi = "10.1109/TASLP.2013.2294585",
language = "English (US)",
volume = "22",
pages = "596--607",
journal = "IEEE Transactions on Speech and Audio Processing",
issn = "1558-7916",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "3",

}

TY - JOUR

T1 - Quasi closed phase glottal inverse filtering analysis with weighted linear prediction

AU - Airaksinen, Manu

AU - Raitio, Tuomo

AU - Story, Brad H

AU - Alku, Paavo

PY - 2014/3

Y1 - 2014/3

N2 - This study presents a new glottal inverse filtering (GIF) technique based on closed phase analysis over multiple fundamental periods. The proposed quasi closed phase (QCP) analysis method utilizes weighted linear prediction (WLP) with a specific attenuated main excitation (AME) weight function that attenuates the contribution of the glottal source in the linear prediction model optimization. This enables the use of the autocorrelation criterion in linear prediction in contrast to the covariance criterion used in conventional closed phase analysis. The QCP method was compared to previously developed methods by using synthetic vowels produced with the conventional source-filter model as well as with a physical modeling approach. The obtained objective measures show that the QCP method improves the GIF performance in terms of errors in typical glottal source parametrizations for both low- and high-pitched vowels. Additionally, QCP was tested in a physiologically oriented vocoder, where the analysis/synthesis quality was evaluated with a subjective listening test indicating improved perceived quality for normal speaking style.

AB - This study presents a new glottal inverse filtering (GIF) technique based on closed phase analysis over multiple fundamental periods. The proposed quasi closed phase (QCP) analysis method utilizes weighted linear prediction (WLP) with a specific attenuated main excitation (AME) weight function that attenuates the contribution of the glottal source in the linear prediction model optimization. This enables the use of the autocorrelation criterion in linear prediction in contrast to the covariance criterion used in conventional closed phase analysis. The QCP method was compared to previously developed methods by using synthetic vowels produced with the conventional source-filter model as well as with a physical modeling approach. The obtained objective measures show that the QCP method improves the GIF performance in terms of errors in typical glottal source parametrizations for both low- and high-pitched vowels. Additionally, QCP was tested in a physiologically oriented vocoder, where the analysis/synthesis quality was evaluated with a subjective listening test indicating improved perceived quality for normal speaking style.

KW - Closed phase analysis

KW - GIF

KW - Glottal inverse filtering

KW - Speech analysis

KW - Weighted linear prediction

UR - http://www.scopus.com/inward/record.url?scp=84898074254&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84898074254&partnerID=8YFLogxK

U2 - 10.1109/TASLP.2013.2294585

DO - 10.1109/TASLP.2013.2294585

M3 - Article

AN - SCOPUS:84898074254

VL - 22

SP - 596

EP - 607

JO - IEEE Transactions on Speech and Audio Processing

JF - IEEE Transactions on Speech and Audio Processing

SN - 1558-7916

IS - 3

ER -