A simplified model for the simulation and transformation of speech

Brad H. Story, Ingo R. Titze, Darrell Wong

Research output: Contribution to journalArticle

Abstract

This paper explores a model that reduces speech production to the specification of four time-varying parameters; F1 and F2, voice fundamental frequency (F0), and a relative amplitude of the voice. The trajectory of the first two formants, F1 and F2, is treated as a series of coordinate pairs that are mapped from the F1F2 plane into a two-dimensional plane of coefficients. These coefficients are multipliers of two empirically-based orthogonal basis vectors which, when added to a neutral vowel area function, will produce a new area function with the desired locations of F1 and F2. Thus, area functions and voice parameters extracted at appropriate time intervals can be fed into a speech simulation model to recreate the original speech. A transformation of the speech can also be imposed by manipulating the area function and voice characteristics prior to the recreation of speech by simulation. The model has initially been developed for vowel-like speech utterances, but the effect of consonants on the F1F2 trajectory is also briefly addressed.

Original languageEnglish (US)
Pages (from-to)593-601
Number of pages9
JournalEngineering Applications of Artificial Intelligence
Volume10
Issue number6
StatePublished - Dec 1 1997
Externally publishedYes

Keywords

  • Speech production
  • Speech transformation

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Artificial Intelligence
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'A simplified model for the simulation and transformation of speech'. Together they form a unique fingerprint.

  • Cite this