Considerations in voice transformation with physiologic scaling principles

Ingo Titze, Darrell Wong, Brad H Story, Russell Long

Research output: Contribution to journal (Article)

2 Citations (Scopus)

Abstract

This study begins to explore the importance of the physiological domain in voice transformation. A general approach is outlined for transforming the voice quality of sentence-level speech while maintaining the same phonetic content. Transformations will eventually include gender, age, voice quality, emotional state, disordered state, dialect or impersonation. In this paper, only a specific voice quality, twang, is described as an example. The basic question is: relative to pure signal processing, can voices be transformed more effectively if biomechanical, acoustic and anatomical scaling principles are applied? At present, two approaches are contrasted, a Linear Predictive Coding approach and a biomechanical simulation approach.

Original language: English (US)
Pages (from-to): 113-123
Number of pages: 11
Journal: Speech Communication
Volume: 22
Issue number: 2-3
State: Published - Aug 1997
Externally published: Yes

Fingerprint

Voice Quality
Scaling
Phonetics
Dialect
Acoustics
Coding
Simulation
Speech analysis
Gender
Signal Processing
Voice

Keywords

  • Speech simulation
  • Speech synthesis
  • Voice conversion
  • Voice transformation
  • Vowel quality

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Experimental and Cognitive Psychology
  • Linguistics and Language

Cite this

Considerations in voice transformation with physiologic scaling principles. / Titze, Ingo; Wong, Darrell; Story, Brad H; Long, Russell.

In: Speech Communication, Vol. 22, No. 2-3, 08.1997, p. 113-123.


@article{e0886662359e4a7aaf966b504ea30fa5,
title = "Considerations in voice transformation with physiologic scaling principles",
abstract = "This study begins to explore the importance of the physiological domain in voice transformation. A general approach is outlined for transforming the voice quality of sentence-level speech while maintaining the same phonetic content. Transformations will eventually include gender, age, voice quality, emotional state, disordered state, dialect or impersonation. In this paper, only a specific voice quality, twang, is described as an example. The basic question is: relative to pure signal processing, can voices be transformed more effectively if biomechanical, acoustic and anatomical scaling principles are applied? At present, two approaches are contrasted, a Linear Predictive Coding approach and a biomechanical simulation approach.",
keywords = "Speech simulation, Speech synthesis, Voice conversion, Voice transformation, Vowel quality",
author = "Ingo Titze and Darrell Wong and Story, {Brad H} and Russell Long",
year = "1997",
month = "8",
language = "English (US)",
volume = "22",
pages = "113--123",
journal = "Speech Communication",
issn = "0167-6393",
publisher = "Elsevier",
number = "2-3",

}

TY - JOUR
T1 - Considerations in voice transformation with physiologic scaling principles
AU - Titze, Ingo
AU - Wong, Darrell
AU - Story, Brad H
AU - Long, Russell
PY - 1997/8
Y1 - 1997/8
AB - This study begins to explore the importance of the physiological domain in voice transformation. A general approach is outlined for transforming the voice quality of sentence-level speech while maintaining the same phonetic content. Transformations will eventually include gender, age, voice quality, emotional state, disordered state, dialect or impersonation. In this paper, only a specific voice quality, twang, is described as an example. The basic question is: relative to pure signal processing, can voices be transformed more effectively if biomechanical, acoustic and anatomical scaling principles are applied? At present, two approaches are contrasted, a Linear Predictive Coding approach and a biomechanical simulation approach.
KW - Speech simulation
KW - Speech synthesis
KW - Voice conversion
KW - Voice transformation
KW - Vowel quality
UR - http://www.scopus.com/inward/record.url?scp=0031197896&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0031197896&partnerID=8YFLogxK
M3 - Article
VL - 22
SP - 113
EP - 123
JO - Speech Communication
JF - Speech Communication
SN - 0167-6393
IS - 2-3
ER -