Phonological and statistical effects on timing of speech perception: Insights from a database of Dutch diphone perception

Natasha Warner, Roel Smits, James M. McQueen, Anne Cutler

Research output: Contribution to journalArticle

14 Citations (Scopus)

Abstract

We report detailed analyses of a very large database on timing of speech perception collected by Smits et al. (Smits, R., Warner, N., McQueen, J.M., Cutler, A., 2003. Unfolding of phonetic information over time: A database of Dutch diphone perception. J. Acoust. Soc. Am. 113, 563-574). Eighteen listeners heard all possible diphones of Dutch, gated in portions of varying size and presented without background noise. The present report analyzes listeners' responses across gates in terms of phonological features (voicing, place, and manner for consonants; height, backness, and length for vowels). The resulting patterns for feature perception differ from patterns reported when speech is presented in noise. The data are also analyzed for effects of stress and of phonological context (neighboring vowel vs. consonant); effects of these factors are observed to be surprisingly limited. Finally, statistical effects, such as overall phoneme frequency and transitional probabilities, along with response biases, are examined; these too exercise only limited effects on response patterns. The results suggest highly accurate speech perception on the basis of acoustic information alone.

Original languageEnglish (US)
Pages (from-to)53-72
Number of pages20
JournalSpeech Communication
Volume46
Issue number1
DOIs
StatePublished - May 2005

Fingerprint

Speech Perception
Noise
Timing
Portion Size
Databases
Phonetics
Acoustics
listener
Speech analysis
Unfolding
phonetics
Exercise
acoustics
Perception
Data Base
trend

Keywords

  • Diphone
  • Dutch
  • Feature
  • Speech perception
  • Timing

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Experimental and Cognitive Psychology
  • Linguistics and Language

Cite this

Phonological and statistical effects on timing of speech perception : Insights from a database of Dutch diphone perception. / Warner, Natasha; Smits, Roel; McQueen, James M.; Cutler, Anne.

In: Speech Communication, Vol. 46, No. 1, 05.2005, p. 53-72.

Research output: Contribution to journalArticle

@article{68769b59572740019207f9fa33de834a,
title = "Phonological and statistical effects on timing of speech perception: Insights from a database of Dutch diphone perception",
abstract = "We report detailed analyses of a very large database on timing of speech perception collected by Smits et al. (Smits, R., Warner, N., McQueen, J.M., Cutler, A., 2003. Unfolding of phonetic information over time: A database of Dutch diphone perception. J. Acoust. Soc. Am. 113, 563-574). Eighteen listeners heard all possible diphones of Dutch, gated in portions of varying size and presented without background noise. The present report analyzes listeners' responses across gates in terms of phonological features (voicing, place, and manner for consonants; height, backness, and length for vowels). The resulting patterns for feature perception differ from patterns reported when speech is presented in noise. The data are also analyzed for effects of stress and of phonological context (neighboring vowel vs. consonant); effects of these factors are observed to be surprisingly limited. Finally, statistical effects, such as overall phoneme frequency and transitional probabilities, along with response biases, are examined; these too exercise only limited effects on response patterns. The results suggest highly accurate speech perception on the basis of acoustic information alone.",
keywords = "Diphone, Dutch, Feature, Speech perception, Timing",
author = "Natasha Warner and Roel Smits and McQueen, {James M.} and Anne Cutler",
year = "2005",
month = "5",
doi = "10.1016/j.specom.2005.01.003",
language = "English (US)",
volume = "46",
pages = "53--72",
journal = "Speech Communication",
issn = "0167-6393",
publisher = "Elsevier",
number = "1",

}

TY - JOUR

T1 - Phonological and statistical effects on timing of speech perception

T2 - Insights from a database of Dutch diphone perception

AU - Warner, Natasha

AU - Smits, Roel

AU - McQueen, James M.

AU - Cutler, Anne

PY - 2005/5

Y1 - 2005/5

N2 - We report detailed analyses of a very large database on timing of speech perception collected by Smits et al. (Smits, R., Warner, N., McQueen, J.M., Cutler, A., 2003. Unfolding of phonetic information over time: A database of Dutch diphone perception. J. Acoust. Soc. Am. 113, 563-574). Eighteen listeners heard all possible diphones of Dutch, gated in portions of varying size and presented without background noise. The present report analyzes listeners' responses across gates in terms of phonological features (voicing, place, and manner for consonants; height, backness, and length for vowels). The resulting patterns for feature perception differ from patterns reported when speech is presented in noise. The data are also analyzed for effects of stress and of phonological context (neighboring vowel vs. consonant); effects of these factors are observed to be surprisingly limited. Finally, statistical effects, such as overall phoneme frequency and transitional probabilities, along with response biases, are examined; these too exercise only limited effects on response patterns. The results suggest highly accurate speech perception on the basis of acoustic information alone.

AB - We report detailed analyses of a very large database on timing of speech perception collected by Smits et al. (Smits, R., Warner, N., McQueen, J.M., Cutler, A., 2003. Unfolding of phonetic information over time: A database of Dutch diphone perception. J. Acoust. Soc. Am. 113, 563-574). Eighteen listeners heard all possible diphones of Dutch, gated in portions of varying size and presented without background noise. The present report analyzes listeners' responses across gates in terms of phonological features (voicing, place, and manner for consonants; height, backness, and length for vowels). The resulting patterns for feature perception differ from patterns reported when speech is presented in noise. The data are also analyzed for effects of stress and of phonological context (neighboring vowel vs. consonant); effects of these factors are observed to be surprisingly limited. Finally, statistical effects, such as overall phoneme frequency and transitional probabilities, along with response biases, are examined; these too exercise only limited effects on response patterns. The results suggest highly accurate speech perception on the basis of acoustic information alone.

KW - Diphone

KW - Dutch

KW - Feature

KW - Speech perception

KW - Timing

UR - http://www.scopus.com/inward/record.url?scp=17444373108&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=17444373108&partnerID=8YFLogxK

U2 - 10.1016/j.specom.2005.01.003

DO - 10.1016/j.specom.2005.01.003

M3 - Article

AN - SCOPUS:17444373108

VL - 46

SP - 53

EP - 72

JO - Speech Communication

JF - Speech Communication

SN - 0167-6393

IS - 1

ER -