Predictive trellis-coded quantization of speech

Michael W Marcellin, Thomas R. Fischer, Jerry D. Gibson

Research output: Contribution to journalArticle

30 Citations (Scopus)

Abstract

Trellis-coded quantization (TCQ) is incorporated into a predictive coding structure for encoding sampled speech. The modest complexity of the resulting structure is seen to be a direct consequence of the TCQ formulation. Simulation results are presented for systems using fixed-prediction/fixed-residual encoding, fixed-prediction/adaptive-residual encoding, and adaptive-prediction/adaptive-residual encoding. The performance of predictive TCQ (PTCQ) is compared to that of other waveform coders, and the effects of channel errors on PTCQ performance are discussed. For a fully adaptive 16-kb/s speech coding system, segmental signal-to-noise ratios in the range of 19.1-21.9 dB are obtained for a variety of speakers and test sentences. Reconstructed speech obtained from this system is of excellent communication quality.

Original languageEnglish (US)
Pages (from-to)46-55
Number of pages10
JournalIEEE Transactions on Acoustics, Speech, and Signal Processing
Volume38
Issue number1
DOIs
StatePublished - Jan 1990

Fingerprint

Speech coding
Signal to noise ratio
Communication

ASJC Scopus subject areas

  • Signal Processing

Cite this

Predictive trellis-coded quantization of speech. / Marcellin, Michael W; Fischer, Thomas R.; Gibson, Jerry D.

In: IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. 38, No. 1, 01.1990, p. 46-55.

Research output: Contribution to journalArticle

@article{ff3a8c56e58842c2bf2c95f251ee4a56,
title = "Predictive trellis-coded quantization of speech",
abstract = "Trellis-coded quantization (TCQ) is incorporated into a predictive coding structure for encoding sampled speech. The modest complexity of the resulting structure is seen to be a direct consequence of the TCQ formulation. Simulation results are presented for systems using fixed-prediction/fixed-residual encoding, fixed-prediction/adaptive-residual encoding, and adaptive-prediction/adaptive-residual encoding. The performance of predictive TCQ (PTCQ) is compared to that of other waveform coders, and the effects of channel errors on PTCQ performance are discussed. For a fully adaptive 16-kb/s speech coding system, segmental signal-to-noise ratios in the range of 19.1-21.9 dB are obtained for a variety of speakers and test sentences. Reconstructed speech obtained from this system is of excellent communication quality.",
author = "Marcellin, {Michael W} and Fischer, {Thomas R.} and Gibson, {Jerry D.}",
year = "1990",
month = "1",
doi = "10.1109/29.45617",
language = "English (US)",
volume = "38",
pages = "46--55",
journal = "IEEE Transactions on Signal Processing",
issn = "1053-587X",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "1",

}

TY - JOUR

T1 - Predictive trellis-coded quantization of speech

AU - Marcellin, Michael W

AU - Fischer, Thomas R.

AU - Gibson, Jerry D.

PY - 1990/1

Y1 - 1990/1

N2 - Trellis-coded quantization (TCQ) is incorporated into a predictive coding structure for encoding sampled speech. The modest complexity of the resulting structure is seen to be a direct consequence of the TCQ formulation. Simulation results are presented for systems using fixed-prediction/fixed-residual encoding, fixed-prediction/adaptive-residual encoding, and adaptive-prediction/adaptive-residual encoding. The performance of predictive TCQ (PTCQ) is compared to that of other waveform coders, and the effects of channel errors on PTCQ performance are discussed. For a fully adaptive 16-kb/s speech coding system, segmental signal-to-noise ratios in the range of 19.1-21.9 dB are obtained for a variety of speakers and test sentences. Reconstructed speech obtained from this system is of excellent communication quality.

AB - Trellis-coded quantization (TCQ) is incorporated into a predictive coding structure for encoding sampled speech. The modest complexity of the resulting structure is seen to be a direct consequence of the TCQ formulation. Simulation results are presented for systems using fixed-prediction/fixed-residual encoding, fixed-prediction/adaptive-residual encoding, and adaptive-prediction/adaptive-residual encoding. The performance of predictive TCQ (PTCQ) is compared to that of other waveform coders, and the effects of channel errors on PTCQ performance are discussed. For a fully adaptive 16-kb/s speech coding system, segmental signal-to-noise ratios in the range of 19.1-21.9 dB are obtained for a variety of speakers and test sentences. Reconstructed speech obtained from this system is of excellent communication quality.

UR - http://www.scopus.com/inward/record.url?scp=0025252878&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0025252878&partnerID=8YFLogxK

U2 - 10.1109/29.45617

DO - 10.1109/29.45617

M3 - Article

VL - 38

SP - 46

EP - 55

JO - IEEE Transactions on Signal Processing

JF - IEEE Transactions on Signal Processing

SN - 1053-587X

IS - 1

ER -