Historikk

Cristin-resultat-ID: 396129

Sist endret: 21. januar 2015, 15:07

NVI-rapporteringsår: 2005

Resultat

Vitenskapelig artikkel

2005

Distributed ASR Using Speech Coder Data for Efficient Feature Vector Representation

Trond Skogstad og
Torbjørn Svendsen

Tidsskrift Tidsskrift

Eurospeech : Proceedings of the European Conference on Speech Communication and Technology

ISSN 1018-4074

NVI-nivå 0

Finn i kanalregisteret

Om resultatet Om resultatet

Vitenskapelig artikkel

Publiseringsår: 2005

Volum: 9

Sider: 2861 - 2864

Artikkelnummer: 2346

Beskrivelse Beskrivelse

Engelsk

Tittel

Distributed ASR Using Speech Coder Data for Efficient Feature Vector Representation

Sammendrag

This paper proposes an alternative approach to distributed speech recognition in scenarios where both reliable feature vectors and the reconstruction of the speech signal are required. By transmitting the difference between speech coded information and the desired feature vectors, this system achieves both excellent quality speech reconstruction and ASR recognition performance. Experiments show that a transparent recognition rate is achieved with as little as 0.6 kbps of additional information supplementing the AMR speech coder operating at 4.75 kbps. The total rate is comparable to the the ETSI 202 211 extended front-end standard.

Vis fullstendig beskrivelse

Bidragsytere Bidragsytere

Trond Skogstad

Forfatter
ved Institutt for elektroniske systemer ved Norges teknisk-naturvitenskapelige universitet

Torbjørn Karl Svendsen

Bidragsyterens navn vises på dette resultatet som Torbjørn Svendsen

Forfatter
ved Institutt for elektroniske systemer ved Norges teknisk-naturvitenskapelige universitet

1 - 2 av 2