Development of Large Vocabulary ASR Systems for Mandarin and Arabic
Konferenz: Sprachkommunikation 2008 - 8. ITG-Fachtagung
08.10.2008 - 10.10.2008 in Aachen, Germany
Tagungsband: Sprachkommunikation 2008
Seiten: 4Sprache: EnglischTyp: PDF
Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt
Autoren:
Schlüter, R.; Gollan, Ch.; Hahn, S.; Heigold, G.; Hoffmeister, B.; Lööf, J.; Plahl, Ch.; Rybach, D.; Ney, H. (Lehrstuhl für Informatik 6, RWTH Aachen University, 52056 Aachen)
Inhalt:
In this work, we describe our automatic speech recognition (ASR) systems for Mandarin and Arabic. We describe the general modeling, training, and recognition architecture, as well as language specific aspects. Several design aspects of the systems, including multiple system design and combination, are analyzed. For Arabic, we summarize the semi-automatic lexicon generation using a statistical approach to grapheme-to-phoneme conversion and pronunciation statistics. For Mandarin, different methods for integrating tone information are described. We present systematic recognition results on recent evaluation corpora of the Global Autonomous Language Exploitation (GALE) project.