Technical Program
SP-P8: Robust Automatic Speech Recognition: General Topics |
Session Type: Poster |
Time: Wednesday, May 29, 10:30 - 12:30 |
Location: Poster Area C |
Session Chair: Peder Olsen, IBM |
SP-P8.1: VOICE ACTIVITY DETECTION USING CONVOLUTIVE NON-NEGATIVE SPARSE CODING |
Peng Teng; Beijing Institute of Technology |
Yunde Jia; Beijing Institute of Technology |
SP-P8.2: RECURRENT NEURAL NETWORKS FOR VOICE ACTIVITY DETECTION |
Thad Hughes; Google Inc. |
Keir Mierle; Google Inc. |
SP-P8.3: APPROXIMATED PARALLEL MODEL COMBINATION FOR EFFICIENT NOISE-ROBUST SPEECH RECOGNITION |
Khe Chai Sim; National University of Singapore |
SP-P8.4: AN UNCERTAINTY DECODING APPROACH TO NOISE- AND REVERBERATION-ROBUST SPEECH RECOGNITION |
Roland Maas; University of Erlangen-Nuremberg |
Akshaya Thippur; KTH Royal Institute of Technology |
Armin Sehr; Beuth University of Applied Sciences Berlin |
Walter Kellermann; University of Erlangen-Nuremberg |
SP-P8.5: BAYESIAN LATENT VARIABLE MODELS FOR SPEECH RECOGNITION |
Jen-Tzung Chien; National Chiao Tung University |
Peng Liu; Sohu.com Inc. |
SP-P8.6: AN INVESTIGATION OF DEEP NEURAL NETWORKS FOR NOISE ROBUST SPEECH RECOGNITION |
Michael Seltzer; Microsoft Research |
Dong Yu; Microsoft Research |
Yongqiang Wang; Cambridge University |
SP-P8.7: MODELING HETEROGENEOUS DATA SOURCES FOR SPEECH RECOGNITION USING SYNCHRONOUS HIDDEN MARKOV MODELS |
Yong Zhao; Georgia Institute of Technology |
Biing-Hwang (Fred) Juang; Georgia Institute of Technology |
SP-P8.8: NOISE ADAPTIVE FRONT-END NORMALIZATION BASED ON VECTOR TAYLOR SERIES FOR DEEP NEURAL NETWORKS IN ROBUST SPEECH RECOGNITION |
Bo Li; National University of Singapore |
Khe Chai Sim; National University of Singapore |
SP-P8.9: PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES |
Po-Sen Huang; University of Illinois at Urbana-Champaign |
Kshitiz Kumar; Microsoft Corporation |
Chaojun Liu; Microsoft Corporation |
Yifan Gong; Microsoft Corporation |
Li Deng; Microsoft Research |
SP-P8.10: ASR ERROR DETECTION IN A CONVERSATIONAL SPOKEN LANGUAGE TRANSLATION SYSTEM |
Wei Chen; Raytheon BBN Technologies |
Sankaranarayanan Ananthakrishnan; Raytheon BBN Technologies |
Rohit Kumar; Raytheon BBN Technologies |
Rohit Prasad; Raytheon BBN Technologies |
Prem Natarajan; Raytheon BBN Technologies |
SP-P8.11: MEAN TEMPORAL DISTANCE: PREDICTING ASR ERROR FROM TEMPORAL PROPERTIES OF SPEECH SIGNAL |
Hynek Hermansky; Johns Hopkins University |
Ehsan Variani; Johns Hopkins University |
Vijayaditya Peddinti; Johns Hopkins University |
SP-P8.12: FEATURE EXTRACTION WITH A MULTISCALE MODULATION ANALYSIS FOR ROBUST AUTOMATIC SPEECH RECOGNITION |
Florian Mueller; University of Luebeck |
Alfred Mertins; University of Luebeck |
SP-P8.13: JOINT ANALYSIS OF VOCAL TRACT LENGTH AND TEMPORAL INFORMATION FOR ROBUST SPEECH RECOGNITION |
Chien-Lin Huang; National Institute of Information and Communications Technology |
Chiori Hori; National Institute of Information and Communications Technology |
Hideki Kashioka; National Institute of Information and Communications Technology |
Bin Ma; Institute for Infocomm Research |
SP-P8.14: DOUBLE PITCH MARKS IN DIPLOPHONIC VOICE |
Philipp Aichinger; Medical University of Vienna |
Berit Schneider-Stickler; Medical University of Vienna |
Wolfgang Bigenzahn; Medical University of Vienna |
Anna Katharina Fuchs; Graz University of Technology |
Bernhard Geiger; Graz University of Technology |
Martin Hagmüller; Graz University of Technology |
Gernot Kubin; Graz University of Technology |