Technical Program
SP-P4: Acoustic Modeling for Automatic Speech Recognition |
Session Type: Poster |
Time: Tuesday, May 28, 15:30 - 17:30 |
Location: Poster Area D |
Session Chair: Malcolm Slaney, Microsoft |
SP-P4.1: EFFECT OF FILTER BANDWIDTH AND SPECTRAL SAMPLING RATE OF ANALYSIS FILTERBANK ON AUTOMATIC PHONEME RECOGNITION |
Feipeng Li; Johns Hopkins University |
Hynek Hermansky; Johns Hopkins University |
SP-P4.2: PROBABILISTIC ASR FEATURE EXTRACTION APPLYING CONTEXT-SENSITIVE CONNECTIONIST TEMPORAL CLASSIFICATION NETWORKS |
Martin Woellmer; BMW Group |
Björn Schuller; Technische Universität München |
Gerhard Rigoll; Technische Universität München |
SP-P4.3: OPTIMIZED MFCC FEATURE EXTRACTION ON GPU |
Haofeng Kou; Santa Clara University |
Weijia Shang; Santa Clara University |
Ian Lane; Carnegie Mellon University |
Jike Chong; Carnegie Mellon University |
SP-P4.4: MULTI-VIEW CCA-BASED ACOUSTIC FEATURES FOR PHONETIC RECOGNITION ACROSS SPEAKERS AND DOMAINS |
Raman Arora; Toyota Technological Institute at Chicago |
Karen Livescu; Toyota Technological Institute at Chicago |
SP-P4.5: PERFORMANCES OF UNSUPERVISED HMM IN ACOUSTIC-TO-ARTICULATORY INVERSION |
Helene Lachambre; IRIT - University of Toulouse |
Lionel Koenig; IRIT - University of Toulouse |
Régine André-Obrecht; IRIT - University of Toulouse |
SP-P4.6: ARTICULATORY TRAJECTORIES FOR LARGE-VOCABULARY SPEECH RECOGNITION |
Vikramjit Mitra; SRI International |
Wen Wang; SRI International |
Andreas Stolcke; Microsoft Research |
Hosung Nam; Haskins Laboratories |
Colleen Richey; SRI International |
Jiahong Yuan; University of Pennsylvania |
Mark Liberman; University of Pennsylvania |
SP-P4.7: DISTINCT TRIPHONE MODELING BY REFERENCE MODEL WEIGHTING |
Dongpeng Chen; The Hong Kong University of Science and Technology |
Brian Mak; The Hong Kong University of Science and Technology |
SP-P4.8: A NEW PHASE-BASED FEATURE REPRESENTATION FOR ROBUST SPEECH RECOGNITION |
Erfan Loweimi; Amirkabir University of Technology (Tehran Polytechnic) |
Seyed Mohammad Ahadi; Amirkabir University of Technology (Tehran Polytechnic) |
Thomas Drugman; Université de Mons |
SP-P4.9: CHANNEL-MAPPING FOR SPEECH CORPUS RECYCLING |
Osamu Ichikawa; IBM |
Steven Rennie; IBM |
Takashi Fukuda; IBM |
Masafumi Nishimura; IBM |
SP-P4.10: AN EVALUATION OF POSTERIOR MODELING TECHNIQUES FOR PHONETIC RECOGNITION |
Rohit Prabhavalkar; The Ohio State University |
Tara N. Sainath; IBM T.J. Watson Research Center |
David Nahamoo; IBM T.J. Watson Research Center |
Bhuvana Ramabhadran; IBM T.J. Watson Research Center |
Dimitri Kanevsky; IBM T.J. Watson Research Center |
SP-P4.11: ACCENT ADAPTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS |
Petr Motlicek; Idiap Research Institute |
Philip N. Garner; Idiap Research Institute |
Namhoon Kim; Samsung Electronics Co. Ltd |
Jeongmi Cho; Samsung Electronics Co. Ltd |
SP-P4.12: SEMI-SUPERVISED ACCENT DETECTION AND MODELING |
Shilei Zhang; IBM Research |
Yong Qin; IBM Research |
SP-P4.13: TONE RECOGNITION FOR CONTINUOUS ACCENTED MANDARIN CHINESE |
Jiang Wu; SUNY-Binghamton |
Stephen A. Zahorian; SUNY-Binghamton |
Hongbing Hu; SUNY-Binghamton |
SP-P4.14: SUBMODULAR FEATURE SELECTION FOR HIGH-DIMENSIONAL ACOUSTIC SCORE SPACES |
Yuzong Liu; University of Washington |
Kai Wei; University of Washington |
Katrin Kirchhoff; University of Washington |
Yisong Song; University of Washington |
Jeff Bilmes; University of Washington |