Technical Program

SP-P19: General Topics in Speech Recognition

Session Type: Poster
Time: Friday, May 31, 15:20 - 17:20
Location: Poster Area C
Session Chair: Michael Seltzer, Microsoft
 
SP-P19.1: EMBEDDING TIME WARPING IN EXEMPLAR-BASED SPARSE REPRESENTATIONS OF SPEECH
         Emre Yilmaz; Katholieke Universiteit Leuven
         Jort Florent Gemmeke; Katholieke Universiteit Leuven
         Hugo Van hamme; Katholieke Universiteit Leuven
 
SP-P19.2: UNSUPERVISED DISCOVERY OF LINGUISTIC STRUCTURE INCLUDING TWO-LEVEL ACOUSTIC PATTERNS USING THREE CASCADED STAGES OF ITERATIVE OPTIMIZATION
         Cheng-Tao Chung; National Taiwan University
         Chun-an Chan; National Taiwan University
         Lin-Shan Lee; National Taiwan University
 
SP-P19.3: LIGHTLY SUPERVISED LEARNING FROM A DAMAGED NATURAL SPEECH CORPUS
         Charles Fox; Sheffield University
         Thomas Hain; Sheffield University
 
SP-P19.4: WEAK TOP-DOWN CONSTRAINTS FOR UNSUPERVISED ACOUSTIC MODEL TRAINING
         Aren Jansen; Johns Hopkins University
         Samuel Thomas; Johns Hopkins University
         Hynek Hermansky; Johns Hopkins University
 
SP-P19.5: ACCURATE SPEECH SEGMENTATION BY MIMICKING HUMAN AUDITORY PROCESSING
         Sarah King; University of Illinois
         Mark Hasegawa-Johnson; University of Illinois
 
SP-P19.6: AUDIOVISUAL CORPUS TO ANALYZE WHISPER SPEECH
         Tam Tran; The University of Texas at Dallas
         Soroosh Mariooryad; The University of Texas at Dallas
         Carlos Busso; The University of Texas at Dallas
 
SP-P19.7: SPEAKER-INDEPENDENT LIPS AND TONGUE VISUALIZATION OF VOWELS
         Hao Li; Chinese Academy of Sciences
         Minghao Yang; Chinese Academy of Sciences
         Jianhua Tao; Chinese Academy of Sciences
 
SP-P19.8: A SUMMARY OF THE 2012 JHU CLSP WORKSHOP ON ZERO RESOURCE SPEECH TECHNOLOGIES AND MODELS OF EARLY LANGUAGE ACQUISITION
         Aren Jansen; Johns Hopkins University
         Emmanuel Dupoux; Ecole des Haute Etudes en Sciences Sociales
         Sharon Goldwater; University of Edinburgh
         Mark Johnson; Macquarie University
         Sanjeev Khudanpur; Johns Hopkins University
         Kenneth Church; IBM Research
         Naomi Feldman; University of Maryland
         Hynek Hermansky; Johns Hopkins University
         Florian Metze; Carnegie Mellon University
         Richard Rose; McGill University
         Michael Seltzer; Microsoft Research
         Pascal Clark; Johns Hopkins University
         Ian McGraw; Massachusetts Institute of Technology
         Balakrishnan Varadarajan; Google Inc.
         Erin Bennett; University of Maryland
         Benjamin Borschinger; Macquarie University
         Justin Chiu; Carnegie Mellon University
         Ewan Dunbar; University of Maryland
         Abdellah Fourtassi; École Normale Supérieure
         David Harwath; Massachusetts Institute of Technology
         Chia-Ying Lee; Massachusetts Institute of Technology
         Keith Levin; Johns Hopkins University
         Atta Norouzian; McGill University
         Vijayaditya Peddinti; Johns Hopkins University
         Rachael Richardson; University of Maryland
         Thomas Schatz; École Normale Supérieure
         Samuel Thomas; Johns Hopkins University
 
SP-P19.9: COMPARING TWO METHODS FOR CROWDSOURCING SPEECH TRANSCRIPTION
         Rachele Sprugnoli; Center for the Evaluation of Language and Communication Technologies
         Giovanni Moretti; Center for the Evaluation of Language and Communication Technologies
         Matteo Fuoli; Center for the Evaluation of Language and Communication Technologies
         Diego Giuliani; Fondazione Bruno Kessler
         Luisa Bentivogli; Fondazione Bruno Kessler
         Emanuele Pianta; Fondazione Bruno Kessler
         Roberto Gretter; Fondazione Bruno Kessler
         Fabio Brugnara; Fondazione Bruno Kessler
 
SP-P19.10: THE SPOKEN WEB SEARCH TASK AT MEDIAEVAL 2012
         Florian Metze; Carnegie Mellon University
         Xavier Anguera; Telefonica Research
         Etienne Barnard; North-West University
         Marelie Davel; North-West University
         Guillaume Gravier; IRISA/ INRIA
 
SP-P19.11: GLOBALPHONE: A MULTILINGUAL TEXT & SPEECH DATABASE IN 20 LANGUAGES
         Tanja Schultz; Karlsruhe Institute of Technology (KIT)
         Ngoc Thang Vu; Karlsruhe Institute of Technology (KIT)
         Tim Schlippe; Karlsruhe Institute of Technology (KIT)
 
SP-P19.12: IMPROVING ASR BY INTEGRATING LECTURE AUDIO AND SLIDES
         João Miranda; Instituto Superior Técnico / Carnegie Mellon University
         João Neto; Instituto Superior Técnico
         Alan W. Black; Carnegie Mellon University
 
SP-P19.13: SEGMENTATION-BASED MONGOLIAN LVCSR APPROACH
         Feilong Bao; Inner Mongolia University
         Guanglai Gao; Inner Mongolia University
         Xueliang Yan; Inner Mongolia University
         Weihua Wang; Inner Mongolia University