Technical Program
SP-P17: Speech Synthesis |
Session Type: Poster |
Time: Friday, May 31, 10:30 - 12:30 |
Location: Poster Area C |
Session Chair: Antonio Bonafonte, Technical University of Catalonia (UPC) |
SP-P17.1: ARTICULATORY INVERSION AND SYNTHESIS: TOWARDS ARTICULATORY-BASED MODIFICATION OF SPEECH |
Sandesh Aryal; Texas A&M University |
Ricardo Gutierrez-Osuna; Texas A&M University |
SP-P17.2: A FAST TABLE LOOKUP BASED, STATISTICAL MODEL DRIVEN NON-UNIFORM UNIT SELECTION TTS |
Yao Qian; Microsoft Research Asia |
Frank Soong; Microsoft Research Asia |
Xiaobo Zhou; Microsoft Research Asia |
Yundi Qian; Microsoft Research Asia |
Xiaotian Zhang; Microsoft Research Asia |
SP-P17.3: STATISTICAL PARAMETRIC SPEECH SYNTHESIS USING DEEP NEURAL NETWORKS |
Heiga Zen; Google Inc. |
Andrew Senior; Google Inc. |
Mike Schuster; Google Inc. |
SP-P17.4: PREDICTION OF CREAKY VOICE FROM CONTEXTUAL FACTORS |
Thomas Drugman; University of Mons |
John Kane; Trinity College Dublin |
Tuomo Raitio; Aalto University |
Christer Gobl; Trinity College Dublin |
SP-P17.5: COMPLEX CEPSTRUM ANALYSIS BASED ON THE MINIMUM MEAN SQUARED ERROR |
Ranniery Maia; Toshiba Research Europe Ltd. |
Masami Akamine; Toshiba Corporation |
Mark J.F. Gales; Toshiba Research Europe Ltd. |
SP-P17.6: INTEGRATED AUTOMATIC EXPRESSION PREDICTION AND SPEECH SYNTHESIS FROM TEXT |
Langzhou Chen; Toshiba Research Europe Ltd. |
Mark J.F. Gales; Toshiba Research Europe Ltd. |
Norbert Braunschweiler; Toshiba Research Europe Ltd. |
Masami Akamine; Corporate Research and Development Center |
Kate Knill; Engineering Department |
SP-P17.7: SPEAKER AND LANGUAGE INDEPENDENT VOICE QUALITY CLASSIFICATION APPLIED TO UNLABELLED CORPORA OF EXPRESSIVE SPEECH |
John Kane; Trinity College Dublin |
Scherer Stefan; University of Southern California |
Matthew Aylett; CereProc Ltd. |
Louis-Philippe Morency; University of Southern California |
Christer Gobl; Trinity College Dublin |
SP-P17.8: LIGHTLY SUPERVISED GMM VAD TO USE AUDIOBOOK FOR SPEECH SYNTHESISER |
Yoshitaka Mamiya; University of Edinburgh |
Junichi Yamagishi; University of Edinburgh |
Oliver Watts; University of Edinburgh |
Robert Clark; University of Edinburgh |
Simon King; University of Edinburgh |
Adriana Stan; Technical University of Cluj-Napoca |
SP-P17.9: BOOTSTRAPPING TEXT-TO-SPEECH FOR SPEECH PROCESSING IN LANGUAGES WITHOUT AN ORTHOGRAPHY |
Sunayana Sitaram; Carnegie Mellon University |
Sukhada Palkar; Carnegie Mellon University |
Yun-Nung Chen; Carnegie Mellon University |
Alok Parlikar; Carnegie Mellon University |
Alan W. Black; Carnegie Mellon University |
SP-P17.10: MAXIMUM INTELLIGIBILITY-BASED CLOSE-LOOP SPEECH SYNTHESIS FRAMEWORK FOR NOISY ENVIRONMENTS |
Yuan-Fu Liao; National Taipei University of Technology |
Ming-Long Wu; National Taipei University of Technology |
Jia-Chi Lin; National Taipei University of Technology |
SP-P17.11: SPEECH SYNTHESIS USING SUBBAND-CODED MULTIBAND SOURCE COMPONENTS AND SINUSOIDS |
Nobuyuki Nishizawa; KDDI R&D Laboratories, Inc. |
Tsuneo Kato; KDDI R&D Laboratories, Inc. |
SP-P17.12: FRAME-LEVEL ACOUSTIC MODELING BASED ON GAUSSIAN PROCESS REGRESSION FOR STATISTICAL NONPARAMETRIC SPEECH SYNTHESIS |
Tomoki Koriyama; Tokyo Institute of Technology |
Takashi Nose; Tokyo Institute of Technology |
Takao Kobayashi; Tokyo Institute of Technology |
SP-P17.13: MULTI-DISTRIBUTION DEEP BELIEF NETWORK FOR SPEECH SYNTHESIS |
Shiyin Kang; The Chinese University of Hong Kong |
Xiaojun Qian; The Chinese University of Hong Kong |
Helen Meng; The Chinese University of Hong Kong |