Technical Program

 

DATE TIME ROOM SESSION NAME PRESENTATION TYPE PAPER CODE PAPER ID PAPER TITLE PAPER AUTHORS
Friday, 9 September 2016 09:30 - 10:30 Grand Ballroom ABC Keynote 1: ISCA Medalist   Fri-Keynote-1 3001    
Friday, 9 September 2016 11:00 Grand Ballroom A Neural Networks in Speech Recognition ORAL Fri-O-1-1-1 473 Improving English Conversational Telephone Speech Recognition Ivan Medennikov, Alexey Prudnikov, Alexander Zatvornitskiy
Friday, 9 September 2016 11:20 Grand Ballroom A Neural Networks in Speech Recognition ORAL Fri-O-1-1-2 1460 The IBM 2016 English Conversational Telephone Speech Recognition System George Saon, Tom Sercu, Steven Rennie, Hong-Kwang J. Kuo
Friday, 9 September 2016 11:40 Grand Ballroom A Neural Networks in Speech Recognition ORAL Fri-O-1-1-3 39 Small-Footprint Deep Neural Networks with Highway Connections for Speech Recognition Liang Lu, Steve Renals
Friday, 9 September 2016 12:00 Grand Ballroom A Neural Networks in Speech Recognition ORAL Fri-O-1-1-4 251 Deep Convolutional Neural Networks with Layer-Wise Context Expansion and Attention Dong Yu, Wayne Xiong, Jasha Droppo, Andreas Stolcke, Guoli Ye, Jinyu Li, Geoffrey Zweig
Friday, 9 September 2016 12:20 Grand Ballroom A Neural Networks in Speech Recognition ORAL Fri-O-1-1-5 275 Lower Frame Rate Neural Network Acoustic Models Golan Pundak, Tara N. Sainath
Friday, 9 September 2016 12:40 Grand Ballroom A Neural Networks in Speech Recognition ORAL Fri-O-1-1-6 725 Improved Neural Network Initialization by Grouping Context-Dependent Targets for Acoustic Modeling Gakuto Kurata, Brian Kingsbury
Friday, 9 September 2016 11:00 Grand Ballroom BC Special Session: Auditory-Visual Expressive Speech and Gesture in Humans and Machines ORAL Fri-O-1-2-1 1453 Automatic Scoring of Monologue Video Interviews Using Multimodal Cues Lei Chen, Gary Feng, Michelle Martin-Raugh, Chee Wee Leong, Christopher Kitchen, Su-Youn Yoon, Blair Lehman, Harrison Kell, Chong Min Lee
Friday, 9 September 2016 11:15 Grand Ballroom BC Special Session: Auditory-Visual Expressive Speech and Gesture in Humans and Machines ORAL Fri-O-1-2-2 1463 The Sound of Disgust: How Facial Expression May Influence Speech Production Chee Seng Chong, Jeesun Kim, Chris Davis
Friday, 9 September 2016 11:30 Grand Ballroom BC Special Session: Auditory-Visual Expressive Speech and Gesture in Humans and Machines ORAL Fri-O-1-2-3 158 Analyzing Temporal Dynamics of Dyadic Synchrony in Affective Interactions Zhaojun Yang, Shrikanth S. Narayanan
Friday, 9 September 2016 11:45 Grand Ballroom BC Special Session: Auditory-Visual Expressive Speech and Gesture in Humans and Machines ORAL Fri-O-1-2-4 62 Audiovisual Speech Scene Analysis in the Context of Competing Sources Attigodu C. Ganesh, Frédéric Berthommier, Jean-Luc Schwartz
Friday, 9 September 2016 12:00 Grand Ballroom BC Special Session: Auditory-Visual Expressive Speech and Gesture in Humans and Machines ORAL Fri-O-1-2-5 419 Head Motion Generation with Synthetic Speech: A Data Driven Approach Najmeh Sadoughi, Carlos Busso
Friday, 9 September 2016 12:15 Grand Ballroom BC Special Session: Auditory-Visual Expressive Speech and Gesture in Humans and Machines ORAL Fri-O-1-2-6 1505 The Consistency and Stability of Acoustic and Visual Cues for Different Prosodic Attitudes Jeesun Kim, Chris Davis
Friday, 9 September 2016 12:30 Grand Ballroom BC Special Session: Auditory-Visual Expressive Speech and Gesture in Humans and Machines ORAL Fri-O-1-2-7 4006 Introduction to Poster Presentation of Part II Jeesun Kim, Gérard Bailly
Friday, 9 September 2016 11:00 Bayview A Prosody ORAL Fri-O-1-3-1 1601 The Unit of Speech Encoding: The Case of Romanian Irene Vogel, Laura Spinu
Friday, 9 September 2016 11:20 Bayview A Prosody ORAL Fri-O-1-3-2 1268 The Perceptual Effect of L1 Prosody Transplantation on L2 Speech: The Case of French Accented German Jeanin Jügler, Frank Zimmerer, Jürgen Trouvain, Bernd Möbius
Friday, 9 September 2016 11:40 Bayview A Prosody ORAL Fri-O-1-3-3 631 Organizing Syllables into Sandhi Domains - Evidence from F0 and Duration Patterns in Shanghai Chinese Bijun Ling, Jie Liang
Friday, 9 September 2016 12:00 Bayview A Prosody ORAL Fri-O-1-3-4 1355 Automatic Analysis of Phonetic Speech Style Dimensions Neville Ryant, Mark Liberman
Friday, 9 September 2016 12:20 Bayview A Prosody ORAL Fri-O-1-3-5 1424 The Acoustic Manifestation of Prominence in Stressless Languages Angeliki Athanasopoulou, Irene Vogel
Friday, 9 September 2016 12:40 Bayview A Prosody ORAL Fri-O-1-3-6 607 The Rhythmic Constraint on Prosodic Boundaries in Mandarin Chinese Based on Corpora of Silent Reading and Speech Perception Wei Lai, Jiahong Yuan, Ya Li, Xiaoying Xu, Mark Liberman
Friday, 9 September 2016 11:00 Bayview B Speech and Language Processing for Clinical Health Applications ORAL Fri-O-1-4-1 408 Toward Development and Evaluation of Pain Level-Rating Scale for Emergency Triage based on Vocal Characteristics and Facial Expressions Fu-Sheng Tsai, Ya-Ling Hsu, Wei-Chen Chen, Yi-Ming Weng, Chip-Jin Ng, Chi-Chun Lee
Friday, 9 September 2016 11:20 Bayview B Speech and Language Processing for Clinical Health Applications ORAL Fri-O-1-4-2 1098 Predicting Severity of Voice Disorder from DNN-HMM Acoustic Posteriors Tan Lee, Yuanyuan Liu, Yu Ting Yeung, Thomas K.T. Law, Kathy Y.S. Lee
Friday, 9 September 2016 11:40 Bayview B Speech and Language Processing for Clinical Health Applications ORAL Fri-O-1-4-3 114 Long-Term Stability of Tracheoesophageal Voices Klaske E. van Sluis, Michiel W.M. van den Brekel, Frans J.M. Hilgers, Rob J.J.H. van Son
Friday, 9 September 2016 12:00 Bayview B Speech and Language Processing for Clinical Health Applications ORAL Fri-O-1-4-4 384 Detecting Mild Cognitive Impairment from Spontaneous Speech by Correlation-Based Phonetic Feature Selection Gábor Gosztolya, László Tóth, Tamás Grósz, Veronika Vincze, Ildikó Hoffmann, Gréta Szatlóczki, Magdolna Pákáski, János Kálmán
Friday, 9 September 2016 12:20 Bayview B Speech and Language Processing for Clinical Health Applications ORAL Fri-O-1-4-5 549 Towards an Automated Screening Tool for Developmental Speech and Language Impairments Jen J. Gong, Maryann Gong, Dina Levy-Lambert, Jordan R. Green, Tiffany P. Hogan, John V. Guttag
Friday, 9 September 2016 12:40 Bayview B Speech and Language Processing for Clinical Health Applications ORAL Fri-O-1-4-6 842 Spectral Enhancement of Cleft Lip and Palate Speech Vikram C.M., Nagaraj Adiga, S.R. Mahadeva Prasanna
Friday, 9 September 2016 11:00 Seacliff BCD Speech Coding and Audio Processing for Noise Reduction ORAL Fri-O-1-5-1 43 Assessing Level-Dependent Segmental Contribution to the Intelligibility of Speech Processed by Single-Channel Noise-Suppression Algorithms Tian Guan, Guangxing Chu, Fei Chen, Feng Yang
Friday, 9 September 2016 11:20 Seacliff BCD Speech Coding and Audio Processing for Noise Reduction ORAL Fri-O-1-5-2 594 Effectiveness of Near-End Speech Enhancement Under Equal-Loudness and Equal-Level Constraints Tudor-Cătălin Zorilă, Sheila Flanagan, Brian C.J. Moore, Yannis Stylianou
Friday, 9 September 2016 11:40 Seacliff BCD Speech Coding and Audio Processing for Noise Reduction ORAL Fri-O-1-5-3 1005 Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence Bidisha Sharma, S.R. Mahadeva Prasanna
Friday, 9 September 2016 12:00 Seacliff BCD Speech Coding and Audio Processing for Noise Reduction ORAL Fri-O-1-5-4 18 Relative Contributions of Amplitude and Phase to the Intelligibility Advantage of Ideal Binary Masked Sentences Lei Wang, Shufeng Zhu, Diliang Chen, Yong Feng, Fei Chen
Friday, 9 September 2016 12:20 Seacliff BCD Speech Coding and Audio Processing for Noise Reduction ORAL Fri-O-1-5-5 410 Predicting Binaural Speech Intelligibility from Signals Estimated by a Blind Source Separation Algorithm Qingju Liu, Yan Tang, Philip J.B. Jackson, Wenwu Wang
Friday, 9 September 2016 12:40 Seacliff BCD Speech Coding and Audio Processing for Noise Reduction ORAL Fri-O-1-5-6 960 Automated Pause Insertion for Improved Intelligibility Under Reverberation Petko N. Petkov, Norbert Braunschweiler, Yannis Stylianou
Friday, 9 September 2016 11:00 Seacliff A Speech Analysis ORAL Fri-O-1-6-1 1135 Automatic Classification of Phonation Modes in Singing Voice: Towards Singing Style Characterisation and Application to Ethnomusicological Recordings Jean-Luc Rouas, Leonidas Ioannidis
Friday, 9 September 2016 11:20 Seacliff A Speech Analysis ORAL Fri-O-1-6-2 1002 Novel Nonlinear Prediction Based Features for Spoofed Speech Detection Himanshu N. Bhavsar, Tanvina B. Patel, Hemant A. Patil
Friday, 9 September 2016 11:40 Seacliff A Speech Analysis ORAL Fri-O-1-6-3 1074 Robust Vowel Landmark Detection Using Epoch-Based Features Sri Harsha Dumpala, Bhanu Teja Nellore, Raghu Ram Nevali, Suryakanth V. Gangashetty, B. Yegnanarayana
Friday, 9 September 2016 12:00 Seacliff A Speech Analysis ORAL Fri-O-1-6-4 168 Sensitivity of Quantitative RT-MRI Metrics of Vocal Tract Dynamics to Image Reconstruction Settings Johannes Töger, Yongwan Lim, Sajan Goud Lingala, Shrikanth S. Narayanan, Krishna S. Nayak
Friday, 9 September 2016 12:20 Seacliff A Speech Analysis ORAL Fri-O-1-6-5 875 Sound Pattern Matching for Automatic Prosodic Event Detection Milos Cernak, Afsaneh Asaei, Pierre-Edouard Honnet, Philip N. Garner, Hervé Bourlard
Friday, 9 September 2016 12:40 Seacliff A Speech Analysis ORAL Fri-O-1-6-6 644 Automatic Classification of Lexical Stress in English and Arabic Languages Using Deep Learning Mostafa Shahin, Julien Epps, Beena Ahmed
Friday, 9 September 2016 11:00 Pacific Concourse - Poster A First and Second Language Acquisition POSTER Fri-P-1-1-1 895 Development of Mandarin Onset-Rime Detection in Relation to Age and Pinyin Instruction Fei Chen, Nan Yan, Xunan Huang, Hao Zhang, Lan Wang, Gang Peng
Friday, 9 September 2016 11:00 Pacific Concourse - Poster A First and Second Language Acquisition POSTER Fri-P-1-1-2 1022 Joint Effect of Dialect and Mandarin on English Vowel Production: A Case Study in Changsha EFL Learners Xinyi Wen, Yuan Jia
Friday, 9 September 2016 11:00 Pacific Concourse - Poster A First and Second Language Acquisition POSTER Fri-P-1-1-3 182 Effects of L1 Phonotactic Constraints on L2 Word Segmentation Strategies Tamami Katayama
Friday, 9 September 2016 11:00 Pacific Concourse - Poster A First and Second Language Acquisition POSTER Fri-P-1-1-4 457 Putting German [ʃ] and [ç] in Two Different Boxes: Native German vs L2 German of French Learners Jane Wottawa, Martine Adda-Decker, Frédéric Isel
Friday, 9 September 2016 11:00 Pacific Concourse - Poster A First and Second Language Acquisition POSTER Fri-P-1-1-5 623 Naturalness Judgement of L2 English Through Dubbing Practice Dean Luo, Ruxin Luo, Lixin Wang
Friday, 9 September 2016 11:00 Pacific Concourse - Poster A First and Second Language Acquisition POSTER Fri-P-1-1-6 641 Audiovisual Training Effects for Japanese Children Learning English /r/-/l/ Yasuaki Shinohara
Friday, 9 September 2016 11:00 Pacific Concourse - Poster A First and Second Language Acquisition POSTER Fri-P-1-1-7 658 L2 Acquisition and Production of the English Rhotic Pharyngeal Gesture Sarah Harper, Louis Goldstein, Shrikanth S. Narayanan
Friday, 9 September 2016 11:00 Pacific Concourse - Poster B Speech and Hearing Disorders & Perception POSTER Fri-P-1-2-1 1198 Auditory-Visual Perception of VCVs Produced by People with Down Syndrome: Preliminary Results Alexandre Hennequin, Amélie Rochet-Capellan, Marion Dohen
Friday, 9 September 2016 11:00 Pacific Concourse - Poster B Speech and Hearing Disorders & Perception POSTER Fri-P-1-2-2 109 Combining Non-Pathological Data of Different Language Varieties to Improve DNN-HMM Performance on Pathological Speech Emre Yılmaz, Mario Ganzeboom, Catia Cucchiarini, Helmer Strik
Friday, 9 September 2016 11:00 Pacific Concourse - Poster B Speech and Hearing Disorders & Perception POSTER Fri-P-1-2-3 1077 Evaluation of a Phone-Based Anomaly Detection Approach for Dysarthric Speech Imed Laaridh, Corinne Fredouille, Christine Meunier
Friday, 9 September 2016 11:00 Pacific Concourse - Poster B Speech and Hearing Disorders & Perception POSTER Fri-P-1-2-4 1085 Recognition of Dysarthric Speech Using Voice Parameters for Speaker Adaptation and Multi-Taper Spectral Estimation Chitralekha Bhat, Bhavik Vachhani, Sunil Kopparapu
Friday, 9 September 2016 11:00 Pacific Concourse - Poster B Speech and Hearing Disorders & Perception POSTER Fri-P-1-2-5 1133 Impaired Categorical Perception of Mandarin Tones and its Relationship to Language Ability in Autism Spectrum Disorders Fei Chen, Nan Yan, Xiaojie Pan, Feng Yang, Zhuanzhuan Ji, Lan Wang, Gang Peng
Friday, 9 September 2016 11:00 Pacific Concourse - Poster B Speech and Hearing Disorders & Perception POSTER Fri-P-1-2-6 1476 Perceived Naturalness of Electrolaryngeal Speech Produced Using sEMG-Controlled vs. Manual Pitch Modulation K.F. Nagle, J.T. Heaton
Friday, 9 September 2016 11:00 Pacific Concourse - Poster B Speech and Hearing Disorders & Perception POSTER Fri-P-1-2-7 1488 Identifying Hearing Loss from Learned Speech Kernels Shamima Najnin, Bonny Banerjee, Lisa Lucks Mendel, Masoumeh Heidari Kapourchali, Jayanta Kumar Dutta, Sungmin Lee, Chhayakanta Patro, Monique Pousson
Friday, 9 September 2016 11:00 Pacific Concourse - Poster B Speech and Hearing Disorders & Perception POSTER Fri-P-1-2-8 1524 Differential Effects of Velopharyngeal Dysfunction on Speech Intelligibility During Early and Late Stages of Amyotrophic Lateral Sclerosis Panying Rong, Yana Yunusova, Jordan R. Green
Friday, 9 September 2016 11:00 Pacific Concourse - Poster B Speech and Hearing Disorders & Perception POSTER Fri-P-1-2-9 349 The Production of Intervocalic Glides in Non Dysarthric Parkinsonian Speech V. Delvaux, V. Roland, K. Huet, M. Piccaluga, M.C. Haelewyck, B. Harmegnies
Friday, 9 September 2016 11:00 Pacific Concourse - Poster B Speech and Hearing Disorders & Perception POSTER Fri-P-1-2-10 38 Auditory Processing Impairments Under Background Noise in Children with Non-Syndromic Cleft Lip and/or Palate Yang Feng, Zhang Lu
Friday, 9 September 2016 11:00 Pacific Concourse - Poster B Speech and Hearing Disorders & Perception POSTER Fri-P-1-2-11 737 Modulation Spectral Features for Predicting Vocal Emotion Recognition by Simulated Cochlear Implants Zhi Zhu, Ryota Miyauchi, Yukiko Araki, Masashi Unoki
Friday, 9 September 2016 11:00 Pacific Concourse - Poster B Speech and Hearing Disorders & Perception POSTER Fri-P-1-2-12 765 Automatic Discrimination of Soft Voice Onset Using Acoustic Features of Breathy Voicing Keiko Ochi, Koichi Mori, Naomi Sakai, Nobutaka Ono
Friday, 9 September 2016 11:00 Pacific Concourse - Poster B Speech and Hearing Disorders & Perception POSTER Fri-P-1-2-13 891 Effect of Noise on Lexical Tone Perception in Cantonese-Speaking Amusics Jing Shao, Caicai Zhang, Gang Peng, Yike Yang, William S.-Y. Wang
Friday, 9 September 2016 11:00 Pacific Concourse - Poster B Speech and Hearing Disorders & Perception POSTER Fri-P-1-2-14 721 Audio-Visual Speech Recognition Using Bimodal-Trained Bottleneck Features for a Person with Severe Hearing Loss Yuki Takashima, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki, Nobuyuki Mitani, Kiyohiro Omori, Kaoru Nakazono
Friday, 9 September 2016 11:00 Pacific Concourse - Poster B Speech and Hearing Disorders & Perception POSTER Fri-P-1-2-15 297 Perception of Tone in Whispered Mandarin Sentences: The Case for Singapore Mandarin Yuling Gu, Boon Pang Lim, Nancy F. Chen
Friday, 9 September 2016 11:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Fri-P-1-3-1 116 A KL Divergence and DNN-Based Approach to Voice Conversion without Parallel Training Sentences Feng-Long Xie, Frank K. Soong, Haifeng Li
Friday, 9 September 2016 11:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Fri-P-1-3-2 227 Parallel Dictionary Learning for Voice Conversion Using Discriminative Graph-Embedded Non-Negative Matrix Factorization Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki
Friday, 9 September 2016 11:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Fri-P-1-3-3 678 Speech Bandwidth Extension Using Bottleneck Features and Deep Recurrent Neural Networks Yu Gu, Zhen-Hua Ling, Li-Rong Dai
Friday, 9 September 2016 11:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Fri-P-1-3-4 705 Voice Conversion Based on Matrix Variate Gaussian Mixture Model Using Multiple Frame Features Yi Yang, Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu
Friday, 9 September 2016 11:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Fri-P-1-3-5 1035 Voice Conversion Based on Trajectory Model Training of Neural Networks Considering Global Variance Naoki Hosaka, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda
Friday, 9 September 2016 11:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Fri-P-1-3-6 1131 Comparing Articulatory and Acoustic Strategies for Reducing Non-Native Accents Sandesh Aryal, Ricardo Gutierrez-Osuna
Friday, 9 September 2016 11:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Fri-P-1-3-7 345 Cross-Lingual Speaker Adaptation for Statistical Speech Synthesis Using Limited Data Seyyed Saeed Sarfjoo, Cenk Demiroglu
Friday, 9 September 2016 11:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Fri-P-1-3-8 1043 Personalized, Cross-Lingual TTS Using Phonetic Posteriorgrams Lifa Sun, Hao Wang, Shiyin Kang, Kun Li, Helen Meng
Friday, 9 September 2016 11:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Fri-P-1-3-9 1127 Acoustic Analysis of Syllables Across Indian Languages Anusha Prakash, Jeena J. Prakash, Hema A. Murthy
Friday, 9 September 2016 11:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Fri-P-1-3-10 421 Objective Evaluation Methods for Chinese Text-To-Speech Systems Teng Zhang, Zhipeng Chen, Ji Wu, Sam Lai, Wenhui Lei, Carsten Isert
Friday, 9 September 2016 11:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Fri-P-1-3-11 584 Objective Evaluation Using Association Between Dimensions Within Spectral Features for Statistical Parametric Speech Synthesis Yusuke Ijima, Taichi Asami, Hideyuki Mizuno
Friday, 9 September 2016 11:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Fri-P-1-3-12 847 A Hierarchical Predictor of Synthetic Speech Naturalness Using Neural Networks Takenori Yoshimura, Gustav Eje Henter, Oliver Watts, Mirjam Wester, Junichi Yamagishi, Keiichi Tokuda
Friday, 9 September 2016 11:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Fri-P-1-3-13 1376 Text-to-Speech for Individuals with Vision Loss: A User Study Monika Podsiadło, Shweta Chahar
Friday, 9 September 2016 11:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Fri-P-1-3-14 159 Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System Using Deep Recurrent Neural Networks Cassia Valentini-Botinhao, Xin Wang, Shinji Takaki, Junichi Yamagishi
Friday, 9 September 2016 11:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Fri-P-1-3-15 502 Data Selection and Adaptation for Naturalness in HMM-Based Speech Synthesis Erica Cooper, Alison Chang, Yocheved Levitan, Julia Hirschberg
Friday, 9 September 2016 11:00 Pacific Concourse - Poster D Topics in Speech Processing POSTER Fri-P-1-4-1 789 A Portable Automatic PA-TA-KA Syllable Detection System to Derive Biomarkers for Neurological Disorders Fei Tao, Louis Daudet, Christian Poellabauer, Sandra L. Schneider, Carlos Busso
Friday, 9 September 2016 11:00 Pacific Concourse - Poster D Topics in Speech Processing POSTER Fri-P-1-4-2 1045 Deep Neural Networks for i-Vector Language Identification of Short Utterances in Cars Omid Ghahabi, Antonio Bonafonte, Javier Hernando, Asunción Moreno
Friday, 9 September 2016 11:00 Pacific Concourse - Poster D Topics in Speech Processing POSTER Fri-P-1-4-3 339 Improving i-Vector and PLDA Based Speaker Clustering with Long-Term Features Abraham Woubie, Jordi Luque, Javier Hernando
Friday, 9 September 2016 11:00 Market Street Foyer Show & Tell Session 1   Fri-S&T-1-1 2000 Open Language Interface for Voice Exploitation (OLIVE) Aaron Lawson, Mitchell McLaren, Harry Bratt, Martin Graciarena, Horacio Franco, Christopher George, Allen Stauffer, Chris Bartels, Julien VanHout
Friday, 9 September 2016 11:00 Market Street Foyer Show & Tell Session 1   Fri-S&T-1-2 2002 A Multimodal Dialogue System for Air Traffic Control Trainees Based on Discrete-Event Simulation Luboš Šmídl, Adam Chýlek, Jan Švec
Friday, 9 September 2016 11:00 Market Street Foyer Show & Tell Session 1   Fri-S&T-1-3 2003 Lig-Aikuma: A Mobile App to Collect Parallel Speech for Under-Resourced Language Studies Elodie Gauthier, David Blachon, Laurent Besacier, Guy-Noël Kouarata, Martine Adda-Decker, Annie Rialland, Gilles Adda, Grégoire Bachman
Friday, 9 September 2016 11:00 Market Street Foyer Show & Tell Session 1   Fri-S&T-1-4 2004 ARET - Automatic Reading of Educational Texts for Visually Impaired Students Martin Grůber, Jindřich Matoušek, Zdeněk Hanzlíček, Zdeněk Krňoul, Zbyněk Zajíc
Friday, 9 September 2016 14:30 Grand Ballroom A New Trends in Neural Networks for Speech Recognition ORAL Fri-O-2-1-1 40 Segmental Recurrent Neural Networks for End-to-End Speech Recognition Liang Lu, Lingpeng Kong, Chris Dyer, Noah A. Smith, Steve Renals
Friday, 9 September 2016 14:50 Grand Ballroom A New Trends in Neural Networks for Speech Recognition ORAL Fri-O-2-1-2 212 Acoustic Modeling Using Bidirectional Gated Recurrent Convolutional Units Markus Nussbaum-Thom, Jia Cui, Bhuvana Ramabhadran, Vaibhava Goel
Friday, 9 September 2016 15:10 Grand Ballroom A New Trends in Neural Networks for Speech Recognition ORAL Fri-O-2-1-3 515 Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition Wei-Ning Hsu, Yu Zhang, Ann Lee, James Glass
Friday, 9 September 2016 15:30 Grand Ballroom A New Trends in Neural Networks for Speech Recognition ORAL Fri-O-2-1-4 580 Stimulated Deep Neural Network for Speech Recognition Chunyang Wu, Penny Karanasou, Mark J.F. Gales, Khe Chai Sim
Friday, 9 September 2016 15:50 Grand Ballroom A New Trends in Neural Networks for Speech Recognition ORAL Fri-O-2-1-5 1036 Phonetic Context Embeddings for DNN-HMM Phone Recognition Leonardo Badino
Friday, 9 September 2016 16:10 Grand Ballroom A New Trends in Neural Networks for Speech Recognition ORAL Fri-O-2-1-6 1446 Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks Ying Zhang, Mohammad Pezeshki, Philémon Brakel, Saizheng Zhang, César Laurent, Yoshua Bengio, Aaron Courville
Friday, 9 September 2016 14:30 Grand Ballroom BC Special Session: The RedDots Challenge: Towards Characterizing Speakers from Short Utterances ORAL Fri-O-2-2-1 929 Joint Speaker and Lexical Modeling for Short-Term Characterization of Speaker Guangsen Wang, Kong Aik Lee, Trung Hieu Nguyen, Hanwu Sun, Bin Ma
Friday, 9 September 2016 14:45 Grand Ballroom BC Special Session: The RedDots Challenge: Towards Characterizing Speakers from Short Utterances ORAL Fri-O-2-2-2 1465 Tandem Features for Text-Dependent Speaker Verification on the RedDots Corpus Md Jahangir Alam, Patrick Kenny, Vishwa Gupta
Friday, 9 September 2016 15:00 Grand Ballroom BC Special Session: The RedDots Challenge: Towards Characterizing Speakers from Short Utterances ORAL Fri-O-2-2-3 362 Text Dependent Speaker Verification Using Un-Supervised HMM-UBM and Temporal GMM-UBM Achintya Kr. Sarkar, Zheng-Hua Tan
Friday, 9 September 2016 15:15 Grand Ballroom BC Special Session: The RedDots Challenge: Towards Characterizing Speakers from Short Utterances ORAL Fri-O-2-2-4 1125 Utterance Verification for Text-Dependent Speaker Recognition: A Comparative Assessment Using the RedDots Corpus Tomi Kinnunen, Md. Sahidullah, Ivan Kukanov, Héctor Delgado, Massimiliano Todisco, Achintya Kr. Sarkar, Nicolai Bæk Thomsen, Ville Hautamäki, Nicholas Evans, Zheng-Hua Tan
Friday, 9 September 2016 15:30 Grand Ballroom BC Special Session: The RedDots Challenge: Towards Characterizing Speakers from Short Utterances ORAL Fri-O-2-2-5 825 Parallel Speaker and Content Modelling for Text-Dependent Speaker Verification Jianbo Ma, Saad Irtza, Kaavya Sriskandaraja, Vidhyasaharan Sethu, Eliathamby Ambikairajah
Friday, 9 September 2016 15:45 Grand Ballroom BC Special Session: The RedDots Challenge: Towards Characterizing Speakers from Short Utterances ORAL Fri-O-2-2-6 1174 i-Vector/HMM Based Text-Dependent Speaker Verification System for RedDots Challenge Hossein Zeinali, Hossein Sameti, Lukáš Burget, Jan Černocký, Nooshin Maghsoodi, Pavel Matějka
Friday, 9 September 2016 16:00 Grand Ballroom BC Special Session: The RedDots Challenge: Towards Characterizing Speakers from Short Utterances ORAL Fri-O-2-2-7 1001 Exploring Session Variability and Template Aging in Speaker Verification for Fixed Phrase Short Utterances Rohan Kumar Das, Sarfaraz Jelil, S.R. Mahadeva Prasanna
Friday, 9 September 2016 14:30 Bayview A Articulatory Measurements and Analysis ORAL Fri-O-2-3-1 1138 Prediction of the Articulatory Movements of Unseen Phonemes of a Speaker Using the Speech Structure of Another Speaker Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu
Friday, 9 September 2016 14:50 Bayview A Articulatory Measurements and Analysis ORAL Fri-O-2-3-2 1399 Vocal Tract Length Normalization for Speaker Independent Acoustic-to-Articulatory Speech Inversion Ganesh Sivaraman, Vikramjit Mitra, Hosung Nam, Mark Tiede, Carol Espy-Wilson
Friday, 9 September 2016 15:10 Bayview A Articulatory Measurements and Analysis ORAL Fri-O-2-3-3 157 Investigation of Speed-Accuracy Tradeoffs in Speech Production Using Real-Time Magnetic Resonance Imaging Adam C. Lammert, Christine H. Shadle, Shrikanth S. Narayanan, Thomas F. Quatieri
Friday, 9 September 2016 15:30 Bayview A Articulatory Measurements and Analysis ORAL Fri-O-2-3-4 583 Characterizing Vocal Tract Dynamics Across Speakers Using Real-Time MRI Tanner Sorensen, Asterios Toutios, Louis Goldstein, Shrikanth S. Narayanan
Friday, 9 September 2016 15:50 Bayview A Articulatory Measurements and Analysis ORAL Fri-O-2-3-5 78 Tracking Contours of Orofacial Articulators from Real-Time MRI of Speech Mathieu Labrunie, Pierre Badin, Dirk Voit, Arun A. Joseph, Laurent Lamalle, Coriandre Vilain, Louis-Jean Boë, Jens Frahm
Friday, 9 September 2016 16:10 Bayview A Articulatory Measurements and Analysis ORAL Fri-O-2-3-6 559 State-of-the-Art MRI Protocol for Comprehensive Assessment of Vocal Tract Structure and Function Sajan Goud Lingala, Asterios Toutios, Johannes Töger, Yongwan Lim, Yinghua Zhu, Yoon-Chul Kim, Colin Vaz, Shrikanth S. Narayanan, Krishna S. Nayak
Friday, 9 September 2016 14:30 Bayview B Automatic Assessment of Emotions ORAL Fri-O-2-4-1 488 DBN-ivector Framework for Acoustic Emotion Recognition Rui Xia, Yang Liu
Friday, 9 September 2016 14:50 Bayview B Automatic Assessment of Emotions ORAL Fri-O-2-4-2 867 An Investigation of Emotional Speech in Depression Classification Brian Stasak, Julien Epps, Nicholas Cummins, Roland Goecke
Friday, 9 September 2016 15:10 Bayview B Automatic Assessment of Emotions ORAL Fri-O-2-4-3 1052 Retrieving Categorical Emotions Using a Probabilistic Framework to Define Preference Learning Samples Reza Lotfian, Carlos Busso
Friday, 9 September 2016 15:30 Bayview B Automatic Assessment of Emotions ORAL Fri-O-2-4-4 1124 At the Border of Acoustics and Linguistics: Bag-of-Audio-Words for the Recognition of Emotions in Speech Maximilian Schmitt, Fabien Ringeval, Björn Schuller
Friday, 9 September 2016 15:50 Bayview B Automatic Assessment of Emotions ORAL Fri-O-2-4-5 1311 Speech Emotion Recognition Using Affective Saliency Arodami Chorianopoulou, Polychronis Koutsakis, Alexandros Potamianos
Friday, 9 September 2016 16:10 Bayview B Automatic Assessment of Emotions ORAL Fri-O-2-4-6 184 Laughter Valence Prediction in Motivational Interviewing Based on Lexical and Acoustic Cues Rahul Gupta, Nishant Nath, Taruna Agrawal, Panayiotis Georgiou, David C. Atkins, Shrikanth S. Narayanan
Friday, 9 September 2016 14:30 Seacliff BCD Acoustic and Articulatory Phonetics ORAL Fri-O-2-5-1 344 Respiratory Belts and Whistles: A Preliminary Study of Breathing Acoustics for Turn-Taking Marcin Włodarczak, Mattias Heldner
Friday, 9 September 2016 14:50 Seacliff BCD Acoustic and Articulatory Phonetics ORAL Fri-O-2-5-2 418 /r/ as Language Marker in Bilingual Speech Production and Perception Constantijn Kaland, Vincenzo Galatà, Lorenzo Spreafico, Alessandro Vietti
Friday, 9 September 2016 15:10 Seacliff BCD Acoustic and Articulatory Phonetics ORAL Fri-O-2-5-3 49 Evaluation of Phonatory Behavior of German and French Speakers in Native and Non-Native Speech Manfred Pützer, Frank Zimmerer, Wolfgang Wokurek, Jeanin Jügler
Friday, 9 September 2016 15:30 Seacliff BCD Acoustic and Articulatory Phonetics ORAL Fri-O-2-5-4 240 Today’s Most Frequently Used F0 Estimation Methods, and Their Accuracy in Estimating Male and Female Pitch in Clean Speech Sofia Strömbergsson
Friday, 9 September 2016 15:50 Seacliff BCD Acoustic and Articulatory Phonetics ORAL Fri-O-2-5-5 1447 A Praat-Based Algorithm to Extract the Amplitude Envelope and Temporal Fine Structure Using the Hilbert Transform Lei He, Volker Dellwo
Friday, 9 September 2016 16:10 Seacliff BCD Acoustic and Articulatory Phonetics ORAL Fri-O-2-5-6 1611 Likelihood Ratio Calculation in Acoustic-Phonetic Forensic Voice Comparison: Comparison of Three Statistical Modelling Approaches Ewald Enzinger
Friday, 9 September 2016 14:30 Seacliff A Source Separation and Spatial Audio ORAL Fri-O-2-6-1 987 A Sparse Spherical Harmonic-Based Model in Subbands for Head-Related Transfer Functions Xiaoke Qi, Jianhua Tao
Friday, 9 September 2016 14:50 Seacliff A Source Separation and Spatial Audio ORAL Fri-O-2-6-2 1176 Single-Channel Multi-Speaker Separation Using Deep Clustering Yusuf Isik, Jonathan Le Roux, Zhuo Chen, Shinji Watanabe, John R. Hershey
Friday, 9 September 2016 15:10 Seacliff A Source Separation and Spatial Audio ORAL Fri-O-2-6-3 120 Jointly Optimizing Activation Coefficients of Convolutive NMF Using DNN for Speech Separation Hao Li, Shuai Nie, Xueliang Zhang, Hui Zhang
Friday, 9 September 2016 15:30 Seacliff A Source Separation and Spatial Audio ORAL Fri-O-2-6-4 382 A Feature Study for Masking-Based Reverberant Speech Separation Masood Delfarah, DeLiang Wang
Friday, 9 September 2016 15:50 Seacliff A Source Separation and Spatial Audio ORAL Fri-O-2-6-5 415 Discriminative Layered Nonnegative Matrix Factorization for Speech Separation Chung-Chien Hsu, Tai-Shih Chi, Jen-Tzung Chien
Friday, 9 September 2016 16:10 Seacliff A Source Separation and Spatial Audio ORAL Fri-O-2-6-6 701 On Discriminative Framework for Single Channel Audio Source Separation Arpita Gang, Pravesh Biyani
Friday, 9 September 2016 14:30 Pacific Concourse - Poster A Special Session: Auditory-Visual Expressive Speech and Gesture in Humans and Machines POSTER Fri-P-2-1-1 380 Generating Natural Video Descriptions via Multimodal Processing Qin Jin, Junwei Liang, Xiaozhu Lin
Friday, 9 September 2016 14:30 Pacific Concourse - Poster A Special Session: Auditory-Visual Expressive Speech and Gesture in Humans and Machines POSTER Fri-P-2-1-2 163 Feature-Level Decision Fusion for Audio-Visual Word Prominence Detection Martin Heckmann
Friday, 9 September 2016 14:30 Pacific Concourse - Poster A Special Session: Auditory-Visual Expressive Speech and Gesture in Humans and Machines POSTER Fri-P-2-1-3 730 Acoustic and Visual Analysis of Expressive Speech: A Case Study of French Acted Speech Slim Ouni, Vincent Colotte, Sara Dahmani, Soumaya Azzi
Friday, 9 September 2016 14:30 Pacific Concourse - Poster A Special Session: Auditory-Visual Expressive Speech and Gesture in Humans and Machines POSTER Fri-P-2-1-4 75 Characterization of Audiovisual Dramatic Attitudes Adela Barbulescu, Rémi Ronfard, Gérard Bailly
Friday, 9 September 2016 14:30 Pacific Concourse - Poster A Special Session: Auditory-Visual Expressive Speech and Gesture in Humans and Machines POSTER Fri-P-2-1-5 846 Conversational Engagement Recognition Using Auditory and Visual Cues Yuyun Huang, Emer Gilmartin, Nick Campbell
Friday, 9 September 2016 14:30 Pacific Concourse - Poster A Special Session: Auditory-Visual Expressive Speech and Gesture in Humans and Machines POSTER Fri-P-2-1-6 85 An Acoustic Analysis of Child-Child and Child-Robot Interactions for Understanding Engagement during Speech-Controlled Computer Games Theodora Chaspari, Jill Fain Lehman
Friday, 9 September 2016 14:30 Pacific Concourse - Poster A Special Session: Auditory-Visual Expressive Speech and Gesture in Humans and Machines POSTER Fri-P-2-1-7 908 Auditory-Visual Lexical Tone Perception in Thai Elderly Listeners with and without Hearing Impairment Benjawan Kasisopa, Chutamanee Onsuwan, Charturong Tantibundhit, Nittayapa Klangpornkun, Suparak Techacharoenrungrueang, Sudaporn Luksaneeyanawin, Denis Burnham
Friday, 9 September 2016 14:30 Pacific Concourse - Poster A Special Session: Auditory-Visual Expressive Speech and Gesture in Humans and Machines POSTER Fri-P-2-1-8 407 Use of Agreement/Disagreement Classification in Dyadic Interactions for Continuous Emotion Recognition Hossein Khaki, Engin Erzin
Friday, 9 September 2016 14:30 Pacific Concourse - Poster B Special Session: Intelligibility Under the Microscope POSTER Fri-P-2-2-1 1119 Microscopic Multilingual Matrix Test Predictions Using an ASR-Based Speech Recognition Model Marc René Schädler, David Hülsmeier, Anna Warzybok, Sabine Hochmuth, Birger Kollmeier
Friday, 9 September 2016 14:30 Pacific Concourse - Poster B Special Session: Intelligibility Under the Microscope POSTER Fri-P-2-2-2 1285 DNN-Based Automatic Speech Recognition as a Model for Human Phoneme Perception Mats Exter, Bernd T. Meyer
Friday, 9 September 2016 14:30 Pacific Concourse - Poster B Special Session: Intelligibility Under the Microscope POSTER Fri-P-2-2-3 1030 Undoing Misperceptions: A Microscopic Analysis of Consistent Confusions Through Signal Modifications Attila Máté Tóth, Martin Cooke
Friday, 9 September 2016 14:30 Pacific Concourse - Poster B Special Session: Intelligibility Under the Microscope POSTER Fri-P-2-2-4 155 Blind Non-Intrusive Speech Intelligibility Prediction Using Twin-HMMs Mahdie Karbasi, Ahmed Hussen Abdelaziz, Hendrik Meutzner, Dorothea Kolossa
Friday, 9 September 2016 14:30 Pacific Concourse - Poster B Special Session: Intelligibility Under the Microscope POSTER Fri-P-2-2-5 24 Misperceptions Arising from Speech-in-Babble Interactions Attila Máté Tóth, Martin Cooke, Jon Barker
Friday, 9 September 2016 14:30 Pacific Concourse - Poster B Special Session: Intelligibility Under the Microscope POSTER Fri-P-2-2-6 267 Introducing Temporal Rate Coding for Speech in Cochlear Implants: A Microscopic Evaluation in Humans and Models Anja Eichenauer, Mathias Dietz, Bernd T. Meyer, Tim Jürgens
Friday, 9 September 2016 14:30 Pacific Concourse - Poster B Special Session: Intelligibility Under the Microscope POSTER Fri-P-2-2-7 330 Language Effects in Noise-Induced Word Misperceptions Maria Luisa Garcia Lecumberri, Jon Barker, Ricard Marxer, Martin Cooke
Friday, 9 September 2016 14:30 Pacific Concourse - Poster B Special Session: Intelligibility Under the Microscope POSTER Fri-P-2-2-8 343 Speech Reductions Cause a De-Weighting of Secondary Acoustic Cues Léo Varnet, Fanny Meunier, Michel Hoen
Friday, 9 September 2016 14:30 Pacific Concourse - Poster B Special Session: Intelligibility Under the Microscope POSTER Fri-P-2-2-9 431 Using Phonologically Weighted Levenshtein Distances for the Prediction of Microscopic Intelligibility Lionel Fontan, Isabelle Ferrané, Jérôme Farinas, Julien Pinquier, Xavier Aumont
Friday, 9 September 2016 14:30 Pacific Concourse - Poster B Special Session: Intelligibility Under the Microscope POSTER Fri-P-2-2-10 697 The Impact of Manner of Articulation on the Intelligibility of Voicing Contrast in Noise: Cross-Linguistic Implications Mayuki Matsui
Friday, 9 September 2016 14:30 Pacific Concourse - Poster B Special Session: Intelligibility Under the Microscope POSTER Fri-P-2-2-11 932 Directly Comparing the Listening Strategies of Humans and Machines Michael I. Mandel
Friday, 9 September 2016 14:30 Pacific Concourse - Poster C Spoken Documents, Spoken Understanding and Semantic Analysis POSTER Fri-P-2-3-1 288 LSTM-Based NeuroCRFs for Named Entity Recognition Marc-Antoine Rondeau, Yi Su
Friday, 9 September 2016 14:30 Pacific Concourse - Poster C Spoken Documents, Spoken Understanding and Semantic Analysis POSTER Fri-P-2-3-2 710 Exploring Word Mover’s Distance and Semantic-Aware Embedding Techniques for Extractive Broadcast News Summarization Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, Wen-Lian Hsu
Friday, 9 September 2016 14:30 Pacific Concourse - Poster C Spoken Documents, Spoken Understanding and Semantic Analysis POSTER Fri-P-2-3-3 1219 Improved Neural Bag-of-Words Model to Retrieve Out-of-Vocabulary Words in Speech Recognition Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linarès
Friday, 9 September 2016 14:30 Pacific Concourse - Poster C Spoken Documents, Spoken Understanding and Semantic Analysis POSTER Fri-P-2-3-4 855 Beyond Utterance Extraction: Summary Recombination for Speech Summarization Jérémy Trione, Benoit Favre, Frederic Bechet
Friday, 9 September 2016 14:30 Pacific Concourse - Poster C Spoken Documents, Spoken Understanding and Semantic Analysis POSTER Fri-P-2-3-5 1352 Attention-Based Recurrent Neural Network Models for Joint Intent Detection and Slot Filling Bing Liu, Ian Lane
Friday, 9 September 2016 14:30 Pacific Concourse - Poster C Spoken Documents, Spoken Understanding and Semantic Analysis POSTER Fri-P-2-3-6 1598 Domain Adaptation of Recurrent Neural Networks for Natural Language Understanding Aaron Jaech, Larry Heck, Mari Ostendorf
Friday, 9 September 2016 14:30 Pacific Concourse - Poster C Spoken Documents, Spoken Understanding and Semantic Analysis POSTER Fri-P-2-3-7 1583 LatticeRnn: Recurrent Neural Networks Over Lattices Faisal Ladhak, Ankur Gandhe, Markus Dreyer, Lambert Mathias, Ariya Rastrow, Björn Hoffmeister
Friday, 9 September 2016 14:30 Pacific Concourse - Poster C Spoken Documents, Spoken Understanding and Semantic Analysis POSTER Fri-P-2-3-8 1634 Learning Document Representations Using Subspace Multinomial Model Santosh Kesiraju, Lukáš Burget, Igor Szőke, Jan Černocký
Friday, 9 September 2016 14:30 Pacific Concourse - Poster C Spoken Documents, Spoken Understanding and Semantic Analysis POSTER Fri-P-2-3-9 354 Attention-Based Convolutional Neural Networks for Sentence Classification Zhiwei Zhao, Youzheng Wu
Friday, 9 September 2016 14:30 Pacific Concourse - Poster C Spoken Documents, Spoken Understanding and Semantic Analysis POSTER Fri-P-2-3-10 50 Spoken Language Understanding in a Latent Topic-Based Subspace Mohamed Morchid, Mohamed Bouaziz, Waad Ben Kheder, Killian Janod, Pierre-Michel Bousquet, Richard Dufour, Georges Linarès
Friday, 9 September 2016 14:30 Pacific Concourse - Poster C Spoken Documents, Spoken Understanding and Semantic Analysis POSTER Fri-P-2-3-11 402 Multi-Domain Joint Semantic Frame Parsing Using Bi-Directional RNN-LSTM Dilek Hakkani-Tür, Gokhan Tur, Asli Celikyilmaz, Yun-Nung Chen, Jianfeng Gao, Li Deng, Ye-Yi Wang
Friday, 9 September 2016 14:30 Pacific Concourse - Poster C Spoken Documents, Spoken Understanding and Semantic Analysis POSTER Fri-P-2-3-12 63 Deep Stacked Autoencoders for Spoken Language Understanding Killian Janod, Mohamed Morchid, Richard Dufour, Georges Linarès, Renato De Mori
Friday, 9 September 2016 14:30 Pacific Concourse - Poster C Spoken Documents, Spoken Understanding and Semantic Analysis POSTER Fri-P-2-3-13 727 Labeled Data Generation with Encoder-Decoder LSTM for Semantic Slot Filling Gakuto Kurata, Bing Xiang, Bowen Zhou
Friday, 9 September 2016 14:30 Pacific Concourse - Poster C Spoken Documents, Spoken Understanding and Semantic Analysis POSTER Fri-P-2-3-14 511 Exploring the Correlation of Pitch Accents and Semantic Slots for Spoken Language Understanding Sabrina Stehwien, Ngoc Thang Vu
Friday, 9 September 2016 14:30 Pacific Concourse - Poster C Spoken Documents, Spoken Understanding and Semantic Analysis POSTER Fri-P-2-3-15 964 Analysis on Gated Recurrent Unit Based Question Detection Approach Yaodong Tang, Zhiyong Wu, Helen Meng, Mingxing Xu, Lianhong Cai
Friday, 9 September 2016 14:30 Pacific Concourse - Poster D Spoken Term Detection POSTER Fri-P-2-4-1 1259 Combining State-Level Spotting and Posterior-Based Acoustic Match for Improved Query-by-Example Spoken Term Detection Shuji Oishi, Tatsuya Matsuba, Mitsuaki Makino, Atsuhiko Kai
Friday, 9 September 2016 14:30 Pacific Concourse - Poster D Spoken Term Detection POSTER Fri-P-2-4-2 606 A Novel Discriminative Score Calibration Method for Keyword Search Zhiqiang Lv, Meng Cai, Wei-Qiang Zhang, Jia Liu
Friday, 9 September 2016 14:30 Pacific Concourse - Poster D Spoken Term Detection POSTER Fri-P-2-4-3 1276 Segmented Dynamic Time Warping for Spoken Query-by-Example Search Jorge Proença, Fernando Perdigão
Friday, 9 September 2016 14:30 Pacific Concourse - Poster D Spoken Term Detection POSTER Fri-P-2-4-4 838 Generating Complementary Acoustic Model Spaces in DNN-Based Sequence-to-Frame DTW Scheme for Out-of-Vocabulary Spoken Term Detection Shi-wook Lee, Kazuyo Tanaka, Yoshiaki Itoh
Friday, 9 September 2016 14:30 Pacific Concourse - Poster D Spoken Term Detection POSTER Fri-P-2-4-5 1485 Multi-Task Learning and Weighted Cross-Entropy for DNN-Based Keyword Spotting Sankaran Panchapagesan, Ming Sun, Aparna Khare, Spyros Matsoukas, Arindam Mandal, Björn Hoffmeister, Shiv Vitaladevuni
Friday, 9 September 2016 14:30 Pacific Concourse - Poster D Spoken Term Detection POSTER Fri-P-2-4-6 82 Audio Word2Vec: Unsupervised Learning of Audio Segment Representations Using Sequence-to-Sequence Autoencoder Yu-An Chung, Chao-Chung Wu, Chia-Hao Shen, Hung-Yi Lee, Lin-Shan Lee
Friday, 9 September 2016 14:30 Pacific Concourse - Poster D Spoken Term Detection POSTER Fri-P-2-4-7 642 Non-Uniform Boosted MCE Training of Deep Neural Networks for Keyword Spotting Zhong Meng, Biing-Hwang Juang
Friday, 9 September 2016 14:30 Pacific Concourse - Poster D Spoken Term Detection POSTER Fri-P-2-4-8 1200 Language Model Data Augmentation for Keyword Spotting in Low-Resourced Training Conditions Arseniy Gorin, Rasa Lileikytė, Guangpu Huang, Lori Lamel, Jean-Luc Gauvain, Antoine Laurent
Friday, 9 September 2016 14:30 Market Street Foyer Show & Tell Session 2   Fri-S&T-2-1 2006 STON: Efficient Subtitling in Dutch Using State-of-the-Art Tools Lyan Verwimp, Brecht Desplanques, Kris Demuynck, Joris Pelemans, Marieke Lycke, Patrick Wambacq
Friday, 9 September 2016 14:30 Market Street Foyer Show & Tell Session 2   Fri-S&T-2-2 2007 An Automatic Training Tool for Air Traffic Control Training Petr Stanislav, Luboš Šmídl, Jan Švec
Friday, 9 September 2016 14:30 Market Street Foyer Show & Tell Session 2   Fri-S&T-2-3 2008 Digitala: An Augmented Test and Review Process Prototype for High-Stakes Spoken Foreign Language Examination Reima Karhila, Aku Rouhe, Peter Smit, André Mansikkaniemi, Heini Kallio, Erik Lindroos, Raili Hildén, Martti Vainio, Mikko Kurimo
Friday, 9 September 2016 14:30 Market Street Foyer Show & Tell Session 2   Fri-S&T-2-4 2009 Exploring Collections of Multimedia Archives Through Innovative Interfaces in the Context of Digital Humanities Géraldine Damnati, Delphine Charlet, Marc Denjean
Friday, 9 September 2016 17:00 Grand Ballroom A "Feature Extraction and Acoustic Modeling Using Neural Networks for ASR" ORAL Fri-O-3-1-1 317 Learning Neural Network Representations Using Cross-Lingual Bottleneck Features with Word-Pair Information Yougen Yuan, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li
Friday, 9 September 2016 17:20 Grand Ballroom A "Feature Extraction and Acoustic Modeling Using Neural Networks for ASR" ORAL Fri-O-3-1-2 542 Novel Front-End Features Based on Neural Graph Embeddings for DNN-HMM and LSTM-CTC Acoustic Modeling Yuzong Liu, Katrin Kirchhoff
Friday, 9 September 2016 17:40 Grand Ballroom A "Feature Extraction and Acoustic Modeling Using Neural Networks for ASR" ORAL Fri-O-3-1-3 925 Articulatory Feature Extraction Using CTC to Build Articulatory Classifiers Without Forced Frame Alignments for Speech Recognition Basil Abraham, S. Umesh, Neethu Mariam Joy
Friday, 9 September 2016 18:00 Grand Ballroom A "Feature Extraction and Acoustic Modeling Using Neural Networks for ASR" ORAL Fri-O-3-1-4 1406 On the Role of Nonlinear Transformations in Deep Neural Network Acoustic Models Tasha Nagamine, Michael L. Seltzer, Nima Mesgarani
Friday, 9 September 2016 18:20 Grand Ballroom A "Feature Extraction and Acoustic Modeling Using Neural Networks for ASR" ORAL Fri-O-3-1-5 1459 Complex Linear Projection (CLP): A Discriminative Approach to Joint Feature Extraction and Acoustic Modeling Ehsan Variani, Tara N. Sainath, Izhak Shafran, Michiel Bacchiani
Friday, 9 September 2016 18:40 Grand Ballroom A "Feature Extraction and Acoustic Modeling Using Neural Networks for ASR" ORAL Fri-O-3-1-6 84 Modeling Time-Frequency Patterns with LSTM vs. Convolutional Architectures for LVCSR Tasks Tara N. Sainath, Bo Li
Friday, 9 September 2016 17:00 Grand Ballroom BC Special Session: The Speakers in the Wild (SITW) Speaker Recognition Challenge ORAL Fri-O-3-2-1 1129 The Speakers in the Wild (SITW) Speaker Recognition Database Mitchell McLaren, Luciana Ferrer, Diego Castan, Aaron Lawson
Friday, 9 September 2016 17:15 Grand Ballroom BC Special Session: The Speakers in the Wild (SITW) Speaker Recognition Challenge ORAL Fri-O-3-2-2 1137 The 2016 Speakers in the Wild Speaker Recognition Evaluation Mitchell McLaren, Luciana Ferrer, Diego Castan, Aaron Lawson
Friday, 9 September 2016 17:30 Grand Ballroom BC Special Session: The Speakers in the Wild (SITW) Speaker Recognition Challenge ORAL Fri-O-3-2-3 981 Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge Ondřej Novotný, Pavel Matějka, Oldřich Plchot, Ondřej Glembek, Lukáš Burget, Jan Černocký
Friday, 9 September 2016 17:45 Grand Ballroom BC Special Session: The Speakers in the Wild (SITW) Speaker Recognition Challenge ORAL Fri-O-3-2-4 1197 A Speaker Recognition System for the SITW Challenge Oleg Kudashev, Sergey Novoselov, Konstantin Simonchik, Alexandr Kozlov
Friday, 9 September 2016 18:00 Grand Ballroom BC Special Session: The Speakers in the Wild (SITW) Speaker Recognition Challenge ORAL Fri-O-3-2-5 945 Speakers In The Wild (SITW): The QUT Speaker Recognition System H. Ghaemmaghami, M.H. Rahman, Ivan Himawan, David Dean, Ahilan Kanagasundaram, Sridha Sridharan, Clinton Fookes
Friday, 9 September 2016 18:15 Grand Ballroom BC Special Session: The Speakers in the Wild (SITW) Speaker Recognition Challenge ORAL Fri-O-3-2-6 1378 AUT System for SITW Speaker Recognition Challenge Abbas Khosravani, Mohammad Mehdi Homayounpour
Friday, 9 September 2016 18:30 Grand Ballroom BC Special Session: The Speakers in the Wild (SITW) Speaker Recognition Challenge ORAL Fri-O-3-2-7 1310 LIA System for the SITW Speaker Recognition Challenge Waad Ben Kheder, Moez Ajili, Pierre-Michel Bousquet, Driss Matrouf, Jean-François Bonastre
Friday, 9 September 2016 18:45 Grand Ballroom BC Special Session: The Speakers in the Wild (SITW) Speaker Recognition Challenge ORAL Fri-O-3-2-8 1144 Investigating Various Diarization Algorithms for Speaker in the Wild (SITW) Speaker Recognition Challenge Yi Liu, Yao Tian, Liang He, Jia Liu
Friday, 9 September 2016 17:00 Bayview A "Non-Native Speech Perception" ORAL Fri-O-3-3-1 1095 Does the Importance of Word-Initial and Word-Final Information Differ in Native versus Non-Native Spoken-Word Recognition? Odette Scharenborg, Juul Coumans, Sofoklis Kakouros, Roeland van Hout
Friday, 9 September 2016 17:20 Bayview A "Non-Native Speech Perception" ORAL Fri-O-3-3-2 19 The Effect of Sentence Accent on Non-Native Speech Perception in Noise Odette Scharenborg, Elea Kolkman, Sofoklis Kakouros, Brechtje Post
Friday, 9 September 2016 17:40 Bayview A "Non-Native Speech Perception" ORAL Fri-O-3-3-3 41 The Effects of Modified Speech Styles on Intelligibility for Non-Native Listeners Martin Cooke, Maria Luisa Garcia Lecumberri
Friday, 9 September 2016 18:00 Bayview A "Non-Native Speech Perception" ORAL Fri-O-3-3-4 887 The Influence of Language Experience on the Categorical Perception of Vowels: Evidence from Mandarin and Korean Hao Zhang, Fei Chen, Nan Yan, Lan Wang, Feng Shi, Manwa L. Ng
Friday, 9 September 2016 18:20 Bayview A "Non-Native Speech Perception" ORAL Fri-O-3-3-5 37 Multiple Influences on Vocabulary Acquisition: Parental Input Dominates Dominic W. Massaro
Friday, 9 September 2016 18:40 Bayview A "Non-Native Speech Perception" ORAL Fri-O-3-3-6 76 Can Intensive Exposure to Foreign Language Sounds Affect the Perception of Native Sounds? Jian Gong, Maria Luisa Garcia Lecumberri, Martin Cooke
Friday, 9 September 2016 17:00 Bayview B Behavioral Signal Processing and Speaker State and Traits Analytics ORAL Fri-O-3-4-1 1569 Privacy-Preserving Speech Analytics for Automatic Assessment of Student Collaboration Nikoletta Bassiou, Andreas Tsiartas, Jennifer Smith, Harry Bratt, Colleen Richey, Elizabeth Shriberg, Cynthia D’Angelo, Nonye Alozie
Friday, 9 September 2016 17:20 Bayview B Behavioral Signal Processing and Speaker State and Traits Analytics ORAL Fri-O-3-4-2 1367 Complexity in Prosody: A Nonlinear Dynamical Systems Approach for Dyadic Conversations; Behavior and Outcomes in Couples Therapy Md. Nasir, Brian Baucom, Shrikanth S. Narayanan, Panayiotis Georgiou
Friday, 9 September 2016 17:40 Bayview B Behavioral Signal Processing and Speaker State and Traits Analytics ORAL Fri-O-3-4-3 1186 Couples Behavior Modeling and Annotation Using Low-Resource LSTM Language Models Shao-Yen Tseng, Sandeep Nallan Chakravarthula, Brian Baucom, Panayiotis Georgiou
Friday, 9 September 2016 18:00 Bayview B Behavioral Signal Processing and Speaker State and Traits Analytics ORAL Fri-O-3-4-4 459 Speech Likability and Personality-Based Social Relations: A Round-Robin Analysis over Communication Channels Laura Fernández Gallardo, Benjamin Weiss
Friday, 9 September 2016 18:20 Bayview B Behavioral Signal Processing and Speaker State and Traits Analytics ORAL Fri-O-3-4-5 1560 Behavioral Coding of Therapist Language in Addiction Counseling Using Recurrent Neural Networks Bo Xiao, Doğan Can, James Gibson, Zac E. Imel, David C. Atkins, Panayiotis Georgiou, Shrikanth S. Narayanan
Friday, 9 September 2016 18:40 Bayview B Behavioral Signal Processing and Speaker State and Traits Analytics ORAL Fri-O-3-4-6 880 Factor Analysis Based Speaker Normalisation for Continuous Emotion Prediction Ting Dang, Vidhyasaharan Sethu, Eliathamby Ambikairajah
Friday, 9 September 2016 17:00 Seacliff BCD Spoken Term Detection ORAL Fri-O-3-5-1 1278 Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection Dhananjay Ram, Afsaneh Asaei, Hervé Bourlard
Friday, 9 September 2016 17:20 Seacliff BCD Spoken Term Detection ORAL Fri-O-3-5-2 313 Unsupervised Bottleneck Features for Low-Resource Query-by-Example Spoken Term Detection Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li
Friday, 9 September 2016 17:40 Seacliff BCD Spoken Term Detection ORAL Fri-O-3-5-3 315 A Nonparametric Bayesian Approach for Spoken Term Detection by Example Query Amir Hossein Harati Nejad Torbati, Joseph Picone
Friday, 9 September 2016 18:00 Seacliff BCD Spoken Term Detection ORAL Fri-O-3-5-4 646 Rescoring Hypothesized Detections of Out-of-Vocabulary Keywords Using Subword Samples Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li
Friday, 9 September 2016 18:20 Seacliff BCD Spoken Term Detection ORAL Fri-O-3-5-5 753 Unrestricted Vocabulary Keyword Spotting Using LSTM-CTC Yimeng Zhuang, Xuankai Chang, Yanmin Qian, Kai Yu
Friday, 9 September 2016 18:40 Seacliff BCD Spoken Term Detection ORAL Fri-O-3-5-6 1237 Interactive Spoken Content Retrieval by Deep Reinforcement Learning Yen-Chen Wu, Tzu-Hsiang Lin, Yang-De Chen, Hung-Yi Lee, Lin-Shan Lee
Friday, 9 September 2016 17:00 Seacliff A Co-Inference of Production and Acoustics ORAL Fri-O-3-6-1 1362 Relating Estimated Cyclic Spectral Peak Frequency to Measured Epilarynx Length Using Magnetic Resonance Imaging Elizabeth Godoy, Andrew Dumas, Jennifer Melot, Nicolas Malyska, Thomas F. Quatieri
Friday, 9 September 2016 17:20 Seacliff A Co-Inference of Production and Acoustics ORAL Fri-O-3-6-2 1196 Acoustic-to-Articulatory Inversion Mapping Based on Latent Trajectory Gaussian Mixture Model Patrick Lumban Tobing, Tomoki Toda, Hirokazu Kameoka, Satoshi Nakamura
Friday, 9 September 2016 17:40 Seacliff A Co-Inference of Production and Acoustics ORAL Fri-O-3-6-3 490 Formant Estimation and Tracking Using Deep Learning Yehoshua Dissen, Joseph Keshet
Friday, 9 September 2016 18:00 Seacliff A Co-Inference of Production and Acoustics ORAL Fri-O-3-6-4 571 Convex Hull Convolutive Non-Negative Matrix Factorization for Uncovering Temporal Patterns in Multivariate Time-Series Data Colin Vaz, Asterios Toutios, Shrikanth S. Narayanan
Friday, 9 September 2016 18:20 Seacliff A Co-Inference of Production and Acoustics ORAL Fri-O-3-6-5 735 Majorisation-Minimisation Based Optimisation of the Composite Autoregressive System with Application to Glottal Inverse Filtering Lauri Juvela, Hirokazu Kameoka, Manu Airaksinen, Junichi Yamagishi, Paavo Alku
Friday, 9 September 2016 18:40 Seacliff A Co-Inference of Production and Acoustics ORAL Fri-O-3-6-6 653 F0 Contour Analysis Based on Empirical Mode Decomposition for DNN Acoustic Modeling in Mandarin Speech Recognition Xiaoyun Wang, Xugang Lu, Hisashi Kawai, Seiichi Yamamoto
Friday, 9 September 2016 17:00 Pacific Concourse - Poster A Acoustic and Articulatory Phonetics POSTER Fri-P-3-1-1 29 Vowels and Diphthongs in Cangnan Southern Min Chinese Dialect Fang Hu, Chunyu Ge
Friday, 9 September 2016 17:00 Pacific Concourse - Poster A Acoustic and Articulatory Phonetics POSTER Fri-P-3-1-2 61 Diphthongization of Nuclear Vowels and the Emergence of a Tetraphthong in Hetang Cantonese Wenqi Hu, Fang Hu, Jian Jin
Friday, 9 September 2016 17:00 Pacific Concourse - Poster A Acoustic and Articulatory Phonetics POSTER Fri-P-3-1-3 235 PhonVoc: A Phonetic and Phonological Vocoding Toolkit Milos Cernak, Philip N. Garner
Friday, 9 September 2016 17:00 Pacific Concourse - Poster A Acoustic and Articulatory Phonetics POSTER Fri-P-3-1-4 249 Vowels and Diphthongs in the Taiyuan Jin Chinese Dialect Liping Xia, Fang Hu
Friday, 9 September 2016 17:00 Pacific Concourse - Poster A Acoustic and Articulatory Phonetics POSTER Fri-P-3-1-5 1323 The Effects of Prosody on French V-to-V Coarticulation: A Corpus-Based Study Giuseppina Turco, Cécile Fougeron, Nicolas Audibert
Friday, 9 September 2016 17:00 Pacific Concourse - Poster A Acoustic and Articulatory Phonetics POSTER Fri-P-3-1-6 434 An Acoustic Analysis of /r/ in Tyrolean Vincenzo Galatà, Lorenzo Spreafico, Alessandro Vietti, Constantijn Kaland
Friday, 9 September 2016 17:00 Pacific Concourse - Poster A Acoustic and Articulatory Phonetics POSTER Fri-P-3-1-7 23 Hyperarticulated Production of Korean Glides by Age Group Seung-Eun Chang, Minsook Kim
Friday, 9 September 2016 17:00 Pacific Concourse - Poster A Acoustic and Articulatory Phonetics POSTER Fri-P-3-1-8 597 Coda Stop and Taiwan Min Checked Tone Sound Changes Ho-hsien Pan, Hsiao-tung Huang, Shao-ren Lyu
Friday, 9 September 2016 17:00 Pacific Concourse - Poster B Prosody, Phonation and Voice Quality POSTER Fri-P-3-2-1 611 The Influence of Modality and Speaking Style on the Assimilation Type and Categorization Consistency of Non-Native Speech Sarah E. Fenwick, Catherine T. Best, Chris Davis, Michael D. Tyler
Friday, 9 September 2016 17:00 Pacific Concourse - Poster B Prosody, Phonation and Voice Quality POSTER Fri-P-3-2-2 238 Prosodic Convergence with Spoken Stimuli in Laboratory Data Margaret Zellers
Friday, 9 September 2016 17:00 Pacific Concourse - Poster B Prosody, Phonation and Voice Quality POSTER Fri-P-3-2-3 1057 Effects of Stress on Fricatives: Evidence from Standard Modern Greek Charalambos Themistocleous, Angelandria Savva, Andrie Aristodemou
Friday, 9 September 2016 17:00 Pacific Concourse - Poster B Prosody, Phonation and Voice Quality POSTER Fri-P-3-2-4 824 Analysis of Chinese Syllable Durations in Running Speech of Japanese L2 Learners Yue Sun, Shudon Hsiao, Yoshinori Sagisaka, Jinsong Zhang
Friday, 9 September 2016 17:00 Pacific Concourse - Poster B Prosody, Phonation and Voice Quality POSTER Fri-P-3-2-5 992 Automatic Paragraph Segmentation with Lexical and Prosodic Features Catherine Lai, Mireia Farrús, Johanna D. Moore
Friday, 9 September 2016 17:00 Pacific Concourse - Poster B Prosody, Phonation and Voice Quality POSTER Fri-P-3-2-6 338 Automatic Glottal Inverse Filtering with Non-Negative Matrix Factorization Manu Airaksinen, Lauri Juvela, Tom Bäckström, Paavo Alku
Friday, 9 September 2016 17:00 Pacific Concourse - Poster B Prosody, Phonation and Voice Quality POSTER Fri-P-3-2-7 523 Speaker Identity and Voice Quality: Modeling Human Responses and Automatic Speaker Recognition Soo Jin Park, Caroline Sigouin, Jody Kreiman, Patricia Keating, Jinxi Guo, Gary Yeung, Fang-Yu Kuo, Abeer Alwan
Friday, 9 September 2016 17:00 Pacific Concourse - Poster B Prosody, Phonation and Voice Quality POSTER Fri-P-3-2-8 877 Analysis of Glottal Stop in Assam Sora Language Sishir Kalita, Luke Horo, Priyankoo Sarmah, S.R. Mahadeva Prasanna, S. Dandapat
Friday, 9 September 2016 17:00 Pacific Concourse - Poster B Prosody, Phonation and Voice Quality POSTER Fri-P-3-2-9 1472 Acoustic Differences Between English /t/ Glottalization and Phrasal Creak Marc Garellek, Scott Seyfarth
Friday, 9 September 2016 17:00 Pacific Concourse - Poster B Prosody, Phonation and Voice Quality POSTER Fri-P-3-2-10 348 The Acoustics of Lexical Stress in Italian as a Function of Stress Level and Speaking Style Anders Eriksson, Pier Marco Bertinetto, Mattias Heldner, Rosalba Nodari, Giovanna Lenoci
Friday, 9 September 2016 17:00 Pacific Concourse - Poster B Prosody, Phonation and Voice Quality POSTER Fri-P-3-2-11 405 Cross-Gender and Cross-Dialect Tone Recognition for Vietnamese Antje Schweitzer, Ngoc Thang Vu
Friday, 9 September 2016 17:00 Pacific Concourse - Poster B Prosody, Phonation and Voice Quality POSTER Fri-P-3-2-12 914 Prosody Modification Using Allpass Residual of Speech Signals Karthika Vijayan, K. Sri Rama Murty
Friday, 9 September 2016 17:00 Pacific Concourse - Poster B Prosody, Phonation and Voice Quality POSTER Fri-P-3-2-13 926 Analyzing the Contribution of Top-Down Lexical and Bottom-Up Acoustic Cues in the Detection of Sentence Prominence Sofoklis Kakouros, Joris Pelemans, Lyan Verwimp, Patrick Wambacq, Okko Räsänen
Friday, 9 September 2016 17:00 Pacific Concourse - Poster B Prosody, Phonation and Voice Quality POSTER Fri-P-3-2-14 1396 A Longitudinal Study of Children’s Intonation in Narrative Speech Jeffrey Kallay, Melissa A. Redford
Friday, 9 September 2016 17:00 Pacific Concourse - Poster C Speech Production Analysis and Modeling POSTER Fri-P-3-3-1 1408 Velum Control for Oral Sounds Reed Blaylock, Louis Goldstein, Shrikanth S. Narayanan
Friday, 9 September 2016 17:00 Pacific Concourse - Poster C Speech Production Analysis and Modeling POSTER Fri-P-3-3-2 651 F0 Development in Acquiring Korean Stop Distinction Gayeon Son
Friday, 9 September 2016 17:00 Pacific Concourse - Poster C Speech Production Analysis and Modeling POSTER Fri-P-3-3-3 1146 Phonetic Reduction Can Lead to Lengthening, and Enhancement Can Lead to Shortening Clara Cohen, Matt Carlson
Friday, 9 September 2016 17:00 Pacific Concourse - Poster C Speech Production Analysis and Modeling POSTER Fri-P-3-3-4 189 Mechanical Production of [b], [m] and [w] Using Controlled Labial and Velopharyngeal Gestures Takayuki Arai
Friday, 9 September 2016 17:00 Pacific Concourse - Poster C Speech Production Analysis and Modeling POSTER Fri-P-3-3-5 901 An Improved 3D Geometric Tongue Model Qiang Fang, Yun Chen, Haibo Wang, Jianguo Wei, Jianrong Wang, Xiyu Wu, Aijun Li
Friday, 9 September 2016 17:00 Pacific Concourse - Poster C Speech Production Analysis and Modeling POSTER Fri-P-3-3-6 1199 Congruency Effect Between Articulation and Grasping in Native English Speakers Mikko Tiainen, Fatima M. Felisberti, Kaisa Tiippana, Martti Vainio, Juraj Simko, Jiri Lukavsky, Lari Vainio
Friday, 9 September 2016 17:00 Pacific Concourse - Poster C Speech Production Analysis and Modeling POSTER Fri-P-3-3-7 1126 Emergence of Vocal Developmental Sequences in a Predictive Coding Model of Speech Acquisition Shamima Najnin, Bonny Banerjee
Friday, 9 September 2016 17:00 Pacific Concourse - Poster C Speech Production Analysis and Modeling POSTER Fri-P-3-3-8 379 Categorization of Natural Spanish Whistled Vowels by Naïve Spanish Listeners Julien Meyer, Laure Dentel, Fanny Meunier
Friday, 9 September 2016 17:00 Pacific Concourse - Poster C Speech Production Analysis and Modeling POSTER Fri-P-3-3-9 1506 Between- and Within-Speaker Effects of Bilingualism on F0 Variation Rob Voigt, Dan Jurafsky, Meghan Sumner
Friday, 9 September 2016 17:00 Pacific Concourse - Poster C Speech Production Analysis and Modeling POSTER Fri-P-3-3-10 1630 Vowel Characteristics in the Assessment of L2 English Pronunciation Calbert Graham, Paula Buttery, Francis Nolan
Friday, 9 September 2016 17:00 Pacific Concourse - Poster C Speech Production Analysis and Modeling POSTER Fri-P-3-3-11 1082 Kulning (Swedish Cattle Calls): Acoustic, EGG, Stroboscopic and High-Speed Video Analyses of an Unusual Singing Style Ahmed Geneid, Anne-Maria Laukkanen, Anita McAllister, Robert Eklund
Friday, 9 September 2016 17:00 Pacific Concourse - Poster C Speech Production Analysis and Modeling POSTER Fri-P-3-3-12 1496 Glottal Squeaks in VC Sequences Míša Hejná, Pertti Palo, Scott Moisik
Friday, 9 September 2016 17:00 Pacific Concourse - Poster C Speech Production Analysis and Modeling POSTER Fri-P-3-3-13 761 Automatic Pronunciation Generation by Utilizing a Semi-Supervised Deep Neural Networks Naoya Takahashi, Tofigh Naghibi, Beat Pfister
Friday, 9 September 2016 17:00 Pacific Concourse - Poster D Spoken Dialogue Systems POSTER Fri-P-3-4-1 1172 Personalized Natural Language Understanding Xiaohu Liu, Ruhi Sarikaya, Liang Zhao, Yong Ni, Yi-Cheng Pan
Friday, 9 September 2016 17:00 Pacific Concourse - Poster D Spoken Dialogue Systems POSTER Fri-P-3-4-2 1175 A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue Systems Layla El Asri, Jing He, Kaheer Suleman
Friday, 9 September 2016 17:00 Pacific Concourse - Poster D Spoken Dialogue Systems POSTER Fri-P-3-4-3 1273 Root Cause Analysis of Miscommunication Hotspots in Spoken Dialogue Systems Spiros Georgiladakis, Georgia Athanasopoulou, Raveesh Meena, José Lopes, Arodami Chorianopoulou, Elisavet Palogiannidi, Elias Iosif, Gabriel Skantze, Alexandros Potamianos
Friday, 9 September 2016 17:00 Pacific Concourse - Poster D Spoken Dialogue Systems POSTER Fri-P-3-4-4 1534 Making Personal Digital Assistants Aware of What They Do Not Know Omar Zia Khan, Ruhi Sarikaya
Friday, 9 September 2016 17:00 Pacific Concourse - Poster D Spoken Dialogue Systems POSTER Fri-P-3-4-5 985 Implementing Acoustic-Prosodic Entrainment in a Conversational Avatar Rivka Levitan, Štefan Beňuš, Ramiro H. Gálvez, Agustín Gravano, Florencia Savoretti, Marian Trnka, Andreas Weise, Julia Hirschberg
Friday, 9 September 2016 17:00 Pacific Concourse - Poster D Spoken Dialogue Systems POSTER Fri-P-3-4-6 99 Perceived Usability and Cognitive Demand of Secondary Tasks in Spoken Versus Visual-Manual Automotive Interaction Annika Silvervarg, Sofia Lindvall, Jonatan Andersson, Ida Esberg, Christian Jernberg, Filip Frumerie, Arne Jönsson
Friday, 9 September 2016 17:00 Market Street Foyer Show & Tell Session 3   Fri-S&T-3-1 2012 Zara: An Empathetic Interactive Virtual Agent Pascale Fung, Anik Dey, Farhad Bin Siddique, Ruixi Lin, Yang Yang, Wan Yan, Ricky Ho Yin Chan
Friday, 9 September 2016 17:00 Market Street Foyer Show & Tell Session 3   Fri-S&T-3-2 2013 Measuring Pronunciation Improvement in Users of CAPT Tool TipTopTalk! Cristian Tejedor-García, David Escudero-Mancebo, Enrique Cámara-Arenas, César González-Ferreras, Valentín Cardeñoso-Payo
Friday, 9 September 2016 17:00 Market Street Foyer Show & Tell Session 3   Fri-S&T-3-3 2014 SparkNG: Interactive MATLAB Tools for Introduction to Speech Production, Perception and Processing Fundamentals and Application of the Aliasing-Free L-F Model Component Hideki Kawahara
Friday, 9 September 2016 17:00 Market Street Foyer Show & Tell Session 3   Fri-S&T-3-4 2015 Real-Time Tracking of Speakers’ Emotions, States, and Traits on Mobile Platforms Erik Marchi, Florian Eyben, Gerhard Hagerer, Björn Schuller
Saturday, 10 September 2016 08:00 -- 08:25 Grand Ballroom ABC Special Event: Mindfulness   Sat-SE-1 4002 Mindfulness Special Event Nikki Mirghafori
Saturday, 10 September 2016 08:30 - 9:30 Grand Ballroom ABC Keynote 2: Edward Chang ORAL Sat-Keynote-2 3002 The Human Speech Cortex Edward Chang
Saturday, 10 September 2016 10:00 - 12:00 Grand Ballroom A Special Event: Speaker Comparison for Forensic and Investigative Applications II   Sat-SE-2 4003 Speaker Comparison for Forensic and Investigative Applications II Jean-François Bonastre, Joseph P. Campbell, Anders Eriksson, Hiro Nakasone, Reva Schwartz
Saturday, 10 September 2016 10:00 Grand Ballroom BC Special Session: Clinical and Neuroscience-Inspired Vocal Biomarkers of Neurological and Psychiatric Disorders ORAL Sat-O-4-2-1 1073 Acoustic-Prosodic and Turn-Taking Features in Interactions with Children with Neurodevelopmental Disorders Daniel Bone, Somer Bishop, Rahul Gupta, Sungbok Lee, Shrikanth S. Narayanan
Saturday, 10 September 2016 10:15 Grand Ballroom BC Special Session: Clinical and Neuroscience-Inspired Vocal Biomarkers of Neurological and Psychiatric Disorders ORAL Sat-O-4-2-2 1062 Automatic Detection of Parkinson’s Disease Based on Modulated Vowels Daria Hemmerling, Juan Rafael Orozco-Arroyave, Andrzej Skalski, Janusz Gajda, Elmar Nöth
Saturday, 10 September 2016 10:30 Grand Ballroom BC Special Session: Clinical and Neuroscience-Inspired Vocal Biomarkers of Neurological and Psychiatric Disorders ORAL Sat-O-4-2-3 1542 Towards Automatic Detection of Amyotrophic Lateral Sclerosis from Speech Acoustic and Articulatory Samples Jun Wang, Prasanna V. Kothalkar, Beiming Cao, Daragh Heitzman
Saturday, 10 September 2016 10:45 Grand Ballroom BC Special Session: Clinical and Neuroscience-Inspired Vocal Biomarkers of Neurological and Psychiatric Disorders ORAL Sat-O-4-2-4 292 Neurophysiological Vocal Source Modeling for Biomarkers of Disease Gregory Ciccarelli, Thomas F. Quatieri, Satrajit S. Ghosh
Saturday, 10 September 2016 11:00 Grand Ballroom BC Special Session: Clinical and Neuroscience-Inspired Vocal Biomarkers of Neurological and Psychiatric Disorders ORAL Sat-O-4-2-5 403 Relation of Automatically Extracted Formant Trajectories with Intelligibility Loss and Speaking Rate Decline in Amyotrophic Lateral Sclerosis Rachelle L. Horwitz-Martin, Thomas F. Quatieri, Adam C. Lammert, James R. Williamson, Yana Yunusova, Elizabeth Godoy, Daryush D. Mehta, Jordan R. Green
Saturday, 10 September 2016 11:15 Grand Ballroom BC Special Session: Clinical and Neuroscience-Inspired Vocal Biomarkers of Neurological and Psychiatric Disorders ORAL Sat-O-4-2-6 766 Automatic Analysis of Typical and Atypical Encoding of Spontaneous Emotion in the Voice of Children Fabien Ringeval, Erik Marchi, Charline Grossard, Jean Xavier, Mohamed Chetouani, David Cohen, Björn Schuller
Saturday, 10 September 2016 11:30 Grand Ballroom BC Special Session: Clinical and Neuroscience-Inspired Vocal Biomarkers of Neurological and Psychiatric Disorders ORAL Sat-O-4-2-7 837 Recognition of Depression in Bipolar Disorder: Leveraging Cohort and Person-Specific Knowledge Soheil Khorram, John Gideon, Melvin McInnis, Emily Mower Provost
Saturday, 10 September 2016 11:45 Grand Ballroom BC Special Session: Clinical and Neuroscience-Inspired Vocal Biomarkers of Neurological and Psychiatric Disorders ORAL Sat-O-4-2-8 857 Diagnosing People with Dementia Using Automatic Conversation Analysis Bahman Mirheidari, Daniel Blackburn, Markus Reuber, Traci Walker, Heidi Christensen
Saturday, 10 September 2016 10:00 Bayview A Special Session: Singing Synthesis Challenge: Fill-In the Gap ORAL Sat-O-4-3-1 484 SERAPHIM: A Wavetable Synthesis System with 3D Lip Animation for Real-Time Speech and Singing Applications on Mobile Platforms Paul Yaozhu Chan, Minghui Dong, Grace Xue Hui Ho, Haizhou Li
Saturday, 10 September 2016 10:15 Bayview A Special Session: Singing Synthesis Challenge: Fill-In the Gap ORAL Sat-O-4-3-2 872 Expressive Singing Synthesis Based on Unit Selection for the Singing Synthesis Challenge 2016 Jordi Bonada, Martí Umbert, Merlijn Blaauw
Saturday, 10 September 2016 10:30 Bayview A Special Session: Singing Synthesis Challenge: Fill-In the Gap ORAL Sat-O-4-3-3 1096 Vocal Effort Modification for Singing Synthesis Olivier Perrotin, Christophe d’Alessandro
Saturday, 10 September 2016 10:45 Bayview A Special Session: Singing Synthesis Challenge: Fill-In the Gap ORAL Sat-O-4-3-4 1123 Bertsokantari: a TTS Based Singing Synthesis System Eder del Blanco, Inma Hernaez, Eva Navas, Xabier Sarasola, D. Erro
Saturday, 10 September 2016 11:00 Bayview A Special Session: Singing Synthesis Challenge: Fill-In the Gap ORAL Sat-O-4-3-5 1248 Evaluation of Singing Synthesis: Methodology and Case Study with Concatenative and Performative Systems Lionel Feugère, Christophe d’Alessandro, Samuel Delalez, Luc Ardaillon, Axel Roebel
Saturday, 10 September 2016 11:15 Bayview A Special Session: Singing Synthesis Challenge: Fill-In the Gap ORAL Sat-O-4-3-6 1317 Expressive Control of Singing Voice Synthesis Using Musical Contexts and a Parametric F0 Model Luc Ardaillon, Celine Chabot-Canet, Axel Roebel
Saturday, 10 September 2016 11:30 Bayview A Special Session: Singing Synthesis Challenge: Fill-In the Gap ORAL Sat-O-4-3-7 1390 Optimal Unit Stitching in a Unit Selection Singing Synthesis System Marius Cotescu
Saturday, 10 September 2016 10:00 Bayview B Conversation and Interaction ORAL Sat-O-4-4-1 1456 The Perception of Overlapping Speech: Effects of Speaker Prosody and Listener Attitudes Katherine Hilton
Saturday, 10 September 2016 10:20 Bayview B Conversation and Interaction ORAL Sat-O-4-4-2 585 Who Do You Think Will Speak Next? Perception of Turn-Taking Cues in Slovak and Argentine Spanish Agustín Gravano, Pablo Brusco, Štefan Beňuš
Saturday, 10 September 2016 10:40 Bayview B Conversation and Interaction ORAL Sat-O-4-4-3 587 Disentrainment may be a Positive Thing: A Novel Measure of Unsigned Acoustic-Prosodic Synchrony, and its Relation to Speaker Engagement Juan M. Pérez, Ramiro H. Gálvez, Agustín Gravano
Saturday, 10 September 2016 11:00 Bayview B Conversation and Interaction ORAL Sat-O-4-4-4 346 Respiratory Turn-Taking Cues Marcin Włodarczak, Mattias Heldner
Saturday, 10 September 2016 11:20 Bayview B Conversation and Interaction ORAL Sat-O-4-4-5 547 The Discourse Marker “so” in Turn-Taking and Turn-Releasing Behavior Emma Rennie, Rebecca Lunsford, Peter A. Heeman
Saturday, 10 September 2016 11:40 Bayview B Conversation and Interaction ORAL Sat-O-4-4-6 132 Acoustic Properties of Formality in Conversational Japanese Ethan Sherr-Ziarko
Saturday, 10 September 2016 10:00 Seacliff BCD Automatic Learning of Representations ORAL Sat-O-4-5-1 1299 Inferring Phonemic Classes from CNN Activation Maps Using Clustering Techniques Thomas Pellegrini, Sandrine Mouysset
Saturday, 10 September 2016 10:20 Seacliff BCD Automatic Learning of Representations ORAL Sat-O-4-5-2 811 Joint Learning of Speaker and Phonetic Similarities with Siamese Networks Neil Zeghidour, Gabriel Synnaeve, Nicolas Usunier, Emmanuel Dupoux
Saturday, 10 September 2016 10:40 Seacliff BCD Automatic Learning of Representations ORAL Sat-O-4-5-3 1374 Unsupervised Learning of Acoustic Units Using Autoencoders and Kohonen Nets Vikramjit Mitra, Dimitra Vergyri, Horacio Franco
Saturday, 10 September 2016 11:00 Seacliff BCD Automatic Learning of Representations ORAL Sat-O-4-5-4 256 Learning Multiscale Features Directly from Waveforms Zhenyao Zhu, Jesse H. Engel, Awni Hannun
Saturday, 10 September 2016 11:20 Seacliff BCD Automatic Learning of Representations ORAL Sat-O-4-5-5 988 Supervised Learning of Acoustic Models in a Zero Resource Setting to Improve DPGMM Clustering Michael Heck, Sakriani Sakti, Satoshi Nakamura
Saturday, 10 September 2016 11:40 Seacliff BCD Automatic Learning of Representations ORAL Sat-O-4-5-6 1099 Semi-Supervised and Cross-Lingual Knowledge Transfer Learnings for DNN Hybrid Acoustic Models Under Low-Resource Conditions Haihua Xu, Hang Su, Chongjia Ni, Xiong Xiao, Hao Huang, Eng Siong Chng, Haizhou Li
Saturday, 10 September 2016 10:00 Seacliff A Language Modeling for Conversational Speech and Confidence Measures ORAL Sat-O-4-6-1 562 Recurrent Out-of-Vocabulary Word Detection Using Distribution of Features Taichi Asami, Ryo Masumura, Yushi Aono, Koichi Shinoda
Saturday, 10 September 2016 10:20 Seacliff A Language Modeling for Conversational Speech and Confidence Measures ORAL Sat-O-4-6-2 72 Investigation of Semi-Supervised Acoustic Model Training Based on the Committee of Heterogeneous Neural Networks Naoyuki Kanda, Shoji Harada, Xugang Lu, Hisashi Kawai
Saturday, 10 September 2016 10:40 Seacliff A Language Modeling for Conversational Speech and Confidence Measures ORAL Sat-O-4-6-3 784 Acoustic Word Embeddings for ASR Error Detection Sahar Ghannay, Yannick Estève, Nathalie Camelin, Paul deléglise
Saturday, 10 September 2016 11:00 Seacliff A Language Modeling for Conversational Speech and Confidence Measures ORAL Sat-O-4-6-4 1250 Combining Semantic Word Classes and Sub-Word Unit Speech Recognition for Robust OOV Detection Axel Horndasch, Anton Batliner, Caroline Kaufhold, Elmar Nöth
Saturday, 10 September 2016 11:20 Seacliff A Language Modeling for Conversational Speech and Confidence Measures ORAL Sat-O-4-6-5 45 Web Data Selection Based on Word Embedding for Low-Resource Speech Recognition Chuandong Xie, Wu Guo, Guoping Hu, Junhua Liu
Saturday, 10 September 2016 11:40 Seacliff A Language Modeling for Conversational Speech and Confidence Measures ORAL Sat-O-4-6-6 788 Colloquialising Modern Standard Arabic Text for Improved Speech Recognition Sarah Al-Shareef, Thomas Hain
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster A Topics in Speech Perception POSTER Sat-P-4-1-1 1483 Pitch-Range Perception: The Dynamic Interaction Between Voice Quality and Fundamental Frequency Jianjing Kuang, Mark Liberman
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster A Topics in Speech Perception POSTER Sat-P-4-1-2 66 Comparing the Contributions of Amplitude and Phase to Speech Intelligibility in a Vocoder-Based Speech Synthesis Model Fei Chen, Benson C.L. Chiao
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster A Topics in Speech Perception POSTER Sat-P-4-1-3 9 Modeling Noise Influence to Speech Intelligibility Non-Intrusively by Reduced Speech Dynamic Range Fei Chen
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster A Topics in Speech Perception POSTER Sat-P-4-1-4 325 Do GMM Phoneme Classifiers Perceive Synthetic Sibilants as Humans Do? Gábor Pintér, Hiroki Watanabe
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster A Topics in Speech Perception POSTER Sat-P-4-1-5 1327 Neural Responses to Speech-Specific Modulations Derived from a Spectro-Temporal Filter Bank Marina Frye, Cristiano Micheli, Inga M. Schepers, Gerwin Schalk, Jochem W. Rieger, Bernd T. Meyer
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster A Topics in Speech Perception POSTER Sat-P-4-1-6 967 Comparing Different Methods for Analyzing ERP Signals Kimberley Mulder, Louis ten Bosch, Lou Boves
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster A Topics in Speech Perception POSTER Sat-P-4-1-7 973 Supplementary Motor Area Activation in Disfluency Perception: An fMRI Study of Listener Neural Responses to Spontaneously Produced Unfilled and Filled Pauses Robert Eklund, Martin Ingvar
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster A Topics in Speech Perception POSTER Sat-P-4-1-8 28 Vowel Fundamental and Formant Frequency Contributions to English and Mandarin Sentence Intelligibility Daniel Fogerty, Fei Chen
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster B Behavioral Signal Processing and Speaker State and Traits Analytics POSTER Sat-P-4-2-1 448 Attention Assisted Discovery of Sub-Utterance Structure in Speech Emotion Recognition Che-Wei Huang, Shrikanth S. Narayanan
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster B Behavioral Signal Processing and Speaker State and Traits Analytics POSTER Sat-P-4-2-2 324 Combining CNN and BLSTM to Extract Textual and Acoustic Features for Recognizing Stances in Mandarin Ideological Debate Competition Linchuan Li, Zhiyong Wu, Mingxing Xu, Helen Meng, Lianhong Cai
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster B Behavioral Signal Processing and Speaker State and Traits Analytics POSTER Sat-P-4-2-3 1064 Inter-Speech Clicks in an Interspeech Keynote Jürgen Trouvain, Zofia Malisz
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster B Behavioral Signal Processing and Speaker State and Traits Analytics POSTER Sat-P-4-2-4 1118 Speaker Age Classification and Regression Using i-Vectors Joanna Grzybowska, Stanisław Kacprzak
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster B Behavioral Signal Processing and Speaker State and Traits Analytics POSTER Sat-P-4-2-5 1217 Sparsely Connected and Disjointly Trained Deep Neural Networks for Low Resource Behavioral Annotation: Acoustic Classification in Couples’ Therapy Haoqi Li, Brian Baucom, Panayiotis Georgiou
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster B Behavioral Signal Processing and Speaker State and Traits Analytics POSTER Sat-P-4-2-6 1328 Automatically Classifying Self-Rated Personality Scores from Speech Guozhen An, Sarah Ita Levitan, Rivka Levitan, Andrew Rosenberg, Michelle Levine, Julia Hirschberg
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster B Behavioral Signal Processing and Speaker State and Traits Analytics POSTER Sat-P-4-2-7 146 Estimation of Children’s Physical Characteristics from Their Voices Jill Fain Lehman, Rita Singh
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster B Behavioral Signal Processing and Speaker State and Traits Analytics POSTER Sat-P-4-2-8 1623 Talking to a System and Talking to a Human: A Study from a Speech-to-Speech, Machine Translation Mediated Map Task Hayakawa Akira, Saturnino Luz, Nick Campbell
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster B Behavioral Signal Processing and Speaker State and Traits Analytics POSTER Sat-P-4-2-9 187 Predicting Affective Dimensions Based on Self Assessed Depression Severity Rahul Gupta, Shrikanth S. Narayanan
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster B Behavioral Signal Processing and Speaker State and Traits Analytics POSTER Sat-P-4-2-10 400 Enhancement of Automatic Oral Presentation Assessment System Using Latent N-Grams Word Representation and Part-of-Speech Information Wen-Yu Huang, Shan-Wen Hsiao, Hung-Ching Sun, Ming-Chuan Hsieh, Ming-Hsueh Tsai, Chi-Chun Lee
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster B Behavioral Signal Processing and Speaker State and Traits Analytics POSTER Sat-P-4-2-11 1114 Use of Vowels in Discriminating Speech-Laugh from Laughter and Neutral Speech Sri Harsha Dumpala, P. Gangamohan, Suryakanth V. Gangashetty, B. Yegnanarayana
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster B Behavioral Signal Processing and Speaker State and Traits Analytics POSTER Sat-P-4-2-12 498 A Convex Model for Linguistic Influence in Group Conversations Kan Kawabata, Visar Berisha, Anna Scaglione, Amy LaCross
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster B Behavioral Signal Processing and Speaker State and Traits Analytics POSTER Sat-P-4-2-13 554 A Deep Learning Approach to Modeling Empathy in Addiction Counseling James Gibson, Doğan Can, Bo Xiao, Zac E. Imel, David C. Atkins, Panayiotis Georgiou, Shrikanth S. Narayanan
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster B Behavioral Signal Processing and Speaker State and Traits Analytics POSTER Sat-P-4-2-14 620 Unipolar Depression vs. Bipolar Disorder: An Elicitation-Based Approach to Short-Term Detection of Mood Disorder Kun-Yi Huang, Chung-Hsien Wu, Yu-Ting Kuo, Fong-Lin Jang
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sat-P-4-3-1 1320 Conditional Random Fields for the Tunisian Dialect Grapheme-to-Phoneme Conversion Abir Masmoudi, Mariem Ellouze, Fethi Bougares, Yannick Esètve, Lamia Belguith
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sat-P-4-3-2 621 Efficient Thai Grapheme-to-Phoneme Conversion Using CRF-Based Joint Sequence Modeling Sittipong Saychum, Sarawoot Kongyoung, Anocha Rugchatjaroen, Patcharika Chootrakool, Sawit Kasuriya, Chai Wutiwiwatchai
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sat-P-4-3-3 385 An Articulatory-Based Singing Voice Synthesis Using Tongue and Lips Imaging Aurore Jaumard-Hakoun, Kele Xu, Clémence Leboullenger, Pierre Roussel-Ragot, Bruce Denby
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sat-P-4-3-4 363 Phoneme Embedding and its Application to Speech Driven Talking Avatar Synthesis Xu Li, Zhiyong Wu, Helen Meng, Jia Jia, Xiaoyan Lou, Lianhong Cai
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sat-P-4-3-5 364 Expressive Speech Driven Talking Avatar Synthesis with DBLSTM Using Limited Amount of Emotional Bimodal Data Xu Li, Zhiyong Wu, Helen Meng, Jia Jia, Xiaoyan Lou, Lianhong Cai
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sat-P-4-3-6 483 Audio-to-Visual Speech Conversion Using Deep Neural Networks Sarah Taylor, Akihiro Kato, Iain Matthews, Ben Milner
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sat-P-4-3-7 1105 Generative Acoustic-Phonemic-Speaker Model Based on Three-Way Restricted Boltzmann Machine Toru Nakashika, Yasuhiro Minami
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sat-P-4-3-8 596 Articulatory Synthesis Based on Real-Time Magnetic Resonance Imaging Data Asterios Toutios, Tanner Sorensen, Krishna Somandepalli, Rachel Alexander, Shrikanth S. Narayanan
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sat-P-4-3-9 659 Deep Neural Network Based Acoustic-to-Articulatory Inversion Using Phone Sequence Information Xurong Xie, Xunying Liu, Lan Wang
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sat-P-4-3-10 715 Articulatory-to-Acoustic Conversion with Cascaded Prediction of Spectral and Excitation Features Using Neural Networks Zheng-Chen Liu, Zhen-Hua Ling, Li-Rong Dai
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sat-P-4-3-11 1336 Generating Gestural Scores from Acoustics Through a Sparse Anchor-Based Representation of Speech Christopher Liberatore, Ricardo Gutierrez-Osuna
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sat-P-4-3-12 1222 On the Suitability of Vocalic Sandwiches in a Corpus-Based TTS Engine David Guennec, Damien Lolive
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sat-P-4-3-13 273 Unsupervised Stress Information Labeling Using Gaussian Process Latent Variable Model for Statistical Speech Synthesis Decha Moungsri, Tomoki Koriyama, Takao Kobayashi
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sat-P-4-3-14 1607 Using Zero-Frequency Resonator to Extract Multilingual Intonation Structure Jinfu Ni, Yoshinori Shiga, Hisashi Kawai
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster D Resources and Annotation of Resources POSTER Sat-P-4-4-1 873 A DNN-HMM Approach to Story Segmentation Jia Yu, Xiong Xiao, Lei Xie, Eng Siong Chng, Haizhou Li
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster D Resources and Annotation of Resources POSTER Sat-P-4-4-2 1003 The SIWIS Database: A Multilingual Speech Database with Acted Emphasis Jean-Philippe Goldman, Pierre-Edouard Honnet, Rob Clark, Philip N. Garner, Maria Ivanova, Alexandros Lazaridis, Hui Liang, Tiago Macedo, Beat Pfister, Manuel Sam Ribeiro, Eric Wehrli, Junichi Yamagishi
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster D Resources and Annotation of Resources POSTER Sat-P-4-4-3 48 Open Source Speech and Language Resources for Frisian Emre Yılmaz, Henk van den Heuvel, Jelske Dijkstra, Hans Van de Velde, Frederik Kampstra, Jouke Algra, David Van Leeuwen
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster D Resources and Annotation of Resources POSTER Sat-P-4-4-4 1141 The SRI CLEO Speaker-State Corpus Andreas Kathol, Elizabeth Shriberg, Massimilano de Zambotti
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster D Resources and Annotation of Resources POSTER Sat-P-4-4-5 139 SingaKids-Mandarin: Speech Corpus of Singaporean Children Speaking Mandarin Chinese Nancy F. Chen, Rong Tong, Darren Wee, Peixuan Lee, Bin Ma, Haizhou Li
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster D Resources and Annotation of Resources POSTER Sat-P-4-4-6 1541 The SRI Speech-Based Collaborative Learning Corpus Colleen Richey, Cynthia D’Angelo, Nonye Alozie, Harry Bratt, Elizabeth Shriberg
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster D Resources and Annotation of Resources POSTER Sat-P-4-4-7 270 An Expectation Maximization Approach to Joint Modeling of Multidimensional Ratings Derived from Multiple Annotators Anil Ramakrishna, Rahul Gupta, Ruth B. Grossman, Shrikanth S. Narayanan
Saturday, 10 September 2016 10:00 Pacific Concourse - Poster D Resources and Annotation of Resources POSTER Sat-P-4-4-8 442 Voting Detector: A Combination of Anomaly Detectors to Reveal Annotation Errors in TTS Corpora Jindřich Matoušek, Daniel Tihelka
Saturday, 10 September 2016 10:00 Market Street Foyer Show & Tell Session 4   Sat-S&T-4-1 2017 The Magic Stone: A Video Game to Improve Communication Skills of People with Intellectual Disabilities Mario Corrales-Astorgano, David Escudero-Mancebo, César González-Ferreras, Yurena Gutiérrez-González, Valle Flores-Lucas, Valentín Cardeñoso-Payo, Lourdes Aguilar-Cuevas
Saturday, 10 September 2016 10:00 Market Street Foyer Show & Tell Session 4   Sat-S&T-4-2 2018 Identifying Perceptually Similar Voices with a Speaker Recognition System Using Auto-Phonetic Features Finnian Kelly, Anil Alexander, Oscar Forth, Samuel Kent, Jonas Lindh, Joel Åkesson
Saturday, 10 September 2016 10:00 Market Street Foyer Show & Tell Session 4   Sat-S&T-4-3 2019 A Real-Time Framework for Visual Feedback of Articulatory Data Using Statistical Shape Models Kristy James, Alexander Hewer, Ingmar Steiner, Stefanie Wuhrer
Saturday, 10 September 2016 10:00 Market Street Foyer Show & Tell Session 4   Sat-S&T-4-4 2020 Flexible, Rapid Authoring of Goal-Orientated, Multi-Turn Dialogues Using the Task Completion Platform Alex Marin, Paul Crook, Omar Zia Khan, Vasiliy Radostev, Khushboo Aggarwal, Ruhi Sarikaya
Saturday, 10 September 2016 13:30 Grand Ballroom A Acoustic Model Adaptation ORAL Sat-O-5-1-1 203 Context Adaptive Neural Network for Rapid Adaptation of Deep CNN Based Acoustic Models Marc Delcroix, Keisuke Kinoshita, Atsunori Ogawa, Takuya Yoshioka, Dung T. Tran, Tomohiro Nakatani
Saturday, 10 September 2016 13:50 Grand Ballroom A Acoustic Model Adaptation ORAL Sat-O-5-1-2 250 Transfer Learning with Bottleneck Feature Networks for Whispered Speech Recognition Boon Pang Lim, Faith Wong, Yuyao Li, Jia Wei Bay
Saturday, 10 September 2016 14:10 Grand Ballroom A Acoustic Model Adaptation ORAL Sat-O-5-1-3 600 Adaptation of Neural Networks Constrained by Prior Statistics of Node Co-Activations Tasha Nagamine, Zhuo Chen, Nima Mesgarani
Saturday, 10 September 2016 14:30 Grand Ballroom A Acoustic Model Adaptation ORAL Sat-O-5-1-4 1161 Domain Adaptation of CNN Based Acoustic Models Under Limited Resource Settings Masayuki Suzuki, Ryuki Tachibana, Samuel Thomas, Bhuvana Ramabhadran, George Saon
Saturday, 10 September 2016 14:50 Grand Ballroom A Acoustic Model Adaptation ORAL Sat-O-5-1-5 1249 Subspace LHUC for Fast Adaptation of Deep Neural Network Acoustic Models Lahiru Samarakoon, Khe Chai Sim
Saturday, 10 September 2016 15:10 Grand Ballroom A Acoustic Model Adaptation ORAL Sat-O-5-1-6 1348 Improving Children’s Speech Recognition Through Out-of-Domain Data Augmentation Joachim Fainberg, Peter Bell, Mike Lincoln, Steve Renals
Saturday, 10 September 2016 13:30 Grand Ballroom BC Special Session: Sharing Research and Education Resources for Understanding Speech Processing ORAL Sat-O-5-2-1 997 Virtual Machines and Containers as a Platform for Experimentation Florian Metze, Eric Riebling, Anne S. Warlaumont, Elika Bergelson
Saturday, 10 September 2016 13:45 Grand Ballroom BC Special Session: Sharing Research and Education Resources for Understanding Speech Processing ORAL Sat-O-5-2-2 148 CloudCAST - Remote Speech Technology for Speech Professionals Phil Green, Ricard Marxer, Stuart Cunningham, Heidi Christensen, Frank Rudzicz, Maria Yancheva, André Coy, Massimiliano Malavasi, Lorenzo Desideri, Fabio Tamburini
Saturday, 10 September 2016 14:00 Grand Ballroom BC Special Session: Sharing Research and Education Resources for Understanding Speech Processing ORAL Sat-O-5-2-3 700 webASR 2 - Improved Cloud Based Speech Technology Thomas Hain, Jeremy Christian, Oscar Saz, Salil Deena, Madina Hasan, Raymond W.M. Ng, Rosanna Milner, Mortaza Doulaty, Yulan Liu
Saturday, 10 September 2016 14:15 Grand Ballroom BC Special Session: Sharing Research and Education Resources for Understanding Speech Processing ORAL Sat-O-5-2-4 1540 Sharing Speech Synthesis Software for Research and Education Within Low-Tech and Low-Resource Communities Andrew R. Plummer, Mary E. Beckman
Saturday, 10 September 2016 14:30 Grand Ballroom BC Special Session: Sharing Research and Education Resources for Understanding Speech Processing ORAL Sat-O-5-2-5 524 The Berkeley Phonetics Machine Ronald L. Sprouse, Keith Johnson
Saturday, 10 September 2016 14:45 Grand Ballroom BC Special Session: Sharing Research and Education Resources for Understanding Speech Processing ORAL Sat-O-5-2-6 1223 Experiences with Shared Resources for Research and Education in Speech and Language Processing Rebecca Bates, Eric Fosler-Lussier, Florian Metze, Martha Larson, Gina-Anne Levow, Emily Mower Provost
Saturday, 10 September 2016 15:00 Grand Ballroom BC Special Session: Sharing Research and Education Resources for Understanding Speech Processing ORAL Sat-O-5-2-7 4001 Panel and Audience Discussion: How do we Develop, Disseminate, and Sustain Shared Resources from User and Developer Perspectives?  
Saturday, 10 September 2016 13:30 Bayview A Special Session: Voice Conversion Challenge ORAL Sat-O-5-3-1 1066 The Voice Conversion Challenge 2016 Tomoki Toda, Ling-Hui Chen, Daisuke Saito, Fernando Villavicencio, Mirjam Wester, Zhizheng Wu, Junichi Yamagishi
Saturday, 10 September 2016 13:45 Bayview A Special Session: Voice Conversion Challenge ORAL Sat-O-5-3-2 1331 Analysis of the Voice Conversion Challenge 2016 Evaluation Results Mirjam Wester, Zhizheng Wu, Junichi Yamagishi
Saturday, 10 September 2016 14:00 Bayview A Special Session: Voice Conversion Challenge ORAL Sat-O-5-3-3 456 The USTC System for Voice Conversion Challenge 2016: Neural Network Based Approaches for Spectrum, Aperiodicity and F0 Conversion Ling-Hui Chen, Li-Juan Liu, Zhen-Hua Ling, Yuan Jiang, Li-Rong Dai
Saturday, 10 September 2016 14:15 Bayview A Special Session: Voice Conversion Challenge ORAL Sat-O-5-3-4 1437 A Voice Conversion Mapping Function Based on a Stacked Joint-Autoencoder Seyed Hamidreza Mohammadi, Alexander Kain
Saturday, 10 September 2016 14:30 Bayview A Special Session: Voice Conversion Challenge ORAL Sat-O-5-3-5 567 Locally Linear Embedding for Exemplar-Based Spectral Conversion Yi-Chiao Wu, Hsin-Te Hwang, Chin-Cheng Hsu, Yu Tsao, Hsin-Min Wang
Saturday, 10 September 2016 14:45 Bayview A Special Session: Voice Conversion Challenge ORAL Sat-O-5-3-6 305 Applying Spectral Normalisation and Efficient Envelope Estimation and Statistical Transformation for the Voice Conversion Challenge 2016 Fernando Villavicencio, Junichi Yamagishi, Jordi Bonada, Felipe Espic
Saturday, 10 September 2016 15:00 Bayview A Special Session: Voice Conversion Challenge ORAL Sat-O-5-3-7 219 ML Parameter Generation with a Reformulated MGE Training Criterion - Participation in the Voice Conversion Challenge 2016 D. Erro, A. Alonso, L. Serrano, D. Tavarez, I. Odriozola, Xabier Sarasola, Eder del Blanco, J. Sanchez, I. Saratxaga, Eva Navas, Inma Hernaez
Saturday, 10 September 2016 15:15 Bayview A Special Session: Voice Conversion Challenge ORAL Sat-O-5-3-8 970 The NU-NAIST Voice Conversion System for the Voice Conversion Challenge 2016 Kazuhiro Kobayashi, Shinnosuke Takamichi, Satoshi Nakamura, Tomoki Toda
Saturday, 10 September 2016 13:30 Bayview B Intelligibility and Masking ORAL Sat-O-5-4-1 1571 Release from Energetic Masking Caused by Repeated Patterns of Glimpsing Windows Maury Lander-Portnoy
Saturday, 10 September 2016 13:50 Bayview B Intelligibility and Masking ORAL Sat-O-5-4-2 1587 Glimpsing Predictions for Natural and Vocoded Sentence Intelligibility During Modulation Masking: Effect of the Glimpse Cutoff Criterion Bobby Gibbs II, Daniel Fogerty
Saturday, 10 September 2016 14:10 Bayview B Intelligibility and Masking ORAL Sat-O-5-4-3 171 Temporal Envelopes in Sine-Wave Speech Recognition Li Xu
Saturday, 10 September 2016 14:30 Bayview B Intelligibility and Masking ORAL Sat-O-5-4-4 176 Understanding Periodically Interrupted Mandarin Speech Jing Liu, Rosanna H.N. Tong, Fei Chen
Saturday, 10 September 2016 14:50 Bayview B Intelligibility and Masking ORAL Sat-O-5-4-5 4 Factors Affecting the Intelligibility of Sine-Wave Speech Fei Chen, Daniel Fogerty
Saturday, 10 September 2016 15:10 Bayview B Intelligibility and Masking ORAL Sat-O-5-4-6 1618 Effects of Urgent Speech and Preceding Sounds on Speech Intelligibility in Noisy and Reverberant Environments Nao Hodoshima
Saturday, 10 September 2016 13:30 Seacliff BCD Robust Speaker Recognition and Anti-Spoofing ORAL Sat-O-5-5-1 1280 Integrated Spoofing Countermeasures and Automatic Speaker Verification: An Evaluation on ASVspoof 2015 Md. Sahidullah, Héctor Delgado, Massimiliano Todisco, Hong Yu, Tomi Kinnunen, Nicholas Evans, Zheng-Hua Tan
Saturday, 10 September 2016 13:50 Seacliff BCD Robust Speaker Recognition and Anti-Spoofing ORAL Sat-O-5-5-2 1326 Cross-Database Evaluation of Audio-Based Spoofing Detection Systems Pavel Korshunov, Sébastien Marcel
Saturday, 10 September 2016 14:10 Seacliff BCD Robust Speaker Recognition and Anti-Spoofing ORAL Sat-O-5-5-3 844 Investigation of Sub-Band Discriminative Information Between Spoofed and Genuine Speech Kaavya Sriskandaraja, Vidhyasaharan Sethu, Phu Ngoc Le, Eliathamby Ambikairajah
Saturday, 10 September 2016 14:30 Seacliff BCD Robust Speaker Recognition and Anti-Spoofing ORAL Sat-O-5-5-4 743 An Investigation of Spoofing Speech Detection Under Additive Noise and Reverberant Conditions Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li
Saturday, 10 September 2016 14:50 Seacliff BCD Robust Speaker Recognition and Anti-Spoofing ORAL Sat-O-5-5-5 1153 Robust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech Md. Sahidullah, Rosa Gonzalez Hautamäki, Dennis Alexander Lehmann Thomsen, Tomi Kinnunen, Zheng-Hua Tan, Ville Hautamäki, Robert Parts, Martti Pitkänen
Saturday, 10 September 2016 15:10 Seacliff BCD Robust Speaker Recognition and Anti-Spoofing ORAL Sat-O-5-5-6 650 Statistical Modeling of Speaker’s Voice with Temporal Co-Location for Active Voice Authentication Zhong Meng, Biing-Hwang Juang
Saturday, 10 September 2016 13:30 Seacliff A Speech Enhancement and Applications ORAL Sat-O-5-6-1 245 Joint Enhancement and Coding of Speech by Incorporating Wiener Filtering in a CELP Codec Johannes Fischer, Tom Bäckström
Saturday, 10 September 2016 13:50 Seacliff A Speech Enhancement and Applications ORAL Sat-O-5-6-2 729 Multi-Channel Linear Prediction Based on Binaural Coherence for Speech Dereverberation Hong Liu, Xiuling Wang, Miao Sun, Cheng Pang
Saturday, 10 September 2016 14:10 Seacliff A Speech Enhancement and Applications ORAL Sat-O-5-6-3 234 Single-Channel Speech Enhancement Using Double Spectrum Martin Blass, Pejman Mowlaee, W. Bastiaan Kleijn
Saturday, 10 September 2016 14:30 Seacliff A Speech Enhancement and Applications ORAL Sat-O-5-6-4 300 On the Appropriateness of Complex-Valued Neural Networks for Speech Enhancement Lukas Drude, Bhiksha Raj, Reinhold Haeb-Umbach
Saturday, 10 September 2016 14:50 Seacliff A Speech Enhancement and Applications ORAL Sat-O-5-6-5 350 Introducing the Turbo-Twin-HMM for Audio-Visual Speech Enhancement Steffen Zeiler, Hendrik Meutzner, Ahmed Hussen Abdelaziz, Dorothea Kolossa
Saturday, 10 September 2016 15:10 Seacliff A Speech Enhancement and Applications ORAL Sat-O-5-6-6 1318 Assessing Speech Quality in Speech-Aware Hearing Aids Based on Phoneme Posteriorgrams Constantin Spille, Hendrik Kayser, Hynek Hermansky, Bernd T. Meyer
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster A Speech Analysis POSTER Sat-P-5-1-1 153 Time-Varying Quasi-Closed-Phase Weighted Linear Prediction Analysis of Speech for Accurate Formant Detection and Tracking Dhananjaya Gowda, Paavo Alku
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster A Speech Analysis POSTER Sat-P-5-1-2 664 Improved Depiction of Tissue Boundaries in Vocal Tract Real-Time MRI Using Automatic Off-Resonance Correction Yongwan Lim, Sajan Goud Lingala, Asterios Toutios, Shrikanth S. Narayanan, Krishna S. Nayak
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster A Speech Analysis POSTER Sat-P-5-1-3 1183 Modeling and Transforming Speech Using Variational Autoencoders Merlijn Blaauw, Jordi Bonada
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster A Speech Analysis POSTER Sat-P-5-1-4 1600 Phase-Encoded Speech Spectrograms Chandra Sekhar Seelamantula
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster A Speech Analysis POSTER Sat-P-5-1-5 771 Towards Minimally Invasive Velar State Detection in Normal and Silent Speech Peter Birkholz, Petko Bakardjiev, Steffen Kürbis, Rico Petrick
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster A Speech Analysis POSTER Sat-P-5-1-6 117 RNN-BLSTM Based Multi-Pitch Estimation Jianshu Zhang, Jian Tang, Li-Rong Dai
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster A Speech Analysis POSTER Sat-P-5-1-7 140 TUSK: A Framework for Overviewing the Performance of F0 Estimators Masanori Morise, Hideki Kawahara
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster A Speech Analysis POSTER Sat-P-5-1-8 369 A Robust Non-Parametric and Filtering Based Approach for Glottal Closure Instant Detection Pradeep Rengaswamy, Gurunath Reddy M., K. Sreenivasa Rao, Pallab Dasgupta
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster B Speaker Recognition POSTER Sat-P-5-2-1 518 Analysis of Face Mask Effect on Speaker Recognition Rahim Saeidi, Ilkka Huhtakallio, Paavo Alku
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster B Speaker Recognition POSTER Sat-P-5-2-2 1282 Data Selection for Within-Class Covariance Estimation Elliot Singer, Tyler Campbell, Douglas Reynolds
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster B Speaker Recognition POSTER Sat-P-5-2-3 1179 Inter-Task System Fusion for Speaker Recognition M. Ferras, Srikanth Madikeri, S. Dey, Petr Motlicek, Hervé Bourlard
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster B Speaker Recognition POSTER Sat-P-5-2-4 1071 Mahalanobis Metric Scoring Learned from Weighted Pairwise Constraints in I-Vector Speaker Recognition System Zhenchun Lei, Yanhong Wan, Jian Luo, Yingen Yang
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster B Speaker Recognition POSTER Sat-P-5-2-5 668 Novel Subband Autoencoder Features for Detection of Spoofed Speech Meet H. Soni, Tanvina B. Patel, Hemant A. Patil
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster B Speaker Recognition POSTER Sat-P-5-2-6 1134 On the Issue of Calibration in DNN-Based Speaker Recognition Systems Mitchell McLaren, Diego Castan, Luciana Ferrer, Aaron Lawson
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster B Speaker Recognition POSTER Sat-P-5-2-7 1302 Probabilistic Approach Using Joint Long and Short Session i-Vectors Modeling to Deal with Short Utterances for Speaker Recognition Waad Ben Kheder, Driss Matrouf, Moez Ajili, Jean-François Bonastre
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster B Speaker Recognition POSTER Sat-P-5-2-8 778 Short Utterance Variance Modelling and Utterance Partitioning for PLDA Speaker Verification Ahilan Kanagasundaram, David Dean, Sridha Sridharan, Clinton Fookes, Ivan Himawan
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster B Speaker Recognition POSTER Sat-P-5-2-9 763 Speaker-Dependent Dictionary-Based Speech Enhancement for Text-Dependent Speaker Verification Nicolai Bæk Thomsen, Dennis Alexander Lehmann Thomsen, Zheng-Hua Tan, Børge Lindberg, Søren Holdt Jensen
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster B Speaker Recognition POSTER Sat-P-5-2-10 1520 Text-Available Speaker Recognition System for Forensic Applications Chengzhu Yu, Chunlei Zhang, Finnian Kelly, Abhijeet Sangwan, John H.L. Hansen
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster B Speaker Recognition POSTER Sat-P-5-2-11 432 Transfer Learning for Speaker Verification on Short Utterances Qingyang Hong, Lin Li, Lihong Wan, Jun Zhang, Feng Tong
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster B Speaker Recognition POSTER Sat-P-5-2-12 683 Twin Model G-PLDA for Duration Mismatch Compensation in Text-Independent Speaker Verification Jianbo Ma, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Kong Aik Lee
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster B Speaker Recognition POSTER Sat-P-5-2-13 65 Universal Background Sparse Coding and Multilayer Bootstrap Network for Speaker Clustering Xiao-Lei Zhang
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster B Speaker Recognition POSTER Sat-P-5-2-14 614 Improving Deep Neural Networks Based Speaker Verification Using Unlabeled Data Yao Tian, Meng Cai, Liang He, Wei-Qiang Zhang, Jia Liu
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster C Decoding, System Combination POSTER Sat-P-5-3-1 71 Maximum a posteriori Based Decoding for CTC Acoustic Models Naoyuki Kanda, Xugang Lu, Hisashi Kawai
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster C Decoding, System Combination POSTER Sat-P-5-3-2 938 Phonetic and Phonological Posterior Search Space Hashing Exploiting Class-Specific Sparsity Structures Afsaneh Asaei, Gil Luyet, Milos Cernak, Hervé Bourlard
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster C Decoding, System Combination POSTER Sat-P-5-3-3 1393 Model Compression Applied to Small-Footprint Keyword Spotting George Tucker, Minhua Wu, Ming Sun, Sankaran Panchapagesan, Gengshen Fu, Shiv Vitaladevuni
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster C Decoding, System Combination POSTER Sat-P-5-3-4 1552 Why do ASR Systems Despite Neural Nets Still Depend on Robust Features Angel Mario Castro Martinez, Marc René Schädler
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster C Decoding, System Combination POSTER Sat-P-5-3-5 1562 An Adaptive Multi-Band System for Low Power Voice Command Recognition Qing He, Gregory W. Wornell, Wei Ma
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster C Decoding, System Combination POSTER Sat-P-5-3-6 287 Memory-Efficient Modeling and Search Techniques for Hardware ASR Decoders Michael Price, Anantha Chandrakasan, James Glass
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster C Decoding, System Combination POSTER Sat-P-5-3-7 377 Log-Linear System Combination Using Structured Support Vector Machines J. Yang, Anton Ragni, Mark J.F. Gales, Kate M. Knill
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster C Decoding, System Combination POSTER Sat-P-5-3-8 1298 Efficient Segmental Cascades for Speech Recognition Hao Tang, Weiran Wang, Kevin Gimpel, Karen Livescu
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster C Decoding, System Combination POSTER Sat-P-5-3-9 1307 A WFST Framework for Single-Pass Multi-Stream Decoding Sirui Xu, Eric Fosler-Lussier
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster C Decoding, System Combination POSTER Sat-P-5-3-10 1381 Comparison of Multiple System Combination Techniques for Keyword Spotting William Hartmann, Le Zhang, Kerri Barnes, Roger Hsiao, Stavros Tsakalidis, Richard Schwartz
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster C Decoding, System Combination POSTER Sat-P-5-3-11 309 Rescoring by Combination of Posteriorgram Score and Subword-Matching Score for Use in Query-by-Example Masato Obara, Kazunori Kojima, Kazuyo Tanaka, Shi-wook Lee, Yoshiaki Itoh
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster C Decoding, System Combination POSTER Sat-P-5-3-12 831 Phone Synchronous Decoding with CTC Lattice Zhehuai Chen, Wei Deng, Tao Xu, Kai Yu
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster D Special Session: Clinical and Neuroscience-Inspired Vocal Biomarkers of Neurological and Psychiatric Disorders POSTER Sat-P-5-4-1 1566 Speech Features for Depression Detection Saurabh Sahu, Carol Espy-Wilson
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster D Special Session: Clinical and Neuroscience-Inspired Vocal Biomarkers of Neurological and Psychiatric Disorders POSTER Sat-P-5-4-2 1122 Parkinson’s Disease Progression Assessment from Speech Using GMM-UBM T. Arias-Vergara, J.C. Vasquez-Correa, Juan Rafael Orozco-Arroyave, J.F. Vargas-Bonilla, Elmar Nöth
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster D Special Session: Clinical and Neuroscience-Inspired Vocal Biomarkers of Neurological and Psychiatric Disorders POSTER Sat-P-5-4-3 100 Speech-Based Detection of Alzheimer’s Disease in Conversational German Jochen Weiner, Christian Herff, Tanja Schultz
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster D Special Session: Clinical and Neuroscience-Inspired Vocal Biomarkers of Neurological and Psychiatric Disorders POSTER Sat-P-5-4-4 1339 Cross-Cultural Depression Recognition from Vocal Biomarkers Sharifa Alghowinem, Roland Goecke, Julien Epps, Michael Wagner, Jeffrey Cohn
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster D Special Session: Clinical and Neuroscience-Inspired Vocal Biomarkers of Neurological and Psychiatric Disorders POSTER Sat-P-5-4-5 1228 Speech Recognition in Alzheimer’s Disease and in its Assessment Luke Zhou, Kathleen C. Fraser, Frank Rudzicz
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster D Special Session: Clinical and Neuroscience-Inspired Vocal Biomarkers of Neurological and Psychiatric Disorders POSTER Sat-P-5-4-6 520 Does She Speak RTT? Towards an Earlier Identification of Rett Syndrome Through Intelligent Pre-Linguistic Vocalisation Analysis Florian B. Pokorny, Peter B. Marschik, Christa Einspieler, Björn Schuller
Saturday, 10 September 2016 13:30 Pacific Concourse - Poster D Special Session: Clinical and Neuroscience-Inspired Vocal Biomarkers of Neurological and Psychiatric Disorders POSTER Sat-P-5-4-7 74 Speech Rhythm in Parkinson’s Disease: A Study on Italian Massimo Pettorino, Maria Grazia Busà, Elisa Pellegrino
Saturday, 10 September 2016 13:30 Market Street Foyer Show & Tell Session 5   Sat-S&T-5-1 2023 English Language Speech Assistant Xavier Anguera, Vu Van
Saturday, 10 September 2016 13:30 Market Street Foyer Show & Tell Session 5   Sat-S&T-5-2 2024 Remeeting - Deep Insights to Conversations Allen Guo, Arlo Faria, Korbinian Riedhammer
Saturday, 10 September 2016 13:30 Market Street Foyer Show & Tell Session 5   Sat-S&T-5-3 2025 SERAPHIM Live! - Singing Synthesis for the Performer, the Composer, and the 3D Game Developer Paul Yaozhu Chan, Minghui Dong, Grace Xue Hui Ho, Haizhou Li
Saturday, 10 September 2016 13:30 Market Street Foyer Show & Tell Session 5   Sat-S&T-5-4 2010 My-Own-Voice: A Web Service That Allows You to Create a Text-to-Speech Voice From Your Own Voice Fabrice Malfrere, Olivier Deroo, Emmanuelle Franques, Jonathan Hourez, Nicolas Mazars, Vincent Pagel, Geoffrey Wilfart
Sunday, 11 September 2016 08:30 - 9:30 Grand Ballroom ABC Keynote 3: Anne Fernald ORAL Sun-Keynote-3 3003 Talking with Kids Really Matters: Early Language Experience Shapes Later Life Chances Anne Fernald
Sunday, 11 September 2016 10:00 Grand Ballroom A Far-Field Speech Processing ORAL Sun-O-6-1-1 92 Reducing the Computational Complexity of Multimicrophone Acoustic Models with Integrated Feature Extraction Tara N. Sainath, Arun Narayanan, Ron J. Weiss, Ehsan Variani, Kevin W. Wilson, Michiel Bacchiani, Izhak Shafran
Sunday, 11 September 2016 10:20 Grand Ballroom A Far-Field Speech Processing ORAL Sun-O-6-1-2 173 Neural Network Adaptive Beamforming for Robust Multichannel Speech Recognition Bo Li, Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Michiel Bacchiani
Sunday, 11 September 2016 10:40 Grand Ballroom A Far-Field Speech Processing ORAL Sun-O-6-1-3 552 Improved MVDR Beamforming Using Single-Channel Mask Prediction Networks Hakan Erdogan, John R. Hershey, Shinji Watanabe, Michael I. Mandel, Jonathan Le Roux
Sunday, 11 September 2016 11:00 Grand Ballroom A Far-Field Speech Processing ORAL Sun-O-6-1-4 865 Channel Selection for Distant Speech Recognition Exploiting Cepstral Distance Cristina Guerrero, Georgina Tryfou, Maurizio Omologo
Sunday, 11 September 2016 11:20 Grand Ballroom A Far-Field Speech Processing ORAL Sun-O-6-1-5 1275 Multichannel Spatial Clustering for Robust Far-Field Automatic Speech Recognition in Mismatched Conditions Michael I. Mandel, Jon Barker
Sunday, 11 September 2016 11:40 Grand Ballroom A Far-Field Speech Processing ORAL Sun-O-6-1-6 1475 Far-Field ASR Without Parallel Data Vijayaditya Peddinti, Vimal Manohar, Yiming Wang, Daniel Povey, Sanjeev Khudanpur
Sunday, 11 September 2016 10:00 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-6-2-1 129 The INTERSPEECH 2016 Computational Paralinguistics Challenge: Deception, Sincerity & Native Language Björn Schuller, Stefan Steidl, Anton Batliner, Julia Hirschberg, Judee K. Burgoon, Alice Baird, Aaron Elkins, Yue Zhang, Eduardo Coutinho, Keelan Evanini
Sunday, 11 September 2016 10:10 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-6-2-2 4007 The Deception Sub-Challenge: The Data Björn Schuller, Stefan Steidl, Anton Batliner, Julia Hirschberg, Judee K. Burgoon, Alice Baird, Aaron Elkins, Yue Zhang, Eduardo Coutinho, Keelan Evanini
Sunday, 11 September 2016 10:20 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-6-2-3 1519 Combining Acoustic-Prosodic, Lexical, and Phonotactic Features for Automatic Deception Detection Sarah Ita Levitan, Guozhen An, Min Ma, Rivka Levitan, Andrew Rosenberg, Julia Hirschberg
Sunday, 11 September 2016 10:30 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-6-2-4 565 Is Deception Emotional? An Emotion-Driven Predictive Approach Shahin Amiriparian, Jouni Pohjalainen, Erik Marchi, Sergey Pugachevskiy, Björn Schuller
Sunday, 11 September 2016 10:40 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-6-2-5 33 Prosodic Cues and Answer Type Detection for the Deception Sub-Challenge Claude Montacié, Marie-José Caraty
Sunday, 11 September 2016 10:50 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-6-2-6 4008 The Sincerity Sub-Challenge: The Data Björn Schuller, Stefan Steidl, Anton Batliner, Julia Hirschberg, Judee K. Burgoon, Alice Baird, Aaron Elkins, Yue Zhang, Eduardo Coutinho, Keelan Evanini
Sunday, 11 September 2016 11:00 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-6-2-7 1537 Automatic Estimation of Perceived Sincerity from Spoken Language Brandon M. Booth, Rahul Gupta, Pavlos Papadopoulos, Ruchir Travadi, Shrikanth S. Narayanan
Sunday, 11 September 2016 11:10 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-6-2-8 956 Estimating the Sincerity of Apologies in Speech by DNN Rank Learning and Prosodic Analysis Gábor Gosztolya, Tamás Grósz, György Szaszák, László Tóth
Sunday, 11 September 2016 11:20 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-6-2-9 756 Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation Hung-Shin Lee, Yu Tsao, Chi-Chun Lee, Hsin-Min Wang, Wei-Cheng Lin, Wei-Chen Chen, Shan-Wen Hsiao, Shyh-Kang Jeng
Sunday, 11 September 2016 11:30 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-6-2-10 971 Prediction of Deception and Sincerity from Speech Using Automatic Phone Recognition-Based Features Robert Herms
Sunday, 11 September 2016 11:40 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-6-2-11 1305 Sincerity and Deception in Speech: Two Sides of the Same Coin? A Transfer- and Multi-Task Learning Perspective Yue Zhang, Felix Weninger, Zhao Ren, Björn Schuller
Sunday, 11 September 2016 11:50 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-6-2-12 995 Fusing Acoustic Feature Representations for Computational Paralinguistics Tasks Heysem Kaya, Alexey A. Karpov
Sunday, 11 September 2016 10:00 Bayview A Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations ORAL Sun-O-6-3-1 4009 Introduction Naomi Harte, Peter Jancovic, Karl-L. Schuchmann
Sunday, 11 September 2016 10:05 Bayview A Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations ORAL Sun-O-6-3-2 4010 Poster Overview Presentations Naomi Harte, Peter Jancovic, Karl-L. Schuchmann
Sunday, 11 September 2016 11:15 Bayview A Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations ORAL Sun-O-6-3-3 4011 Discussion Naomi Harte, Peter Jancovic, Karl-L. Schuchmann
Sunday, 11 September 2016 11:55 Bayview A Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations ORAL Sun-O-6-3-4 4012 Closing remarks Naomi Harte, Peter Jancovic, Karl-L. Schuchmann
Sunday, 11 September 2016 10:00 Bayview B Dialogue Systems and Analysis of Dialogue ORAL Sun-O-6-4-1 479 A Stochastic Model for Computer-Aided Human-Human Dialogue Merwan Barlier, Romain Laroche, Olivier Pietquin
Sunday, 11 September 2016 10:20 Bayview B Dialogue Systems and Analysis of Dialogue ORAL Sun-O-6-4-2 527 Highlighting Psychological Features for Predicting Child Interjections During Story Telling Gaël Lejeune, François Rioult, Bruno Crémilleux
Sunday, 11 September 2016 10:40 Bayview B Dialogue Systems and Analysis of Dialogue ORAL Sun-O-6-4-3 949 Hybrid Dialogue State Tracking for Real World Human-to-Human Dialogues Kai Sun, Su Zhu, Lu Chen, Siqiu Yao, Xueyang Wu, Kai Yu
Sunday, 11 September 2016 11:00 Bayview B Dialogue Systems and Analysis of Dialogue ORAL Sun-O-6-4-4 202 Automatic Recognition of Social Roles Using Long Term Role Transitions in Small Group Interactions Gaurav Fotedar, Aditya Gaonkar P., Saikat Chatterjee, Prasanta Kumar Ghosh
Sunday, 11 September 2016 11:20 Bayview B Dialogue Systems and Analysis of Dialogue ORAL Sun-O-6-4-5 951 On the Influence of Gender on Interruptions in Multiparty Dialogue Paul Van Eecke, Raquel Fernández
Sunday, 11 September 2016 11:40 Bayview B Dialogue Systems and Analysis of Dialogue ORAL Sun-O-6-4-6 535 Detection of User Escalation in Human-Computer Interactions Ian Beaver, Cynthia Freeman
Sunday, 11 September 2016 10:00 Seacliff BCD "Interaction between Speech Production and Perception" ORAL Sun-O-6-5-1 396 Assessing Idiosyncrasies in a Bayesian Model of Speech Communication Marie-Lou Barnaud, Julien Diard, Pierre Bessière, Jean-Luc Schwartz
Sunday, 11 September 2016 10:20 Seacliff BCD "Interaction between Speech Production and Perception" ORAL Sun-O-6-5-2 420 Prosodic and Linguistic Analysis of Semantic Fluency Data: A Window into Speech Production and Cognition Maria K. Wolters, Najoung Kim, Jung-Ho Kim, Sarah E. MacPherson, Jong C. Park
Sunday, 11 September 2016 10:40 Seacliff BCD "Interaction between Speech Production and Perception" ORAL Sun-O-6-5-3 1594 Sensorimotor Response to Visual Imagery of Tongue Displacement William F. Katz, Divya Prabhakaran
Sunday, 11 September 2016 11:00 Seacliff BCD "Interaction between Speech Production and Perception" ORAL Sun-O-6-5-4 262 Does Auditory-Motor Learning of Speech Transfer from the CV Syllable to the CVCV Word? Tiphaine Caudrelier, Pascal Perrier, Jean-Luc Schwartz, Amélie Rochet-Capellan
Sunday, 11 September 2016 11:20 Seacliff BCD "Interaction between Speech Production and Perception" ORAL Sun-O-6-5-5 373 Exemplar Dynamics in Phonetic Convergence of Speech Rate Antje Schweitzer, Michael Walsh
Sunday, 11 September 2016 11:40 Seacliff BCD "Interaction between Speech Production and Perception" ORAL Sun-O-6-5-6 843 Articulation Rate in Adverse Listening Conditions in Younger and Older Adults Outi Tuomainen, Valerie Hazan
Sunday, 11 September 2016 10:00 Seacliff A Multimodal Processing ORAL Sun-O-6-6-1 56 Error Correction in Lightly Supervised Alignment of Broadcast Subtitles Julia Olcoz, Oscar Saz, Thomas Hain
Sunday, 11 September 2016 10:20 Seacliff A Multimodal Processing ORAL Sun-O-6-6-2 472 Automatic Genre and Show Identification of Broadcast Media Mortaza Doulaty, Oscar Saz, Raymond W.M. Ng, Thomas Hain
Sunday, 11 September 2016 10:40 Seacliff A Multimodal Processing ORAL Sun-O-6-6-3 599 Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party Environments Guan-Lin Chao, William Chan, Ian Lane
Sunday, 11 September 2016 11:00 Seacliff A Multimodal Processing ORAL Sun-O-6-6-4 196 Text-Dependent Audiovisual Synchrony Detection for Spoofing Detection in Mobile Person Recognition Amit Aides, Hagai Aronowitz
Sunday, 11 September 2016 11:20 Seacliff A Multimodal Processing ORAL Sun-O-6-6-5 406 Improving Boundary Estimation in Audiovisual Speech Activity Detection Using Bayesian Information Criterion Fei Tao, John H.L. Hansen, Carlos Busso
Sunday, 11 September 2016 11:40 Seacliff A Multimodal Processing ORAL Sun-O-6-6-6 166 Dynamic Stream Weighting for Turbo-Decoding-Based Audiovisual ASR Sebastian Gergen, Steffen Zeiler, Ahmed Hussen Abdelaziz, Robert Nickel, Dorothea Kolossa
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster A Pitch, Tone, and Music POSTER Sun-P-6-1-1 1272 Retrieval of Textual Song Lyrics from Sung Inputs Anna M. Kruspe
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster A Pitch, Tone, and Music POSTER Sun-P-6-1-2 510 Phoneme, Phone Boundary, and Tone in Automatic Scoring of Mandarin Proficiency Jiahong Yuan, Mark Liberman
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster A Pitch, Tone, and Music POSTER Sun-P-6-1-3 528 Tone Classification in Mandarin Chinese Using Convolutional Neural Networks Charles Chen, Razvan Bunescu, Li Xu, Chang Liu
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster A Pitch, Tone, and Music POSTER Sun-P-6-1-4 1401 Robust Estimation of Fundamental Frequency Using Single Frequency Filtering Approach Vishala Pannala, G. Aneeja, Sudarsana Reddy Kadiri, B. Yegnanarayana
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster A Pitch, Tone, and Music POSTER Sun-P-6-1-5 394 A Fast and Accurate Fundamental Frequency Estimator Using Recursive Moving Average Filters Ryunosuke Daido, Yuji Hisaminato
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster A Pitch, Tone, and Music POSTER Sun-P-6-1-6 679 Frequency Estimation from Waveforms Using Multi-Layered Neural Networks Prateek Verma, Ronald W. Schafer
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster B Speaker Diarization and Recognition POSTER Sun-P-6-2-1 468 Speaker Linking and Applications Using Non-Parametric Hashing Methods Douglas E. Sturim, William M. Campbell
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster B Speaker Diarization and Recognition POSTER Sun-P-6-2-2 572 Iterative PLDA Adaptation for Speaker Diarization Gaël Le Lan, Delphine Charlet, Anthony Larcher, Sylvain Meignier
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster B Speaker Diarization and Recognition POSTER Sun-P-6-2-3 1497 A Speaker Diarization System for Studying Peer-Led Team Learning Groups Harishchandra Dubey, Lakshmish Kaushik, Abhijeet Sangwan, John H.L. Hansen
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster B Speaker Diarization and Recognition POSTER Sun-P-6-2-4 126 DNN-Based Speaker Clustering for Speaker Diarisation Rosanna Milner, Thomas Hain
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster B Speaker Diarization and Recognition POSTER Sun-P-6-2-5 503 On the Importance of Efficient Transition Modeling for Speaker Diarization Itshak Lapidot, Jean-François Bonastre
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster B Speaker Diarization and Recognition POSTER Sun-P-6-2-6 1380 Priors for Speaker Counting and Diarization with AHC Gregory Sell, Alan McCree, Daniel Garcia-Romero
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster B Speaker Diarization and Recognition POSTER Sun-P-6-2-7 714 Two-Pass IB Based Speaker Diarization System Using Meeting-Specific ANN Based Features Nauman Dawalatabad, Srikanth Madikeri, Chandra Sekhar C., Hema A. Murthy
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster B Speaker Diarization and Recognition POSTER Sun-P-6-2-8 717 DNN-Based Amplitude and Phase Feature Enhancement for Noise Robust Speaker Identification Zeyan Oo, Yuta Kawakami, Longbiao Wang, Seiichi Nakagawa, Xiong Xiao, Masahiro Iwahashi
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster B Speaker Diarization and Recognition POSTER Sun-P-6-2-9 969 Unit-Selection Attack Detection Based on Unfiltered Frequency-Domain Features Ulrich Scherhag, Andreas Nautsch, Christian Rathgeb, Christoph Busch
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster B Speaker Diarization and Recognition POSTER Sun-P-6-2-10 1549 Investigating the Impact of Dialect Prestige on Lexical Decision Mairym Lloréns Monteserín, Jason Zevin
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster B Speaker Diarization and Recognition POSTER Sun-P-6-2-11 282 Speaker Verification Using Short Utterances with DNN-Based Estimation of Subglottal Acoustic Features Jinxi Guo, Gary Yeung, Deepak Muralidharan, Harish Arsikere, Amber Afshan, Abeer Alwan
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster B Speaker Diarization and Recognition POSTER Sun-P-6-2-12 1157 Factor Analysis Based Speaker Verification Using ASR Hang Su, Steven Wegmann
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster B Speaker Diarization and Recognition POSTER Sun-P-6-2-13 773 Joint Sound Source Separation and Speaker Recognition Jeroen Zegers, Hugo Van hamme
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster B Speaker Diarization and Recognition POSTER Sun-P-6-2-14 540 Robust Multichannel Gender Classification from Speech in Movie Audio Naveen Kumar, Md. Nasir, Panayiotis Georgiou, Shrikanth S. Narayanan
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-1 264 Recent Advances in Google Real-Time HMM-Driven Unit Selection Synthesizer Xavi Gonzalvo, Siamak Tazari, Chun-an Chan, Markus Becker, Alexander Gutkin, Hanna Silen
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-2 134 First Step Towards End-to-End Parametric TTS Synthesis: Generating Spectral Parameters with Neural Attention Wenfu Wang, Shuang Xu, Bo Xu
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-3 222 The Parameterized Phoneme Identity Feature as a Continuous Real-Valued Vector for Neural Network Based Speech Synthesis Zhengqi Wen, Ya Li, Jianhua Tao
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-4 230 Improved Time-Frequency Trajectory Excitation Vocoder for DNN-Based Speech Synthesis Eunwoo Song, Frank K. Soong, Hong-Goo Kang
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-5 290 Voice Quality Control Using Perceptual Expressions for Statistical Parametric Speech Synthesis Based on Cluster Adaptive Training Yamato Ohtani, Koichiro Mori, Masahiro Morita
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-6 487 Waveform Generation Based on Signal Reshaping for Statistical Parametric Speech Synthesis Felipe Espic, Cassia Valentini-Botinhao, Zhizheng Wu, Simon King
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-7 506 Speaker Representations for Speaker Adaptation in Multiple Speakers’ BLSTM-RNN-Based Speech Synthesis Yi Zhao, Daisuke Saito, Nobuaki Minematsu
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-8 522 Fast, Compact, and High Quality LSTM-RNN Based Statistical Parametric Speech Synthesizers for Mobile Devices Heiga Zen, Yannis Agiomyrgiannakis, Niels Egberts, Fergus Henderson, Przemysław Szczepaniak
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-9 589 An Investigation of DNN-Based Speech Synthesis Using Speaker Codes Nobukatsu Hojo, Yusuke Ijima, Hideyuki Mizuno
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-10 712 Using Text and Acoustic Features in Predicting Glottal Excitation Waveforms for Parametric Speech Synthesis with Recurrent Neural Networks Lauri Juvela, Xin Wang, Shinji Takaki, Manu Airaksinen, Junichi Yamagishi, Paavo Alku
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-11 1006 Model Integration for HMM- and DNN-Based Speech Synthesis Using Product-of-Experts Framework Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-12 1188 Idlak Tangle: An Open Source Kaldi Based Parametric Speech Synthesiser Based on DNN Blaise Potard, Matthew P. Aylett, David A. Baude, Petr Motlicek
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-13 258 Probabilistic Amplitude Demodulation Features in Speech Synthesis for Improving Prosody Alexandros Lazaridis, Milos Cernak, Philip N. Garner
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-14 409 On Smoothing and Enhancing Dynamics of Pitch Contours Represented by Discrete Orthogonal Polynomials for Prosody Generation Chen-Yu Chiang
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-15 885 An Investigation of Recurrent Neural Network Architectures Using Word Embeddings for Phrase Break Prediction Anandaswarup Vadapalli, Suryakanth V. Gangashetty
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster C Speech Synthesis Poster POSTER Sun-P-6-3-16 1325 Model-Based Parametric Prosody Synthesis with Deep Neural Network Hao Liu, Heng Lu, Xu Shao, Yi Xu
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster D Language Model Adaptation POSTER Sun-P-6-4-1 1382 Active and Semi-Supervised Learning in ASR: Benefits on the Acoustic and Language Models Thomas Drugman, Janne Pylkkönen, Reinhard Kneser
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster D Language Model Adaptation POSTER Sun-P-6-4-2 1093 Learning N-Gram Language Models from Uncertain Data Vitaly Kuznetsov, Hank Liao, Mehryar Mohri, Michael Riley, Brian Roark
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster D Language Model Adaptation POSTER Sun-P-6-4-3 130 Entropy Based Pruning for Non-Negative Matrix Based Language Models with Contextual Features Barlas Oğuz, Issac Alphonso, Shuangyu Chang
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster D Language Model Adaptation POSTER Sun-P-6-4-4 1342 Unsupervised Adaptation of Recurrent Neural Network Language Models Siva Reddy Gangireddy, Pawel Swietojanski, Peter Bell, Steve Renals
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster D Language Model Adaptation POSTER Sun-P-6-4-5 1358 Contextual Prediction Models for Speech Recognition Yoni Halpern, Keith Hall, Vlad Schogol, Michael Riley, Brian Roark, Gleb Skobeltsyn, Martin Bäuml
Sunday, 11 September 2016 10:00 Pacific Concourse - Poster D Language Model Adaptation POSTER Sun-P-6-4-6 480 Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition Salil Deena, Madina Hasan, Mortaza Doulaty, Oscar Saz, Thomas Hain
Sunday, 11 September 2016 10:00 Market Street Foyer Show & Tell Session 6   Sun-S&T-6-1 2022 A Low Cost Desktop Robot and Tele-Presence Device for Interactive Speech Research Michael C. Brady
Sunday, 11 September 2016 10:00 Market Street Foyer Show & Tell Session 6   Sun-S&T-6-2 2005 Silent-Speech Command Word Recognition Using Electro-Optical Stomatography Simon Stone, Peter Birkholz
Sunday, 11 September 2016 10:00 Market Street Foyer Show & Tell Session 6   Sun-S&T-6-3 2016 An Engine for Online Video Search in Large Archives of the Holocaust Testimonies Petr Stanislav, Jan Švec, Pavel Ircing
Sunday, 11 September 2016 10:00 Market Street Foyer Show & Tell Session 6   Sun-S&T-6-4 2021 MIVOQ-PTTS - A Revolutionary New Way of Thinking TTS Piero Cosi
Sunday, 11 September 2016 13:30 Grand Ballroom A Robustness in Speech Processing ORAL Sun-O-7-1-1 741 Data Selection by Sequence Summarizing Neural Network in Mismatch Condition Training Kateřina Žmolíková, Martin Karafiát, Karel Veselý, Marc Delcroix, Shinji Watanabe, Lukáš Burget, Jan Černocký
Sunday, 11 September 2016 13:50 Grand Ballroom A Robustness in Speech Processing ORAL Sun-O-7-1-2 760 Incorporating a Generative Front-End Layer to Deep Neural Network for Noise Robust Automatic Speech Recognition Souvik Kundu, Khe Chai Sim, Mark J.F. Gales
Sunday, 11 September 2016 14:10 Grand Ballroom A Robustness in Speech Processing ORAL Sun-O-7-1-3 852 Robust Speech Recognition Using Generalized Distillation Framework Konstantin Markov, Tomoko Matsui
Sunday, 11 September 2016 14:30 Grand Ballroom A Robustness in Speech Processing ORAL Sun-O-7-1-4 879 Adversarial Multi-Task Learning of Deep Neural Networks for Robust Speech Recognition Yusuke Shinohara
Sunday, 11 September 2016 14:50 Grand Ballroom A Robustness in Speech Processing ORAL Sun-O-7-1-5 1277 The Use of Locally Normalized Cepstral Coefficients (LNCC) to Improve Speaker Recognition Accuracy in Highly Reverberant Rooms Víctor Poblete, Juan Pablo Escudero, Josué Fredes, José Novoa, Richard M. Stern, Simon King, Néstor Becerra Yoma
Sunday, 11 September 2016 15:10 Grand Ballroom A Robustness in Speech Processing ORAL Sun-O-7-1-6 1386 Two-Stage Data Augmentation for Low-Resourced Speech Recognition William Hartmann, Tim Ng, Roger Hsiao, Stavros Tsakalidis, Richard Schwartz
Sunday, 11 September 2016 13:30 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-7-2-1 4013 The Native Language Sub-Challenge: The Data Björn Schuller, Stefan Steidl, Anton Batliner, Julia Hirschberg, Judee K. Burgoon, Alice Baird, Aaron Elkins, Yue Zhang, Eduardo Coutinho, Keelan Evanini
Sunday, 11 September 2016 13:40 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-7-2-2 1100 Native Language Identification Using Spectral and Source-Based Features Avni Rajpal, Tanvina B. Patel, Hardik B. Sailor, Maulik C. Madhavi, Hemant A. Patil, Hiroya Fujisaki
Sunday, 11 September 2016 13:50 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-7-2-3 1148 Accent Identification by Combining Deep Neural Networks and Recurrent Neural Networks Trained on Long and Short Term Features Yishan Jiao, Ming Tu, Visar Berisha, Julie Liss
Sunday, 11 September 2016 14:00 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-7-2-4 261 Convolutional Neural Networks with Data Augmentation for Classifying Speakers’ Native Language Gil Keren, Jun Deng, Jouni Pohjalainen, Björn Schuller
Sunday, 11 September 2016 14:10 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-7-2-5 1473 Native Language Detection Using the I-Vector Framework Mohammed Senoussaoui, Patrick Cardinal, Najim Dehak, Alessandro L. Koerich
Sunday, 11 September 2016 14:20 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-7-2-6 1466 Within-Speaker Features for Native Language Recognition in the Interspeech 2016 Computational Paralinguistics Challenge Mark Huckvale
Sunday, 11 September 2016 14:30 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-7-2-7 1312 Multimodal Fusion of Multirate Acoustic, Prosodic, and Lexical Speaker Characteristics for Native Language Identification Prashanth Gurunath Shivakumar, Sandeep Nallan Chakravarthula, Panayiotis Georgiou
Sunday, 11 September 2016 14:40 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-7-2-8 1491 Exploiting Phone Log-Likelihood Ratio Features for the Detection of the Native Language of Non-Native English Speakers Alberto Abad, Eugénio Ribeiro, Fábio Kepler, Ramon Astudillo, Isabel Trancoso
Sunday, 11 September 2016 14:50 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-7-2-9 962 Determining Native Language and Deception Using Phonetic Features and Classifier Combination Gábor Gosztolya, Tamás Grósz, Róbert Busa-Fekete, László Tóth
Sunday, 11 September 2016 15:00 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-7-2-10 4014 The INTERSPEECH 2016 Computational Paralinguistics Challenge: A Summary of Results Björn Schuller, Stefan Steidl, Anton Batliner, Julia Hirschberg, Judee K. Burgoon, Alice Baird, Aaron Elkins, Yue Zhang, Eduardo Coutinho, Keelan Evanini
Sunday, 11 September 2016 15:10 Grand Ballroom BC Special Session: Interspeech 2016 Computational Paralinguistics Challenge (ComParE): Deception, Sincerity & Native Language ORAL Sun-O-7-2-11 4015 Discussion Björn Schuller, Stefan Steidl, Anton Batliner, Julia Hirschberg, Judee K. Burgoon, Alice Baird, Aaron Elkins, Yue Zhang, Eduardo Coutinho, Keelan Evanini
Sunday, 11 September 2016 13:30 Bayview A Acoustic and Articulatory Phonetics ORAL Sun-O-7-3-1 568 A Preliminary Ultrasound Study of Nasal and Lateral Coronals in Arrernte Marija Tabain, Richard Beare
Sunday, 11 September 2016 13:50 Bayview A Acoustic and Articulatory Phonetics ORAL Sun-O-7-3-2 605 Illustrating the Production of the International Phonetic Alphabet Sounds Using Fast Real-Time Magnetic Resonance Imaging Asterios Toutios, Sajan Goud Lingala, Colin Vaz, Jangwon Kim, John Esling, Patricia Keating, Matthew Gordon, Dani Byrd, Louis Goldstein, Krishna S. Nayak, Shrikanth S. Narayanan
Sunday, 11 September 2016 14:10 Bayview A Acoustic and Articulatory Phonetics ORAL Sun-O-7-3-3 762 Marginal Contrast Among Romanian Vowels: Evidence from ASR and Functional Load Margaret E.L. Renwick, Ioana Vasilescu, Camille Dutrey, Lori Lamel, Bianca Vieru
Sunday, 11 September 2016 14:30 Bayview A Acoustic and Articulatory Phonetics ORAL Sun-O-7-3-4 1054 Effects of Subglottal-Coupling and Interdental-Space on Formant Trajectories During Front-to-Back Vowel Transitions in Chinese Shuanglin Fan, Kiyoshi Honda, Jianwu Dang, Hui Feng
Sunday, 11 September 2016 14:50 Bayview A Acoustic and Articulatory Phonetics ORAL Sun-O-7-3-5 1498 Perceptual Lateralization of Coda Rhotic Production in Puerto Rican Spanish Mairym Lloréns Monteserín, Shrikanth S. Narayanan, Louis Goldstein
Sunday, 11 September 2016 15:10 Bayview A Acoustic and Articulatory Phonetics ORAL Sun-O-7-3-6 662 Interaction Between Lexical Tone and Intonation: An EMA Study Hao Yi, Sam Tilsen
Sunday, 11 September 2016 13:30 Bayview B Speech Synthesis Oral I: Neural Networks ORAL Sun-O-7-4-1 1053 Deep Bidirectional LSTM Modeling of Timbre and Prosody for Emotional Voice Conversion Huaiping Ming, Dongyan Huang, Lei Xie, Jie Wu, Minghui Dong, Haizhou Li
Sunday, 11 September 2016 13:50 Bayview B Speech Synthesis Oral I: Neural Networks ORAL Sun-O-7-4-2 1084 Visual Speech Synthesis Using Dynamic Visemes, Contextual Features and DNNs Ausdang Thangthai, Ben Milner, Sarah Taylor
Sunday, 11 September 2016 14:10 Bayview B Speech Synthesis Oral I: Neural Networks ORAL Sun-O-7-4-3 96 A Template-Based Approach for Speech Synthesis Intonation Generation Using LSTMs Srikanth Ronanki, Gustav Eje Henter, Zhizheng Wu, Simon King
Sunday, 11 September 2016 14:30 Bayview B Speech Synthesis Oral I: Neural Networks ORAL Sun-O-7-4-4 172 Multi-Language Multi-Speaker Acoustic Modeling for LSTM-RNN Based Statistical Parametric Speech Synthesis Bo Li, Heiga Zen
Sunday, 11 September 2016 14:50 Bayview B Speech Synthesis Oral I: Neural Networks ORAL Sun-O-7-4-5 342 GlottDNN - A Full-Band Glottal Vocoder for Statistical Parametric Speech Synthesis Manu Airaksinen, Bajibabu Bollepalli, Lauri Juvela, Zhizheng Wu, Simon King, Paavo Alku
Sunday, 11 September 2016 15:10 Bayview B Speech Synthesis Oral I: Neural Networks ORAL Sun-O-7-4-6 1027 Singing Voice Synthesis Based on Deep Neural Networks Masanari Nishimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda
Sunday, 11 September 2016 13:30 Seacliff BCD Speech Quality & Intelligibility ORAL Sun-O-7-5-1 27 Blind Recovery of Perceptual Models in Distributed Speech and Audio Coding Tom Bäckström, Florin Ghido, Johannes Fischer
Sunday, 11 September 2016 13:50 Seacliff BCD Speech Quality & Intelligibility ORAL Sun-O-7-5-2 14 Glimpse-Based Metrics for Predicting Speech Intelligibility in Additive Noise Conditions Yan Tang, Martin Cooke
Sunday, 11 September 2016 14:10 Seacliff BCD Speech Quality & Intelligibility ORAL Sun-O-7-5-3 255 Analyzing the Relation Between Overall Quality and the Quality of Individual Phases in a Telephone Conversation Friedemann Köster, Sebastian Möller
Sunday, 11 September 2016 14:30 Seacliff BCD Speech Quality & Intelligibility ORAL Sun-O-7-5-4 144 Intelligibility Enhancement at the Receiving End of the Speech Transmission System - Effects of Far-End Noise Reduction Emma Jokinen, Paavo Alku
Sunday, 11 September 2016 14:50 Seacliff BCD Speech Quality & Intelligibility ORAL Sun-O-7-5-5 1448 Intelligibility of Disordered Speech: Global and Detailed Scores Mario Ganzeboom, Marjoke Bakker, Catia Cucchiarini, Helmer Strik
Sunday, 11 September 2016 15:10 Seacliff BCD Speech Quality & Intelligibility ORAL Sun-O-7-5-6 500 Modulation Enhancement of Temporal Envelopes for Increasing Speech Intelligibility in Noise Maria Koutsogiannaki, Yannis Stylianou
Sunday, 11 September 2016 13:30 Seacliff A Speech Translation and Metadata for Linguistic/Discourse Structure ORAL Sun-O-7-6-1 154 Dynamic Transcription for Low-Latency Speech Translation Jan Niehues, Thai Son Nguyen, Eunah Cho, Thanh-Le Ha, Kevin Kilgour, Markus Müller, Matthias Sperber, Sebastian Stüker, Alex Waibel
Sunday, 11 September 2016 13:50 Seacliff A Speech Translation and Metadata for Linguistic/Discourse Structure ORAL Sun-O-7-6-2 862 Learning a Translation Model from Word Lattices Oliver Adams, Graham Neubig, Trevor Cohn, Steven Bird
Sunday, 11 September 2016 14:10 Seacliff A Speech Translation and Metadata for Linguistic/Discourse Structure ORAL Sun-O-7-6-3 1247 Disfluency Detection Using a Bidirectional LSTM Vicky Zayats, Mari Ostendorf, Hannaneh Hajishirzi
Sunday, 11 September 2016 14:30 Seacliff A Speech Translation and Metadata for Linguistic/Discourse Structure ORAL Sun-O-7-6-4 257 Sentence Boundary Detection Based on Parallel Lexical and Acoustic Models Xiaoyin Che, Sheng Luo, Haojin Yang, Christoph Meinel
Sunday, 11 September 2016 14:50 Seacliff A Speech Translation and Metadata for Linguistic/Discourse Structure ORAL Sun-O-7-6-5 898 Transferring Emphasis in Speech Translation Using Hard-Attentional Neural Network Models Quoc Truong Do, Sakriani Sakti, Graham Neubig, Satoshi Nakamura
Sunday, 11 September 2016 15:10 Seacliff A Speech Translation and Metadata for Linguistic/Discourse Structure ORAL Sun-O-7-6-6 464 Better Evaluation of ASR in Speech Translation Context Using Word Embeddings Ngoc-Tien Le, Christophe Servan, Benjamin Lecouteux, Laurent Besacier
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster A Speech Coding and Audio Processing for Noise Reduction POSTER Sun-P-7-1-1 55 Entropy Coding of Spectral Envelopes for Speech and Audio Coding Using Distribution Quantization Srikanth Korse, Tobias Jähnel, Tom Bäckström
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster A Speech Coding and Audio Processing for Noise Reduction POSTER Sun-P-7-1-2 1595 An Objective Evaluation Methodology for Blind Bandwidth Extension Stéphane Villette, Sen Li, Pravin Ramadas, Daniel J. Sinder
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster A Speech Coding and Audio Processing for Noise Reduction POSTER Sun-P-7-1-3 917 EVS Channel Aware Mode Robustness to Frame Erasures Anssi Rämö, Antti Kurittu, Henri Toukomaa
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster A Speech Coding and Audio Processing for Noise Reduction POSTER Sun-P-7-1-4 1049 An Interaural Magnification Algorithm for Enhancement of Naturally-Occurring Level Differences Shadi Pirhosseinloo, Kostas Kokkinakis
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster A Speech Enhancement and Noise Reduction POSTER Sun-P-7-1-5 1340 Probabilistic Spatial Filter Estimation for Signal Enhancement in Multi-Channel Automatic Speech Recognition Hendrik Kayser, Niko Moritz, Jörn Anemüller
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster A Speech Coding and Audio Processing for Noise Reduction POSTER Sun-P-7-1-6 894 Improved a priori SAP Estimator in Complex Noisy Environment for Dual Channel Microphone System Youna Ji, Young-cheol Park
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster A Speech Coding and Audio Processing for Noise Reduction POSTER Sun-P-7-1-7 757 A Spectral Modulation Sensitivity Weighted Pre-Emphasis Filter for Active Noise Control System Kah-Meng Cheong, Yuh-Yuan Wang, Tai-Shih Chi
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster A Speech Coding and Audio Processing for Noise Reduction POSTER Sun-P-7-1-8 798 Semi-Coupled Dictionary Based Automatic Bandwidth Extension Approach for Enhancing Children’s ASR Ganji Sreeram, Rohit Sinha
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster B Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations POSTER Sun-P-7-2-1 1110 Bird Song Synthesis Based on Hidden Markov Models Jordi Bonada, Robert Lachlan, Merlijn Blaauw
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster B Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations POSTER Sun-P-7-2-2 1360 Noise-Robust Hidden Markov Models for Limited Training Data for Within-Species Bird Phrase Classification Kantapon Kaewtip, Charles Taylor, Abeer Alwan
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster B Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations POSTER Sun-P-7-2-3 1410 A Framework for Automated Marmoset Vocalization Detection and Classification Alan Wisler, Laura J. Brattain, Rogier Landman, Thomas F. Quatieri
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster B Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations POSTER Sun-P-7-2-4 336 Call Alternation Between Specific Pairs of Male Frogs Revealed by a Sound-Imaging Method in Their Natural Habitat Ikkyu Aihara, Takeshi Mizumoto, Hiromitsu Awano, Hiroshi G. Okuno
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster B Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations POSTER Sun-P-7-2-5 361 Sinusoidal Modelling for Ecoacoustics Patrice Guyot, Alice Eldridge, Ying Chen Eyre-Walker, Alison Johnston, Thomas Pellegrini, Mika Peck
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster B Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations POSTER Sun-P-7-2-6 465 Individual Identity in Songbirds: Signal Representations and Metric Learning for Locating the Information in Complex Corvid Calls Dan Stowell, Veronica Morfi, Lisa F. Gill
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster B Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations POSTER Sun-P-7-2-7 669 Recognition of Multiple Bird Species Based on Penalised Maximum Likelihood and HMM-Based Modelling of Individual Vocalisation Elements Peter Jančovič, Münevver Köküer
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster B Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations POSTER Sun-P-7-2-8 746 Cost Effective Acoustic Monitoring of Bird Species Ciira wa Maina
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster B Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations POSTER Sun-P-7-2-9 748 Feature Learning and Automatic Segmentation for Dolphin Communication Analysis Daniel Kohlsdorf, Denise Herzing, Thad Starner
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster B Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations POSTER Sun-P-7-2-10 782 Localizing Bird Songs Using an Open Source Robot Audition System with a Microphone Array Reiji Suzuki, Shiho Matsubayashi, Kazuhiro Nakadai, Hiroshi G. Okuno
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster B Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations POSTER Sun-P-7-2-11 83 Robust Detection of Multiple Bioacoustic Events with Repetitive Structures Frank Kurth
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster B Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations POSTER Sun-P-7-2-12 841 A Real-Time Parametric General-Purpose Mammalian Vocal Synthesiser Roger K. Moore
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster B Special Session: Speech, Audio, and Language Processing Techniques Applied to Bird and Animal Vocalizations POSTER Sun-P-7-2-13 90 YIN-Bird: Improved Pitch Tracking for Bird Vocalisations Colm O’Reilly, Nicola M. Marples, David J. Kelly, Naomi Harte
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster C Learning, Education and Different Speech POSTER Sun-P-7-3-1 1602 Mispronunciation Detection Leveraging Maximum Performance Criterion Training of Acoustic Models and Decision Functions Yao-Chi Hsu, Ming-Han Yang, Hsiao-Tsung Hung, Berlin Chen
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster C Learning, Education and Different Speech POSTER Sun-P-7-3-2 1388 Using Clinician Annotations to Improve Automatic Speech Recognition of Stuttered Speech Peter A. Heeman, Rebecca Lunsford, Andy McMillin, J. Scott Yaruss
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster C Learning, Education and Different Speech POSTER Sun-P-7-3-3 986 Deep Neural Networks for Voice Quality Assessment Based on the GRBAS Scale Simin Xie, Nan Yan, Ping Yu, Manwa L. Ng, Lan Wang, Zhuanzhuan Ji
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster C Learning, Education and Different Speech POSTER Sun-P-7-3-4 850 Automated Screening of Speech Development Issues in Children by Identifying Phonological Error Patterns Lauren Ward, Alessandro Stefani, Daniel Smith, Andreas Duenser, Jill Freyne, Barbara Dodd, Angela Morgan
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster C Learning, Education and Different Speech POSTER Sun-P-7-3-5 1162 Automatic Pronunciation Evaluation of Non-Native Mandarin Tone by Using Multi-Level Confidence Measures Ju Lin, Yanlu Xie, Jinsong Zhang
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster C Learning, Education and Different Speech POSTER Sun-P-7-3-6 776 Dysarthric Speech Recognition Using Kullback-Leibler Divergence-Based Hidden Markov Model Myungjong Kim, Jun Wang, Hoirin Kim
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster C Learning, Education and Different Speech POSTER Sun-P-7-3-7 1518 Detection of Total Syllables and Canonical Syllables in Infant Vocalizations Anne S. Warlaumont, Heather L. Ramsdell-Hudock
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster C Learning, Education and Different Speech POSTER Sun-P-7-3-8 213 Improving Automatic Recognition of Aphasic Speech with AphasiaBank Duc Le, Emily Mower Provost
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster C Learning, Education and Different Speech POSTER Sun-P-7-3-9 513 Pronunciation Assessment of Japanese Learners of French with GOP Scores and Phonetic Information Vincent Laborde, Thomas Pellegrini, Lionel Fontan, Julie Mauclair, Halima Sahraoui, Jérôme Farinas
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster C Learning, Education and Different Speech POSTER Sun-P-7-3-10 539 Pronunciation Error Detection for New Language Learners Sean Robertson, Cosmin Munteanu, Gerald Penn
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster C Learning, Education and Different Speech POSTER Sun-P-7-3-11 427 L2 English Rhythm in Read Speech by Chinese Students Hongwei Ding, Xinping Xu
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster D Dialogue Systems and Analysis of Dialogue POSTER Sun-P-7-4-1 810 Improving the Probabilistic Framework for Representing Dialogue Systems with User Response Model Miao Li, Zhipeng Chen, Ji Wu
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster D Dialogue Systems and Analysis of Dialogue POSTER Sun-P-7-4-2 1234 Dialogue Session Segmentation by Embedding-Enhanced TextTiling Yiping Song, Lili Mou, Rui Yan, Li Yi, Zinan Zhu, Xiaohua Hu, Ming Zhang
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster D Dialogue Systems and Analysis of Dialogue POSTER Sun-P-7-4-3 800 Target-Based State and Tracking Algorithm for Spoken Dialogue System Miao Li, Zhiyang He, Ji Wu
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster D Dialogue Systems and Analysis of Dialogue POSTER Sun-P-7-4-4 1359 Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection Sheng-syun Shen, Hung-Yi Lee
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster D Dialogue Systems and Analysis of Dialogue POSTER Sun-P-7-4-5 563 Objective Language Feature Analysis in Children with Neurodevelopmental Disorders During Autism Assessment Manoj Kumar, Rahul Gupta, Daniel Bone, Nikolaos Malandrakis, Somer Bishop, Shrikanth S. Narayanan
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster D Dialogue Systems and Analysis of Dialogue POSTER Sun-P-7-4-6 404 Improving Generalisation to New Speakers in Spoken Dialogue State Tracking Iñigo Casanueva, Thomas Hain, Phil Green
Sunday, 11 September 2016 13:30 Pacific Concourse - Poster D Dialogue Systems and Analysis of Dialogue POSTER Sun-P-7-4-7 876 Towards Machine Comprehension of Spoken Content: Initial TOEFL Listening Comprehension Test by Machine Bo-Hsiang Tseng, Sheng-syun Shen, Hung-Yi Lee, Lin-Shan Lee
Sunday, 11 September 2016 16:00 Grand Ballroom A Topics in Speech Recognition ORAL Sun-O-8-1-1 283 How Neural Network Depth Compensates for HMM Conditional Independence Assumptions in DNN-HMM Acoustic Models Suman Ravuri, Steven Wegmann
Sunday, 11 September 2016 16:20 Grand Ballroom A Topics in Speech Recognition ORAL Sun-O-8-1-2 968 Jointly Learning to Locate and Classify Words Using Convolutional Networks Dimitri Palaz, Gabriel Synnaeve, Ronan Collobert
Sunday, 11 September 2016 16:40 Grand Ballroom A Topics in Speech Recognition ORAL Sun-O-8-1-3 128 On the Efficient Representation and Execution of Deep Acoustic Models Raziel Alvarez, Rohit Prabhavalkar, Anton Bakhtin
Sunday, 11 September 2016 17:00 Grand Ballroom A Topics in Speech Recognition ORAL Sun-O-8-1-4 595 Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI Daniel Povey, Vijayaditya Peddinti, Daniel Galvez, Pegah Ghahremani, Vimal Manohar, Xingyu Na, Yiming Wang, Sanjeev Khudanpur
Sunday, 11 September 2016 17:20 Grand Ballroom A Topics in Speech Recognition ORAL Sun-O-8-1-5 832 Virtual Adversarial Training Applied to Neural Higher-Order Factors for Phone Classification Martin Ratajczak, Sebastian Tschiatschek, Franz Pernkopf
Sunday, 11 September 2016 17:40 Grand Ballroom A Topics in Speech Recognition ORAL Sun-O-8-1-6 911 Sequence Student-Teacher Training of Deep Neural Networks Jeremy H.M. Wong, Mark J.F. Gales
Sunday, 11 September 2016 16:00 Grand Ballroom BC Special Session: Realism in Robust Speech Processing ORAL Sun-O-8-2-1 1395 Robustness in Speech, Speaker, and Language Recognition: “You’ve Got to Know Your Limitations” John H.L. Hansen, Hynek Bořil
Sunday, 11 September 2016 16:15 Grand Ballroom BC Special Session: Realism in Robust Speech Processing ORAL Sun-O-8-2-2 143 The Use of Read versus Conversational Lombard Speech in Spectral Tilt Modeling for Intelligibility Enhancement in Near-End Noise Conditions Emma Jokinen, Ulpu Remes, Paavo Alku
Sunday, 11 September 2016 16:30 Grand Ballroom BC Special Session: Realism in Robust Speech Processing ORAL Sun-O-8-2-3 1609 Corpora for the Evaluation of Robust Speaker Recognition Systems Douglas E. Sturim, Pedro A. Torres-Carrasquillo, Joseph P. Campbell
Sunday, 11 September 2016 16:45 Grand Ballroom BC Special Session: Realism in Robust Speech Processing ORAL Sun-O-8-2-4 1384 A French Corpus for Distant-Microphone Speech Processing in Real Homes Nancy Bertin, Ewen Camberlein, Emmanuel Vincent, Romain Lebarbenchon, Stéphane Peillon, Éric Lamande, Sunit Sivasankaran, Frédéric Bimbot, Irina Illina, Ariane Tom, Sylvain Fleury, Éric Jamet
Sunday, 11 September 2016 17:00 Grand Ballroom BC Special Session: Realism in Robust Speech Processing ORAL Sun-O-8-2-5 731 Realistic Multi-Microphone Data Simulation for Distant Speech Recognition Mirco Ravanelli, Piergiorgio Svaizer, Maurizio Omologo
Sunday, 11 September 2016 17:15 Grand Ballroom BC Special Session: Realism in Robust Speech Processing ORAL Sun-O-8-2-6 978 Synthesis of Device-Independent Noise Corpora for Realistic ASR Evaluation Hannes Gamper, Mark R.P. Thomas, Lyle Corbin, Ivan Tashev
Sunday, 11 September 2016 17:30 Grand Ballroom BC Special Session: Realism in Robust Speech Processing ORAL Sun-O-8-2-7 544 Speaker Recognition Using Real vs Synthetic Parallel Data for DNN Channel Compensation Fred Richardson, Michael Brandstein, Jennifer Melot, Douglas Reynolds
Sunday, 11 September 2016 17:45 Grand Ballroom BC Special Session: Realism in Robust Speech Processing ORAL Sun-O-8-2-8 4016 Discussion "Dayana Ribas, Emmanuel Vincent, John Hansen, Emma Jokinen, Mirco Ravanelli, Hannes Gamper, Fred Richardson
"
Sunday, 11 September 2016 16:00 Bayview A Spoken Word Recognition ORAL Sun-O-8-3-1 1072 Combining Data-Oriented and Process-Oriented Approaches to Modeling Reaction Time Data Louis ten Bosch, Lou Boves, M. Ernestus
Sunday, 11 September 2016 16:20 Bayview A Spoken Word Recognition ORAL Sun-O-8-3-2 610 Do Listeners Learn Better from Natural Speech? Michael McAuliffe, Molly Babel, Charlotte Vaughn
Sunday, 11 September 2016 16:40 Bayview A Spoken Word Recognition ORAL Sun-O-8-3-3 814 Processing and Adaptation to Ambiguous Sounds during the Course of Perceptual Learning Polina Drozdova, Roeland van Hout, Odette Scharenborg
Sunday, 11 September 2016 17:00 Bayview A Spoken Word Recognition ORAL Sun-O-8-3-4 882 The Effect of Background Noise on the Activation of Phonological and Semantic Information During Spoken-Word Recognition Florian Hintz, Odette Scharenborg
Sunday, 11 September 2016 17:20 Bayview A Spoken Word Recognition ORAL Sun-O-8-3-5 906 Relationships Between Functional Load and Auditory Confusability Under Different Speech Environments Shinae Kang, Clara Cohen
Sunday, 11 September 2016 17:40 Bayview A Spoken Word Recognition ORAL Sun-O-8-3-6 1445 The Role of Pitch in Punjabi Word Identification Jasmeen Kanwal, Amanda Ritchart
Sunday, 11 September 2016 16:00 Bayview B Speech Synthesis Oral: High Level Linguistic Features ORAL Sun-O-8-4-1 864 Improving TTS with Corpus-Specific Pronunciation Adaptation Marie Tahon, Raheel Qader, Gwénolé Lecorvé, Damien Lolive
Sunday, 11 September 2016 16:20 Bayview B Speech Synthesis Oral: High Level Linguistic Features ORAL Sun-O-8-4-2 1229 Deep Bidirectional Long Short-Term Memory Recurrent Neural Networks for Grapheme-to-Phoneme Conversion Utilizing Complex Many-to-Many Alignments Amr El-Desoky Mousa, Björn Schuller
Sunday, 11 September 2016 16:40 Bayview B Speech Synthesis Oral: High Level Linguistic Features ORAL Sun-O-8-4-3 1419 Predicting Pronunciations with Syllabification and Stress with Recurrent Neural Networks Daan van Esch, Mason Chua, Kanishka Rao
Sunday, 11 September 2016 17:00 Bayview B Speech Synthesis Oral: High Level Linguistic Features ORAL Sun-O-8-4-4 165 Adaptive Latency for Part-of-Speech Tagging in Incremental Text-to-Speech Synthesis Maël Pouget, Olha Nahorna, Thomas Hueber, Gérard Bailly
Sunday, 11 September 2016 17:20 Bayview B Speech Synthesis Oral: High Level Linguistic Features ORAL Sun-O-8-4-5 399 Redefining the Linguistic Context Feature Set for HMM and DNN TTS Through Position and Parsing Rasmus Dall, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda
Sunday, 11 September 2016 17:40 Bayview B Speech Synthesis Oral: High Level Linguistic Features ORAL Sun-O-8-4-6 390 Enhance the Word Vector with Prosodic Information for the Recurrent Neural Network Based TTS System Xin Wang, Shinji Takaki, Junichi Yamagishi
Sunday, 11 September 2016 16:00 Seacliff BCD Speech Enhancement ORAL Sun-O-8-5-1 586 Local Sparsity Based Online Dictionary Learning for Environment-Adaptive Speech Enhancement with Nonnegative Matrix Factorization Kwang Myung Jeon, Hong Kook Kim
Sunday, 11 September 2016 16:20 Seacliff BCD Speech Enhancement ORAL Sun-O-8-5-2 501 Noise Aware and Combined Noise Models for Speech Denoising in Unknown Noise Conditions Pavlos Papadopoulos, Colin Vaz, Shrikanth S. Narayanan
Sunday, 11 September 2016 16:40 Seacliff BCD Speech Enhancement ORAL Sun-O-8-5-3 437 Causal Speech Enhancement Combining Data-Driven Learning and Suppression Rule Estimation Seyedmahdad Mirsamadi, Ivan Tashev
Sunday, 11 September 2016 17:00 Seacliff BCD Speech Enhancement ORAL Sun-O-8-5-4 150 A Phase-Based Time-Frequency Masking for Multi-Channel Speech Enhancement in Domestic Environments Alessio Brutti, Antigoni Tsiami, Athanasios Katsamanis, Petros Maragos
Sunday, 11 September 2016 17:20 Seacliff BCD Speech Enhancement ORAL Sun-O-8-5-5 1026 Generalizing Steady State Suppression for Enhanced Intelligibility Under Reverberation Petko N. Petkov, Yannis Stylianou
Sunday, 11 September 2016 17:40 Seacliff BCD Speech Enhancement ORAL Sun-O-8-5-6 652 Speech Intelligibility Prediction Based on the Envelope Power Spectrum Model with the Dynamic Compressive Gammachirp Auditory Filterbank Katsuhiko Yamamoto, Toshio Irino, Toshie Matsui, Shoko Araki, Keisuke Kinoshita, Tomohiro Nakatani
Sunday, 11 September 2016 16:00 Seacliff A Dialogue: Backchannels and Turntaking ORAL Sun-O-8-6-1 118 Prediction and Generation of Backchannel Form for Attentive Listening Systems Tatsuya Kawahara, Takashi Yamaguchi, Koji Inoue, Katsuya Takanashi, Nigel Ward
Sunday, 11 September 2016 16:20 Seacliff A Dialogue: Backchannels and Turntaking ORAL Sun-O-8-6-2 1350 Measuring Turn-Taking Offsets in Human-Human Dialogues Rebecca Lunsford, Peter A. Heeman, Emma Rennie
Sunday, 11 September 2016 16:40 Seacliff A Dialogue: Backchannels and Turntaking ORAL Sun-O-8-6-3 1409 Using Past Speaker Behavior to Better Predict Turn Transitions Tomer Meshorer, Peter A. Heeman
Sunday, 11 September 2016 17:00 Seacliff A Dialogue: Backchannels and Turntaking ORAL Sun-O-8-6-4 22 Quantitative Analysis of Backchannels Uttered by an Interviewer During Neuropsychological Tests Gérard Bailly, Frédéric Elisei, Alexandra Juphard, Olivier Moreaud
Sunday, 11 September 2016 17:20 Seacliff A Dialogue: Backchannels and Turntaking ORAL Sun-O-8-6-5 859 Predicting User Satisfaction from Turn-Taking in Spoken Conversations Shammur Absar Chowdhury, Evgeny A. Stepanov, Giuseppe Riccardi
Sunday, 11 September 2016 17:40 Seacliff A Dialogue: Backchannels and Turntaking ORAL Sun-O-8-6-6 1274 Towards Building an Attentive Artificial Listener: On the Perception of Attentiveness in Feedback Utterances Catharine Oertel, Joakim Gustafson, Alan W. Black
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster A Language Recognition POSTER Sun-P-8-1-1 881 Language Recognition via Sparse Coding Youngjune L. Gwon, William M. Campbell, Douglas E. Sturim, H.T. Kung
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster A Language Recognition POSTER Sun-P-8-1-2 560 A Feature Normalisation Technique for PLLR Based Language Identification Systems Sarith Fernando, Vidhyasaharan Sethu, Eliathamby Ambikairajah
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster A Language Recognition POSTER Sun-P-8-1-3 910 An Investigation of Deep Neural Network Architectures for Language Recognition in Indian Languages Mounika K.V., Sivanand Achanta, Lakshmi H. R., Suryakanth V. Gangashetty, Anil Kumar Vuppala
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster A Language Recognition POSTER Sun-P-8-1-4 1297 Automatic Dialect Detection in Arabic Broadcast Speech Ahmed Ali, Najim Dehak, Patrick Cardinal, Sameer Khurana, Sree Harsha Yella, James Glass, Peter Bell, Steve Renals
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster A Language Recognition POSTER Sun-P-8-1-5 630 Combining Weak Tokenisers for Phonotactic Language Recognition in a Resource-Constrained Setting Raymond W.M. Ng, Bhusan Chettri, Thomas Hain
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster A Language Recognition POSTER Sun-P-8-1-6 686 End-to-End Language Identification Using Attention-Based Recurrent Neural Networks Wang Geng, Wenfu Wang, Yuanyuan Zhao, Xinyuan Cai, Bo Xu
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster A Language Recognition POSTER Sun-P-8-1-7 333 Enhancing Multilingual Recognition of Emotion in Speech by Language Identification Hesam Sagha, Pavel Matějka, Maryna Gavryukova, Filip Povolny, Erik Marchi, Björn Schuller
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster B Speech and Audio Segmentation and Classification POSTER Sun-P-8-2-1 1112 Deep Neural Network Bottleneck Features for Acoustic Event Recognition Seongkyu Mun, Suwon Shon, Wooil Kim, Hanseok Ko
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster B Speech and Audio Segmentation and Classification POSTER Sun-P-8-2-2 1345 Combining Energy and Cross-Entropy Analysis for Nuclear Segments Detection Antonio Origlia, Francesco Cutugno
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster B Speech and Audio Segmentation and Classification POSTER Sun-P-8-2-3 1346 Anchored Speech Detection Roland Maas, Sree Hari Krishnan Parthasarathi, Brian King, Ruitong Huang, Björn Hoffmeister
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster B Speech and Audio Segmentation and Classification POSTER Sun-P-8-2-4 1366 Towards Smart-Cars That Can Listen: Abnormal Acoustic Event Detection on the Road Mahesh Kumar Nandwana, Taufiq Hasan
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster B Speech and Audio Segmentation and Classification POSTER Sun-P-8-2-5 175 Hierarchical Classification of Speaker and Background Noise and Estimation of SNR Using Sparse Representation K.V. Vijay Girish, A.G. Ramakrishnan, T.V. Ananthapadmanabha
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster B Speech and Audio Segmentation and Classification POSTER Sun-P-8-2-6 392 Robust Sound Event Detection in Continuous Audio Environments Haomin Zhang, Ian McLoughlin, Yan Song
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster B Speech and Audio Segmentation and Classification POSTER Sun-P-8-2-7 805 Deep Convolutional Neural Networks and Data Augmentation for Acoustic Event Recognition Naoya Takahashi, Michael Gygli, Beat Pfister, Luc Van Gool
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster B Speech and Audio Segmentation and Classification POSTER Sun-P-8-2-8 1184 Artificial Neural Network-Based Feature Combination for Spatial Voice Activity Detection Stefan Meier, Walter Kellermann
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster B Speech and Audio Segmentation and Classification POSTER Sun-P-8-2-9 1281 HAPPY Team Entry to NIST OpenSAD Challenge: A Fusion of Short-Term Unsupervised and Segment i-Vector Based Speech Activity Detectors Tomi Kinnunen, Alexey Sholokhov, Elie Khoury, Dennis Alexander Lehmann Thomsen, Md. Sahidullah, Zheng-Hua Tan
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster B Speech and Audio Segmentation and Classification POSTER Sun-P-8-2-10 1341 Manual versus Automated: The Challenging Routine of Infant Vocalisation Segmentation in Home Videos to Study Neuro(mal)development Florian B. Pokorny, Robert Peharz, Wolfgang Roth, Matthias Zöhrer, Franz Pernkopf, Peter B. Marschik, Björn Schuller
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster B Speech and Audio Segmentation and Classification POSTER Sun-P-8-2-11 247 Minimizing Annotation Effort for Adaptation of Speech-Activity Detection Systems Luciana Ferrer, Martin Graciarena
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-1 874 Progress and Prospects for Spoken Language Technology: What Ordinary People Think Roger K. Moore, Hui Li, Shih-Hao Liao
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-2 948 Progress and Prospects for Spoken Language Technology: Results from Four Sexennial Surveys Roger K. Moore, Ricard Marxer
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-3 673 On Employing a Highly Mismatched Crowd for Speech Transcription Purushotam Radadia, Rahul Kumar, Kanika Kalra, Shirish Karande, Sachin Lodha
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-4 1031 Sage: The New BBN Speech Processing Platform Roger Hsiao, Ralf Meermeier, Tim Ng, Zhongqiang Huang, Maxwell Jordan, Enoch Kan, Tanel Alumäe, Jan Silovsky, William Hartmann, Francis Keith, Omer Lang, Manhung Siu, Owen Kimball
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-5 105 DNN-Based Feature Enhancement Using Joint Training Framework for Robust Multichannel Speech Recognition Kang Hyun Lee, Tae Gyoon Kang, Woo Hyun Kang, Nam Soo Kim
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-6 340 Deep Neural Network Frontend for Continuous EMG-Based Speech Recognition Michael Wand, Jürgen Schmidhuber
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-7 963 Overcoming Data Sparsity in Acoustic Modeling of Low-Resource Language by Borrowing Data and Model Parameters from High-Resource Languages Basil Abraham, S. Umesh, Neethu Mariam Joy
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-8 371 Multi-Language Neural Network Language Models Anton Ragni, Edgar Dakin, Xie Chen, Mark J.F. Gales, Kate M. Knill
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-9 1517 Bidirectional Recurrent Neural Network with Attention Mechanism for Punctuation Restoration Ottokar Tilk, Tanel Alumäe
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-10 618 TheanoLM - An Extensible Toolkit for Neural Network Language Modeling Seppo Enarvi, Mikko Kurimo
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-11 462 Selection of Multi-Genre Broadcast Data for the Training of Automatic Speech Recognition Systems P. Lanchantin, Mark J.F. Gales, Penny Karanasou, X. Liu, Y. Qian, L. Wang, P.C. Woodland, C. Zhang
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-12 660 Manipulating Word Lattices to Incorporate Human Corrections Yashesh Gaur, Florian Metze, Jeffrey P. Bigham
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-13 1503 Context-Aware Restaurant Recommendation for Natural Language Queries: A Formative User Study in the Automotive Domain Philipp Fischer, Cornelius Styp von Rekowski, Andreas Nürnberger
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-14 1589 Teaming Up: Making the Most of Diverse Representations for a Novel Personalized Speech Retrieval Application Stephanie Pancoast, Murat Akbacak
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-15 546 Automatic Speech Transcription for Low-Resource Languages - The Case of Yoloxóchitl Mixtec (Mexico) Vikramjit Mitra, Andreas Kathol, Jonathan D. Amith, Rey Castillo García
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster C New Products and Services POSTER Sun-P-8-3-16 617 Real-Time Presentation Tracking Using Semantic Keyword Spotting Reza Asadi, Harriet J. Fell, Timothy Bickmore, Ha Trinh
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster D Low Resource Speech Recognition POSTER Sun-P-8-4-1 1319 Deriving Phonetic Transcriptions and Discovering Word Segmentations for Speech-to-Speech Translation in Low-Resource Settings Andrew Wilkinson, Tiancheng Zhao, Alan W. Black
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster D Low Resource Speech Recognition POSTER Sun-P-8-4-2 919 Unsupervised Joint Estimation of Grapheme-to-Phoneme Conversion Systems and Acoustic Model Adaptation for Non-Native Speech Recognition Satoshi Tsujioka, Sakriani Sakti, Koichiro Yoshino, Graham Neubig, Satoshi Nakamura
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster D Low Resource Speech Recognition POSTER Sun-P-8-4-3 537 Learning Personalized Pronunciations for Contact Name Recognition Antoine Bruguier, Fuchun Peng, Françoise Beaufays
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster D Low Resource Speech Recognition POSTER Sun-P-8-4-4 1375 Generation and Pruning of Pronunciation Variants to Improve ASR Accuracy Zhenhao Ge, Aravind Ganapathiraju, Ananth N. Iyer, Scott A. Randal, Felix I. Wyss
Sunday, 11 September 2016 16:00 Pacific Concourse - Poster D Low Resource Speech Recognition POSTER Sun-P-8-4-5 1364 Optimizing Speech Recognition Evaluation Using Stratified Sampling Janne Pylkkönen, Thomas Drugman, Max Bisani
Monday, 12 September 2016 08:30 - 9:30 Grand Ballroom ABC Keynote 4: Dan Jurafsky ORAL Mon-Keynote-4 3004 Ketchup, Interdisciplinarity, and the Spread of Innovation in Speech and Language Processing Dan Jurafsky
Monday, 12 September 2016 10:00 - 12:00 Grand Ballroom A Special Event: Speech Ventures   Mon-SE-3 4004 Speech Ventures Nicolas Scheffer, Korbinian Riedhammer, Alex Lebrun, David Suendermann-Oeft
Monday, 12 September 2016 10:00 Grand Ballroom BC Special Session: Speech and Language Technologies for Human-Machine Conversation-Based Language Education ORAL Mon-O-9-2-1 289 Context Aware Mispronunciation Detection for Mandarin Pronunciation Training Rong Tong, Nancy F. Chen, Bin Ma, Haizhou Li
Monday, 12 September 2016 10:15 Grand Ballroom BC Special Session: Speech and Language Technologies for Human-Machine Conversation-Based Language Education ORAL Mon-O-9-2-2 1457 DNN Online with iVectors Acoustic Modeling and Doc2Vec Distributed Representations for Improving Automated Speech Scoring Jidong Tao, Lei Chen, Chong Min Lee
Monday, 12 September 2016 10:30 Grand Ballroom BC Special Session: Speech and Language Technologies for Human-Machine Conversation-Based Language Education ORAL Mon-O-9-2-3 291 Self-Adaptive DNN for Improving Spoken Language Proficiency Assessment Yao Qian, Xinhao Wang, Keelan Evanini, David Suendermann-Oeft
Monday, 12 September 2016 10:45 Grand Ballroom BC Special Session: Speech and Language Technologies for Human-Machine Conversation-Based Language Education ORAL Mon-O-9-2-4 517 Detecting Mispronunciations of L2 Learners and Providing Corrective Feedback Using Knowledge-Guided and Data-Driven Decision Trees Wei Li, Kehuang Li, Sabato Marco Siniscalchi, Nancy F. Chen, Chin-Hui Lee
Monday, 12 September 2016 11:00 Grand Ballroom BC Special Session: Speech and Language Technologies for Human-Machine Conversation-Based Language Education ORAL Mon-O-9-2-5 663 Phoneme Set Design Considering Integrated Acoustic and Linguistic Features of Second Language Speech Xiaoyun Wang, Tsuneo Kato, Seiichi Yamamoto
Monday, 12 September 2016 11:15 Grand Ballroom BC Special Session: Speech and Language Technologies for Human-Machine Conversation-Based Language Education ORAL Mon-O-9-2-6 750 HMM-Based Non-Native Accent Assessment Using Posterior Features Ramya Rasipuram, Milos Cernak, Mathew Magimai-Doss
Monday, 12 September 2016 11:30 Grand Ballroom BC Special Session: Speech and Language Technologies for Human-Machine Conversation-Based Language Education ORAL Mon-O-9-2-7 915 Automatic Assessment and Error Detection of Shadowing Speech: Case of English Spoken by Japanese Learners Shuju Shi, Yosuke Kashiwagi, Shohei Toyama, Junwei Yue, Yutaka Yamauchi, Daisuke Saito, Nobuaki Minematsu
Monday, 12 September 2016 10:00 Bayview A Phonation and Voice Quality ORAL Mon-O-9-3-1 1101 Multiplicity of the Acoustic Correlates of the Fortis-Lenis Contrast: Plosives in Aberystwyth English Míša Hejná
Monday, 12 September 2016 10:20 Bayview A Phonation and Voice Quality ORAL Mon-O-9-3-2 893 Automatic Measurement of Voice Onset Time and Prevoicing Using Recurrent Neural Networks Yossi Adi, Joseph Keshet, Olga Dmitrieva, Matt Goldrick
Monday, 12 September 2016 10:40 Bayview A Phonation and Voice Quality ORAL Mon-O-9-3-3 954 L1-L2 Interference: The Case of Final Devoicing of French Voiced Fricatives in Final Position by German Learners Sucheta Ghosh, Camille Fauth, Aghilas Sini, Yves Laprie
Monday, 12 September 2016 11:00 Bayview A Phonation and Voice Quality ORAL Mon-O-9-3-4 1160 Perceptual Salience of Voice Source Parameters in Signaling Focal Prominence Irena Yanushevskaya, Andy Murphy, Christer Gobl, Ailbhe Ní Chasaide
Monday, 12 September 2016 11:20 Bayview A Phonation and Voice Quality ORAL Mon-O-9-3-5 1194 Classification of Voice Modality Using Electroglottogram Waveforms Michal Borsky, Daryush D. Mehta, Julius P. Gudjohnsen, Jon Gudnason
Monday, 12 September 2016 11:40 Bayview A Phonation and Voice Quality ORAL Mon-O-9-3-6 1309 Voice-Quality Difference Between the Vowels in Filled Pauses and Ordinary Lexical Items Kikuo Maekawa, Hiroki Mori
Monday, 12 September 2016 10:00 Bayview B Speech Synthesis Oral: Prosody and Expressive Speech ORAL Mon-O-9-4-1 815 Generation of Emotion Control Vector Using MDS-Based Space Transformation for Expressive Speech Synthesis Yan-You Chen, Chung-Hsien Wu, Yu-Fong Huang
Monday, 12 September 2016 10:20 Bayview B Speech Synthesis Oral: Prosody and Expressive Speech ORAL Mon-O-9-4-2 979 Direct Expressive Voice Training Based on Semantic Selection Igor Jauk, Antonio Bonafonte
Monday, 12 September 2016 10:40 Bayview B Speech Synthesis Oral: Prosody and Expressive Speech ORAL Mon-O-9-4-3 1034 Syllable-Level Representations of Suprasegmental Features for DNN-Based Text-to-Speech Synthesis Manuel Sam Ribeiro, Oliver Watts, Junichi Yamagishi
Monday, 12 September 2016 11:00 Bayview B Speech Synthesis Oral: Prosody and Expressive Speech ORAL Mon-O-9-4-4 752 Pause Prediction from Text for Speech Synthesis with User-Definable Pause Insertion Likelihood Threshold Norbert Braunschweiler, Ranniery Maia
Monday, 12 September 2016 11:20 Bayview B Speech Synthesis Oral: Prosody and Expressive Speech ORAL Mon-O-9-4-5 930 A Hybrid System for Continuous Word-Level Emphasis Modeling Based on HMM State Clustering and Adaptive Training Quoc Truong Do, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura
Monday, 12 September 2016 11:40 Bayview B Speech Synthesis Oral: Prosody and Expressive Speech ORAL Mon-O-9-4-6 1060 Improving Prosodic Boundaries Prediction for Mandarin Speech Synthesis by Using Enhanced Embedding Feature and Model Fusion Approach Yibin Zheng, Ya Li, Zhengqi Wen, Xingguang Ding, Jianhua Tao
Monday, 12 September 2016 10:00 Seacliff BCD Language Recognition ORAL Mon-O-9-5-1 169 Results of The 2015 NIST Language Recognition Evaluation Hui Zhao, Désiré Bansé, George Doddington, Craig Greenberg, Jaime Hernández-Cordero, John Howard, Lisa Mason, Alvin Martin, Douglas Reynolds, Elliot Singer, Audrey Tong
Monday, 12 September 2016 10:20 Seacliff BCD Language Recognition ORAL Mon-O-9-5-2 624 The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS Kong Aik Lee, Haizhou Li, Li Deng, Ville Hautamäki, Wei Rao, Xiong Xiao, Anthony Larcher, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Aleksandr Sizov, Jianshu Chen, Ivan Kukanov, Amir Hossein Poorjam, Trung Ngo Trong, Cheng-Lin Xu, Haihua Xu, Bin Ma, Eng Siong Chng, Sylvain Meignier
Monday, 12 September 2016 10:40 Seacliff BCD Language Recognition ORAL Mon-O-9-5-3 722 Pair-Wise Distance Metric Learning of Neural Network Model for Spoken Language Identification Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai
Monday, 12 September 2016 11:00 Seacliff BCD Language Recognition ORAL Mon-O-9-5-4 293 Non-Iterative Parameter Estimation for Total Variability Model Using Randomized Singular Value Decomposition Ruchir Travadi, Shrikanth S. Narayanan
Monday, 12 September 2016 11:20 Seacliff BCD Language Recognition ORAL Mon-O-9-5-5 1334 Stacked Long-Term TDNN for Spoken Language Recognition Daniel Garcia-Romero, Alan McCree
Monday, 12 September 2016 11:40 Seacliff BCD Language Recognition ORAL Mon-O-9-5-6 180 A Divide-and-Conquer Approach for Language Identification Based on Recurrent Neural Networks G. Gelly, Jean-Luc Gauvain, V.B. Le, A. Messaoudi
Monday, 12 September 2016 10:00 Seacliff A Spoken Language Understanding Systems ORAL Mon-O-9-6-1 1171 Context-Sensitive and Role-Dependent Spoken Language Understanding Using Bidirectional and Attention LSTMs Chiori Hori, Takaaki Hori, Shinji Watanabe, John R. Hershey
Monday, 12 September 2016 10:20 Seacliff A Spoken Language Understanding Systems ORAL Mon-O-9-6-2 1301 A Step Beyond Local Observations with a Dialog Aware Bidirectional GRU Network for Spoken Language Understanding Vedran Vukotić, Christian Raymond, Guillaume Gravier
Monday, 12 September 2016 10:40 Seacliff A Spoken Language Understanding Systems ORAL Mon-O-9-6-3 312 End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Language Understanding Yun-Nung Chen, Dilek Hakkani-Tür, Gokhan Tur, Jianfeng Gao, Li Deng
Monday, 12 September 2016 11:00 Seacliff A Spoken Language Understanding Systems ORAL Mon-O-9-6-4 395 Sequential Convolutional Neural Networks for Slot Filling in Spoken Language Understanding Ngoc Thang Vu
Monday, 12 September 2016 11:20 Seacliff A Spoken Language Understanding Systems ORAL Mon-O-9-6-5 512 A New Pre-Training Method for Training Deep Learning Models with Application to Spoken Language Understanding Asli Celikyilmaz, Ruhi Sarikaya, Dilek Hakkani-Tür, Xiaohu Liu, Nikhil Ramesh, Gokhan Tur
Monday, 12 September 2016 11:40 Seacliff A Spoken Language Understanding Systems ORAL Mon-O-9-6-6 851 Joint Syntactic and Semantic Analysis with a Multitask Deep Learning Framework for Spoken Language Understanding Jeremie Tafforeau, Frederic Bechet, Thierry Artiere, Benoit Favre
Monday, 12 September 2016 10:00 Pacific Concourse - Poster A Language Recognition POSTER Mon-P-9-1-1 1584 Exploiting Hidden-Layer Responses of Deep Neural Networks for Language Recognition Ruizhi Li, Sri Harish Mallidi, Lukáš Burget, Oldřich Plchot, Najim Dehak
Monday, 12 September 2016 10:00 Pacific Concourse - Poster A Language Recognition POSTER Mon-P-9-1-2 558 Out of Set Language Modelling in Hierarchical Language Identification Saad Irtza, Vidhyasaharan Sethu, Sarith Fernando, Eliathamby Ambikairajah, Haizhou Li
Monday, 12 September 2016 10:00 Pacific Concourse - Poster A Language Recognition POSTER Mon-P-9-1-3 719 Language Identification Based on Generative Modeling of Posteriorgram Sequences Extracted from Frame-by-Frame DNNs and LSTM-RNNs Ryo Masumura, Taichi Asami, Hirokazu Masataki, Yushi Aono, Sumitaka Sakauchi
Monday, 12 September 2016 10:00 Pacific Concourse - Poster A Language Recognition POSTER Mon-P-9-1-4 684 Gating Recurrent Enhanced Memory Neural Networks on Language Identification Wang Geng, Yuanyuan Zhao, Wenfu Wang, Xinyuan Cai, Bo Xu
Monday, 12 September 2016 10:00 Pacific Concourse - Poster A Language Recognition POSTER Mon-P-9-1-5 764 Sequence Summarizing Neural Networks for Spoken Language Recognition Jan Pešán, Lukáš Burget, Jan Černocký
Monday, 12 September 2016 10:00 Pacific Concourse - Poster A Language Recognition POSTER Mon-P-9-1-6 1585 The Role of Spectral Resolution in Foreign-Accented Speech Perception Michelle R. Kapolowicz, Vahid Montazeri, Peter F. Assmann
Monday, 12 September 2016 10:00 Pacific Concourse - Poster A Language Recognition POSTER Mon-P-9-1-7 791 THU-EE System Description for NIST LRE 2015 Liang He, Yao Tian, Yi Liu, Jiaming Xu, Weiwei Liu, Cai Meng, Jia Liu
Monday, 12 September 2016 10:00 Pacific Concourse - Poster A Language Recognition POSTER Mon-P-9-1-8 1438 Variation in Spoken North Sami Language Kristiina Jokinen, Trung Ngo Trong, Ville Hautamäki
Monday, 12 September 2016 10:00 Pacific Concourse - Poster B Music, Audio, and Source Separation POSTER Mon-P-9-2-1 1236 Improved Music Genre Classification with Convolutional Neural Networks Weibin Zhang, Wenkang Lei, Xiangmin Xu, Xiaofeng Xing
Monday, 12 September 2016 10:00 Pacific Concourse - Poster B Music, Audio, and Source Separation POSTER Mon-P-9-2-2 856 Enhanced Harmonic Content and Vocal Note Based Predominant Melody Extraction from Vocal Polyphonic Music Signals Gurunath Reddy M., K. Sreenivasa Rao
Monday, 12 September 2016 10:00 Pacific Concourse - Poster B Music, Audio, and Source Separation POSTER Mon-P-9-2-3 551 Long Short-Term Memory for Speaker Generalization in Supervised Speech Separation Jitong Chen, DeLiang Wang
Monday, 12 September 2016 10:00 Pacific Concourse - Poster B Music, Audio, and Source Separation POSTER Mon-P-9-2-4 131 Phonotactic Language Identification for Singing Anna M. Kruspe
Monday, 12 September 2016 10:00 Pacific Concourse - Poster B Music, Audio, and Source Separation POSTER Mon-P-9-2-5 1025 Comparing the Influence of Spectro-Temporal Integration in Computational Speech Segregation Thomas Bentsen, Tobias May, Abigail A. Kressner, Torsten Dau
Monday, 12 September 2016 10:00 Pacific Concourse - Poster B Music, Audio, and Source Separation POSTER Mon-P-9-2-6 1449 Blind Speech Separation with GCC-NMF Sean U.N. Wood, Jean Rouat
Monday, 12 September 2016 10:00 Pacific Concourse - Poster B Music, Audio, and Source Separation POSTER Mon-P-9-2-7 1555 Effects of Cochlear Hearing Loss on the Benefits of Ideal Binary Masking Vahid Montazeri, Shaikat Hossain, Peter F. Assmann
Monday, 12 September 2016 10:00 Pacific Concourse - Poster B Music, Audio, and Source Separation POSTER Mon-P-9-2-8 216 Combining Mask Estimates for Single Channel Audio Source Separation Using Deep Neural Networks Emad M. Grais, Gerard Roma, Andrew J.R. Simpson, Mark D. Plumbley
Monday, 12 September 2016 10:00 Pacific Concourse - Poster B Music, Audio, and Source Separation POSTER Mon-P-9-2-9 252 Monaural Source Separation Using a Random Forest Classifier Cosimo Riday, Saurabh Bhargava, Richard H.R. Hahnloser, Shih-Chii Liu
Monday, 12 September 2016 10:00 Pacific Concourse - Poster B Music, Audio, and Source Separation POSTER Mon-P-9-2-10 321 Adaptive Group Sparsity for Non-Negative Matrix Factorization with Application to Unsupervised Source Separation Xu Li, Ziteng Wang, Xiaofei Wang, Qiang Fu, Yonghong Yan
Monday, 12 September 2016 10:00 Pacific Concourse - Poster B Music, Audio, and Source Separation POSTER Mon-P-9-2-11 1063 A Robust Dual-Microphone Speech Source Localization Algorithm for Reverberant Environments Yanmeng Guo, Xiaofei Wang, Chao Wu, Qiang Fu, Ning Ma, Guy J. Brown
Monday, 12 September 2016 10:00 Pacific Concourse - Poster B Music, Audio, and Source Separation POSTER Mon-P-9-2-12 1149 Speech Localisation in a Multitalker Mixture by Humans and Machines Ning Ma, Guy J. Brown
Monday, 12 September 2016 10:00 Pacific Concourse - Poster B Music, Audio, and Source Separation POSTER Mon-P-9-2-13 575 Reverberation-Robust One-Bit TDOA Based Moving Source Localization for Automatic Camera Steering Harshavardhan Sundar, Gokul Deepak Manavalan, T.V. Sreenivas, Chandra Sekhar Seelamantula
Monday, 12 September 2016 10:00 Pacific Concourse - Poster B Music, Audio, and Source Separation POSTER Mon-P-9-2-14 758 Multi-Talker Speech Recognition Based on Blind Source Separation with ad hoc Microphone Array Using Smartphones and Cloud Storage Keiko Ochi, Nobutaka Ono, Shigeki Miyabe, Shoji Makino
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-1 823 Phase-Aware Signal Processing for Automatic Speech Recognition Johannes Fahringer, Tobias Schrank, Johannes Stahl, Pejman Mowlaee, Franz Pernkopf
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-2 812 Unsupervised Deep Auditory Model Using Stack of Convolutional RBMs for Speech Recognition Hardik B. Sailor, Hemant A. Patil
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-3 124 Interpretation of Low Dimensional Neural Network Bottleneck Features in Terms of Human Perception and Production Philip Weber, Linxue Bai, Martin Russell, Peter Jančovič, Stephen Houghton
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-4 121 Compact Feedforward Sequential Memory Networks for Large Vocabulary Continuous Speech Recognition Shiliang Zhang, Hui Jiang, Shifu Xiong, Si Wei, Li-Rong Dai
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-5 185 Future Context Attention for Unidirectional LSTM Based Acoustic Model Jian Tang, Shiliang Zhang, Si Wei, Li-Rong Dai
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-6 192 Hybrid Accelerated Optimization for Speech Recognition Jen-Tzung Chien, Pei-Wen Huang, Tan Lee
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-7 334 On Online Attention-Based Speech Recognition and Joint Mandarin Character-Pinyin Training William Chan, Ian Lane
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-8 391 GMM-Free Flat Start Sequence-Discriminative DNN Training Gábor Gosztolya, Tamás Grósz, László Tóth
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-9 412 Open-Domain Audio-Visual Speech Recognition: A Deep Learning Approach Yajie Miao, Florian Metze
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-10 677 Multidimensional Residual Learning Based on Recurrent Neural Networks for Acoustic Modeling Yuanyuan Zhao, Shuang Xu, Bo Xu
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-11 759 Towards Online-Recognition with Deep Bidirectional LSTM Acoustic Models Albert Zeyer, Ralf Schlüter, Hermann Ney
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-12 1033 Advances in Very Deep Convolutional Neural Networks for LVCSR Tom Sercu, Vaibhava Goel
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-13 1495 Acoustic Modelling from the Signal Domain Using CNNs Pegah Ghahremani, Vimal Manohar, Daniel Povey, Sanjeev Khudanpur
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-14 1190 Distilling Knowledge from Ensembles of Neural Networks for Speech Recognition Yevgen Chebotar, Austin Waters
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-15 1300 Triphone State-Tying via Deep Canonical Correlation Analysis Weiran Wang, Hao Tang, Karen Livescu
Monday, 12 September 2016 10:00 Pacific Concourse - Poster C Acoustic Modeling with Neural Networks POSTER Mon-P-9-3-16 1279 Low-Rank Representation of Nearest Neighbor Posterior Probabilities to Enhance DNN Based Acoustic Modeling Gil Luyet, Pranay Dighe, Afsaneh Asaei, Hervé Bourlard
Monday, 12 September 2016 10:00 Pacific Concourse - Poster D Robustness and Adaptation POSTER Mon-P-9-4-1 378 Improving Large Vocabulary Accented Mandarin Speech Recognition with Attribute-Based I-Vectors Hao Zheng, Shanshan Zhang, Liwei Qiao, Jianping Li, Wenju Liu
Monday, 12 September 2016 10:00 Pacific Concourse - Poster D Robustness and Adaptation POSTER Mon-P-9-4-2 1020 Pitch-Adaptive Front-End Features for Robust Children’s ASR S. Shahnawazuddin, Abhishek Dey, Rohit Sinha
Monday, 12 September 2016 10:00 Pacific Concourse - Poster D Robustness and Adaptation POSTER Mon-P-9-4-3 1142 ASR Confidence Estimation with Speaker-Adapted Recurrent Neural Networks Miguel Ángel del-Agua, Santiago Piqueras, Adrià Giménez, Alberto Sanchis, Jorge Civera, Alfons Juan
Monday, 12 September 2016 10:00 Pacific Concourse - Poster D Robustness and Adaptation POSTER Mon-P-9-4-4 299 Automatic Correction of ASR Outputs by Using Machine Translation Luis Fernando D’Haro, Rafael E. Banchs
Monday, 12 September 2016 10:00 Pacific Concourse - Poster D Robustness and Adaptation POSTER Mon-P-9-4-5 619 A Framework for Practical Multistream ASR Sri Harish Mallidi, Hynek Hermansky
Monday, 12 September 2016 10:00 Pacific Concourse - Poster D Robustness and Adaptation POSTER Mon-P-9-4-6 904 DNNs for Unsupervised Extraction of Pseudo FMLLR Features Without Explicit Adaptation Data Neethu Mariam Joy, Murali Karthick Baskar, S. Umesh, Basil Abraham
Monday, 12 September 2016 10:00 Pacific Concourse - Poster D Robustness and Adaptation POSTER Mon-P-9-4-7 1233 Multi-Attribute Factorized Hidden Layer Adaptation for DNN Acoustic Models Lahiru Samarakoon, Khe Chai Sim
Monday, 12 September 2016 10:00 Pacific Concourse - Poster D Robustness and Adaptation POSTER Mon-P-9-4-8 819 Speaker Normalization Through Feature Shifting of Linearly Transformed i-Vector Jahyun Goo, Younggwan Kim, Hyungjun Lim, Hoirin Kim
Monday, 12 September 2016 12:15 - 13:00 Grand Ballroom A Special Event: Computational Approaches to Linguistic Code Switching   Mon-SE-4 4005 Computational Approaches to Linguistic Code Switching Mona Diab, Pascale Fung, Julia Hirschberg, Thamar Solorio
Monday, 12 September 2016 13:30 Grand Ballroom A Neural Networks for Language Modeling ORAL Mon-O-10-1-1 1239 Compositional Neural Network Language Models for Agglutinative Languages Ebru Arisoy, Murat Saraclar
Monday, 12 September 2016 13:50 Grand Ballroom A Neural Networks for Language Modeling ORAL Mon-O-10-1-2 1295 NN-Grams: Unifying Neural Network and n-Gram Language Models for Speech Recognition Babak Damavandi, Shankar Kumar, Noam Shazeer, Antoine Bruguier
Monday, 12 September 2016 14:10 Grand Ballroom A Neural Networks for Language Modeling ORAL Mon-O-10-1-3 375 Recurrent Neural Network Language Model with Incremental Updated Context Information Generated Using Bag-of-Words Representation Md. Akmal Haidar, Mikko Kurimo
Monday, 12 September 2016 14:30 Grand Ballroom A Neural Networks for Language Modeling ORAL Mon-O-10-1-4 422 Sequential Recurrent Neural Networks for Language Modeling Youssef Oualil, Clayton Greenberg, Mittul Singh, Dietrich Klakow
Monday, 12 September 2016 14:50 Grand Ballroom A Neural Networks for Language Modeling ORAL Mon-O-10-1-5 44 Word-Phrase-Entity Recurrent Neural Networks for Language Modeling Michael Levit, Sarangarajan Parthasarathy, Shuangyu Chang
Monday, 12 September 2016 15:10 Grand Ballroom A Neural Networks for Language Modeling ORAL Mon-O-10-1-6 491 LSTM, GRU, Highway and a Bit of Attention: An Empirical Overview for Language Modeling in Speech Recognition Kazuki Irie, Zoltán Tüske, Tamer Alkhouli, Ralf Schlüter, Hermann Ney
Monday, 12 September 2016 13:30 Grand Ballroom BC Special Session: Sub-Saharan African Languages: From Speech Fundamentals to Applications ORAL Mon-O-10-2-1 657 Automatic Speech Recognition Using Probabilistic Transcriptions in Swahili, Amharic, and Dinka Amit Das, Preethi Jyothi, Mark Hasegawa-Johnson
Monday, 12 September 2016 13:45 Grand Ballroom BC Special Session: Sub-Saharan African Languages: From Speech Fundamentals to Applications ORAL Mon-O-10-2-2 461 Speed Perturbation and Vowel Duration Modeling for ASR in Hausa and Wolof Languages Elodie Gauthier, Laurent Besacier, Sylvie Voisin
Monday, 12 September 2016 14:00 Grand Ballroom BC Special Session: Sub-Saharan African Languages: From Speech Fundamentals to Applications ORAL Mon-O-10-2-3 1412 Improving the Lwazi ASR Baseline Charl van Heerden, Neil Kleynhans, Marelie Davel
Monday, 12 September 2016 14:15 Grand Ballroom BC Special Session: Sub-Saharan African Languages: From Speech Fundamentals to Applications ORAL Mon-O-10-2-4 886 Preliminary Experiments on Unsupervised Word Discovery in Mboshi Pierre Godard, Gilles Adda, Martine Adda-Decker, Alexandre Allauzen, Laurent Besacier, Hélène Bonneau-Maynard, Guy-Noël Kouarata, Kevin Löser, Annie Rialland, François Yvon
Monday, 12 September 2016 14:30 Grand Ballroom BC Special Session: Sub-Saharan African Languages: From Speech Fundamentals to Applications ORAL Mon-O-10-2-5 1440 Unsupervised Phoneme Segmentation of Previously Unseen Languages Marco Vetter, Markus Müller, Fatima Hamlaoui, Graham Neubig, Satoshi Nakamura, Sebastian Stüker, Alex Waibel
Monday, 12 September 2016 14:45 Grand Ballroom BC Special Session: Sub-Saharan African Languages: From Speech Fundamentals to Applications ORAL Mon-O-10-2-6 796 CNN-Based Phone Segmentation Experiments in a Less-Represented Language Céline Manenti, Thomas Pellegrini, Julien Pinquier
Monday, 12 September 2016 15:00 Grand Ballroom BC Special Session: Sub-Saharan African Languages: From Speech Fundamentals to Applications ORAL Mon-O-10-2-7 1040 Part-of-Speech Tagging and Chunking in Text-to-Speech Synthesis for South African Languages Georg I. Schlünz, Nkosikhona Dlamini, Rynhardt P. Kruger
Monday, 12 September 2016 15:15 Grand Ballroom BC Special Session: Sub-Saharan African Languages: From Speech Fundamentals to Applications ORAL Mon-O-10-2-8 820 The Effect of Postlexical Deletion on Automatic Speech Recognition in Fast Spontaneously Spoken Zulu Ewald van der Westhuizen, Thomas Niesler
Monday, 12 September 2016 13:30 Bayview A Speech Production Models ORAL Mon-O-10-3-1 1499 A New Model of Speech Motor Control Based on Task Dynamics and State Feedback Vikram Ramanarayanan, Benjamin Parrell, Louis Goldstein, Srikantan Nagarajan, John Houde
Monday, 12 September 2016 13:50 Bayview A Speech Production Models ORAL Mon-O-10-3-2 1500 Using a Biomechanical Model and Articulatory Data for the Numerical Production of Vowels Saeed Dabbaghchian, Marc Arnela, Olov Engwall, Oriol Guasch, Ian Stavness, Pierre Badin
Monday, 12 September 2016 14:10 Bayview A Speech Production Models ORAL Mon-O-10-3-3 1513 A New Model for Acoustic Wave Propagation and Scattering in the Vocal Tract Jianguo Wei, Wendan Guan, Darcy Q. Hou, Dingyi Pan, Wenhuan Lu, Jianwu Dang
Monday, 12 September 2016 14:30 Bayview A Speech Production Models ORAL Mon-O-10-3-4 1579 Uncontrolled Manifolds in Vowel Production: Assessment with a Biomechanical Model of the Tongue Andrew Szabados, Pascal Perrier
Monday, 12 September 2016 14:50 Bayview A Speech Production Models ORAL Mon-O-10-3-5 1597 Experimental Validation of Sound Generated from Flow in Simplified Vocal Tract Model of Sibilant /s/ Tsukasa Yoshinaga, Kazunori Nozaki, Shigeo Wada
Monday, 12 September 2016 15:10 Bayview A Speech Production Models ORAL Mon-O-10-3-6 441 Bayesian Modeling in Speech Motor Control: A Principled Structure for the Integration of Various Constraints Jean-François Patri, Pascal Perrier, Julien Diard
Monday, 12 September 2016 13:30 Bayview B Speaker States and Traits ORAL Mon-O-10-4-1 998 Facing Realism in Spontaneous Emotion Recognition from Speech: Feature Enhancement by Autoencoder with LSTM Neural Networks Zixing Zhang, Fabien Ringeval, Jing Han, Jun Deng, Erik Marchi, Björn Schuller
Monday, 12 September 2016 13:50 Bayview B Speaker States and Traits ORAL Mon-O-10-4-2 429 Defining Emotionally Salient Regions Using Qualitative Agreement Method Srinivas Parthasarathy, Carlos Busso
Monday, 12 September 2016 14:10 Bayview B Speaker States and Traits ORAL Mon-O-10-4-3 692 Representation Learning for Speech Emotion Recognition Sayan Ghosh, Eugene Laksana, Louis-Philippe Morency, Stefan Scherer
Monday, 12 September 2016 14:30 Bayview B Speaker States and Traits ORAL Mon-O-10-4-4 645 Multilingual Speech Emotion Recognition System Based on a Three-Layer Model Xingfeng Li, Masato Akagi
Monday, 12 September 2016 14:50 Bayview B Speaker States and Traits ORAL Mon-O-10-4-5 1557 Analysis of Multi-Lingual Emotion Recognition Using Auditory Attention Features Ozlem Kalinli
Monday, 12 September 2016 15:10 Bayview B Speaker States and Traits ORAL Mon-O-10-4-6 868 On the Correlation and Transferability of Features Between Automatic Speech Recognition and Speech Emotion Recognition Haytham M. Fayek, Margaret Lech, Lawrence Cavedon
Monday, 12 September 2016 13:30 Seacliff BCD Speaker Recognition ORAL Mon-O-10-5-1 1115 On the Influence of Text Content on Pass-Phrase Strength for Short-Duration Text-Dependent Automatic Speaker Authentication Giacomo Valenti, Adrien Daniel, Nicholas Evans
Monday, 12 September 2016 13:50 Seacliff BCD Speaker Recognition ORAL Mon-O-10-5-2 1140 Articulation Rate Filtering of CQCC Features for Automatic Speaker Verification Massimiliano Todisco, Héctor Delgado, Nicholas Evans
Monday, 12 September 2016 14:10 Seacliff BCD Speaker Recognition ORAL Mon-O-10-5-3 1159 The IBM Speaker Recognition System: Recent Advances and Error Analysis Seyed Omid Sadjadi, Jason W. Pelecanos, Sriram Ganapathy
Monday, 12 September 2016 14:30 Seacliff BCD Speaker Recognition ORAL Mon-O-10-5-4 1292 Probabilistic Approach Using Joint Clean and Noisy i-Vectors Modeling for Speaker Recognition Waad Ben Kheder, Driss Matrouf, Moez Ajili, Jean-François Bonastre
Monday, 12 September 2016 14:50 Seacliff BCD Speaker Recognition ORAL Mon-O-10-5-5 1523 Generalized Discriminant Analysis (GDA) for Improved i-Vector Based Speaker Recognition Fahimeh Bahmaninezhad, John H.L. Hansen
Monday, 12 September 2016 15:10 Seacliff BCD Speaker Recognition ORAL Mon-O-10-5-6 548 Noise and Metadata Sensitive Bottleneck Features for Improving Speaker Recognition with Non-Native Speech Input Yao Qian, Jidong Tao, David Suendermann-Oeft, Keelan Evanini, Alexei V. Ivanov, Vikram Ramanarayanan
Monday, 12 September 2016 13:30 Seacliff A VAD and Audio Events ORAL Mon-O-10-6-1 123 Robust Audio Event Recognition with 1-Max Pooling Convolutional Neural Networks Huy Phan, Lars Hertel, Marco Maass, Alfred Mertins
Monday, 12 September 2016 13:50 Seacliff A VAD and Audio Events ORAL Mon-O-10-6-2 839 Audio-Based Distributional Representations of Meaning Using a Fusion of Feature Encodings Giannis Karamanolakis, Elias Iosif, Athanasia Zlatintsi, Aggelos Pikrakis, Alexandros Potamianos
Monday, 12 September 2016 14:10 Seacliff A VAD and Audio Events ORAL Mon-O-10-6-3 136 Robust DNN-Based VAD Augmented with Phone Entropy Based Rejection of Background Speech Yuya Fujita, Ken-ichi Iso
Monday, 12 September 2016 14:30 Seacliff A VAD and Audio Events ORAL Mon-O-10-6-4 268 Feature Learning with Raw-Waveform CLDNNs for Voice Activity Detection Ruben Zazo, Tara N. Sainath, Gabor Simko, Carolina Parada
Monday, 12 September 2016 14:50 Seacliff A VAD and Audio Events ORAL Mon-O-10-6-5 550 The SRI System for the NIST OpenSAD 2015 Speech Activity Detection Evaluation Martin Graciarena, Luciana Ferrer, Vikramjit Mitra
Monday, 12 September 2016 15:10 Seacliff A VAD and Audio Events ORAL Mon-O-10-6-6 603 Model Adaptation and Active Learning in the BBN Speech Activity Detection System for the DARPA RATS Program Damianos Karakos, Scott Novotney, Le Zhang, Richard Schwartz
Monday, 12 September 2016 13:30 Pacific Concourse - Poster A Spoken Term Detection POSTER Mon-P-10-1-1 279 Fusion Strategies for Robust Speech Recognition and Keyword Spotting for Channel- and Noise-Degraded Speech Vikramjit Mitra, Julien VanHout, Wen Wang, Chris Bartels, Horacio Franco, Dimitra Vergyri, Abeer Alwan, Adam Janin, John H.L. Hansen, Richard M. Stern, Abhijeet Sangwan, Nelson Morgan
Monday, 12 September 2016 13:30 Pacific Concourse - Poster A Spoken Term Detection POSTER Mon-P-10-1-2 337 Recurrent Neural Network-Based Phoneme Sequence Estimation Using Multiple ASR Systems’ Outputs for Spoken Term Detection Naoki Sawada, Hiromitsu Nishizaki
Monday, 12 September 2016 13:30 Pacific Concourse - Poster A Spoken Term Detection POSTER Mon-P-10-1-3 489 Enhancing Data-Driven Phone Confusions Using Restricted Recognition Mark Kane, Julie Carson-Berndsen
Monday, 12 September 2016 13:30 Pacific Concourse - Poster A Spoken Term Detection POSTER Mon-P-10-1-4 53 Rapid Update of Multilingual Deep Neural Network for Low-Resource Keyword Search Chongjia Ni, Lei Wang, Cheung-Chi Leung, Feng Rao, Li Lu, Bin Ma, Haizhou Li
Monday, 12 September 2016 13:30 Pacific Concourse - Poster A Spoken Term Detection POSTER Mon-P-10-1-5 691 Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis Cheung-Chi Leung, Lei Wang, Haihua Xu, Jingyong Hou, Van Tung Pham, Hang Lv, Lei Xie, Xiong Xiao, Chongjia Ni, Bin Ma, Eng Siong Chng, Haizhou Li
Monday, 12 September 2016 13:30 Pacific Concourse - Poster B Speech Enhancement and Noise Reduction POSTER Mon-P-10-2-1 693 Novel Subband Autoencoder Features for Non-Intrusive Quality Assessment of Noise Suppressed Speech Meet H. Soni, Hemant A. Patil
Monday, 12 September 2016 13:30 Pacific Concourse - Poster B Speech Enhancement and Noise Reduction POSTER Mon-P-10-2-2 224 SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee
Monday, 12 September 2016 13:30 Pacific Concourse - Poster B Speech Enhancement and Noise Reduction POSTER Mon-P-10-2-3 151 A Novel Risk-Estimation-Theoretic Framework for Speech Enhancement in Nonstationary and Non-Gaussian Noise Conditions Jishnu Sadasivan, Chandra Sekhar Seelamantula
Monday, 12 September 2016 13:30 Pacific Concourse - Poster B Speech Enhancement and Noise Reduction POSTER Mon-P-10-2-4 307 Two-Stage Temporal Processing for Single-Channel Speech Enhancement Suman Samui, Indrajit Chakrabarti, Soumya Kanti Ghosh
Monday, 12 September 2016 13:30 Pacific Concourse - Poster B Speech Enhancement and Noise Reduction POSTER Mon-P-10-2-5 236 A Class-Specific Speech Enhancement for Phoneme Recognition: A Dictionary Learning Approach Nazreen P.M., A.G. Ramakrishnan, Prasanta Kumar Ghosh
Monday, 12 September 2016 13:30 Pacific Concourse - Poster B Speech Enhancement and Noise Reduction POSTER Mon-P-10-2-6 671 Robust Example Search Using Bottleneck Features for Example-Based Speech Enhancement Atsunori Ogawa, Shogo Seki, Keisuke Kinoshita, Marc Delcroix, Takuya Yoshioka, Tomohiro Nakatani, Kazuya Takeda
Monday, 12 September 2016 13:30 Pacific Concourse - Poster B Speech Enhancement and Noise Reduction POSTER Mon-P-10-2-7 88 Speech Enhancement in Multiple-Noise Conditions Using Deep Neural Networks Anurag Kumar, Dinei Florencio
Monday, 12 September 2016 13:30 Pacific Concourse - Poster B Speech Enhancement and Noise Reduction POSTER Mon-P-10-2-8 1284 Perception Optimized Deep Denoising AutoEncoders for Speech Enhancement Prashanth Gurunath Shivakumar, Panayiotis Georgiou
Monday, 12 September 2016 13:30 Pacific Concourse - Poster B Speech Enhancement and Noise Reduction POSTER Mon-P-10-2-9 928 HMM-Based Speech Enhancement Using Sub-Word Models and Noise Adaptation Akihiro Kato, Ben Milner
Monday, 12 September 2016 13:30 Pacific Concourse - Poster B Speech Enhancement and Noise Reduction POSTER Mon-P-10-2-10 1286 Semi-Supervised Joint Enhancement of Spectral and Cepstral Sequences of Noisy Speech Li Li, Hirokazu Kameoka, Takuya Higuchi, Hiroshi Saruwatari
Monday, 12 September 2016 13:30 Pacific Concourse - Poster B Speech Enhancement and Noise Reduction POSTER Mon-P-10-2-11 474 A priori SNR Estimation Using a Generalized Decision Directed Approach Aleksej Chinaev, Reinhold Haeb-Umbach
Monday, 12 September 2016 13:30 Pacific Concourse - Poster B Speech Enhancement and Noise Reduction POSTER Mon-P-10-2-12 147 A DNN-HMM Approach to Non-Negative Matrix Factorization Based Speech Enhancement Ziteng Wang, Xu Li, Xiaofei Wang, Qiang Fu, Yonghong Yan
Monday, 12 September 2016 13:30 Pacific Concourse - Poster B Speech Enhancement and Noise Reduction POSTER Mon-P-10-2-13 211 SNR-Aware Convolutional Neural Network Modeling for Speech Enhancement Szu-Wei Fu, Yu Tsao, Xugang Lu
Monday, 12 September 2016 13:30 Pacific Concourse - Poster B Speech Enhancement and Noise Reduction POSTER Mon-P-10-2-14 494 An Iterative Phase Recovery Framework with Phase Mask for Spectral Mapping with an Application to Speech Enhancement Kehuang Li, Bo Wu, Chin-Hui Lee
Monday, 12 September 2016 13:30 Pacific Concourse - Poster B Speech Enhancement and Noise Reduction POSTER Mon-P-10-2-15 772 A Novel Research to Artificial Bandwidth Extension Based on Deep BLSTM Recurrent Neural Networks and Exemplar-Based Sparse Representation Bin Liu, Jianhua Tao
Monday, 12 September 2016 13:30 Pacific Concourse - Poster C Far-Field, Robustness and Adaptation POSTER Mon-P-10-3-1 966 Coping with Unseen Data Conditions: Investigating Neural Net Architectures, Robust Features, and Information Fusion for Robust Speech Recognition Vikramjit Mitra, Horacio Franco
Monday, 12 September 2016 13:30 Pacific Concourse - Poster C Far-Field, Robustness and Adaptation POSTER Mon-P-10-3-2 1230 On the Use of Gaussian Mixture Model Framework to Improve Speaker Adaptation of Deep Neural Network Acoustic Models Natalia Tomashenko, Yuri Khokhlov, Yannick Estève
Monday, 12 September 2016 13:30 Pacific Concourse - Poster C Far-Field, Robustness and Adaptation POSTER Mon-P-10-3-3 1050 Analytical Assessment of Dual-Stream Merging for Noise-Robust ASR Louis ten Bosch, Bert Cranen, Yang Sun
Monday, 12 September 2016 13:30 Pacific Concourse - Poster C Far-Field, Robustness and Adaptation POSTER Mon-P-10-3-4 1028 Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition Erfan Loweimi, Jon Barker, Thomas Hain
Monday, 12 September 2016 13:30 Pacific Concourse - Poster C Far-Field, Robustness and Adaptation POSTER Mon-P-10-3-5 388 Joint Optimization of Denoising Autoencoder and DNN Acoustic Model Based on Multi-Target Learning for Noisy Speech Recognition Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara
Monday, 12 September 2016 13:30 Pacific Concourse - Poster C Far-Field, Robustness and Adaptation POSTER Mon-P-10-3-6 681 Optimization of Speech Enhancement Front-End with Speech Recognition-Level Criterion Takuya Higuchi, Takuya Yoshioka, Tomohiro Nakatani
Monday, 12 September 2016 13:30 Pacific Concourse - Poster C Far-Field, Robustness and Adaptation POSTER Mon-P-10-3-7 732 Factorized Linear Input Network for Acoustic Model Adaptation in Noisy Conditions Dung T. Tran, Marc Delroix, Atsunori Ogawa, Tomohiro Nakatani
Monday, 12 September 2016 13:30 Pacific Concourse - Poster C Far-Field, Robustness and Adaptation POSTER Mon-P-10-3-8 733 Data Augmentation Using Multi-Input Multi-Output Source Separation for Deep Neural Network Based Acoustic Modeling Yusuke Fujita, Ryoich Takashima, Takeshi Homma, Masahito Togami
Monday, 12 September 2016 13:30 Pacific Concourse - Poster C Far-Field, Robustness and Adaptation POSTER Mon-P-10-3-9 738 Microphone Distance Adaptation Using Cluster Adaptive Training for Robust Far Field Speech Recognition Animesh Prasad, Khe Chai Sim
Monday, 12 September 2016 13:30 Pacific Concourse - Poster C Far-Field, Robustness and Adaptation POSTER Mon-P-10-3-10 1482 An Investigation on the Use of i-Vectors for Robust ASR Dimitrios Dimitriadis, Samuel Thomas, Sriram Ganapathy
Monday, 12 September 2016 13:30 Pacific Concourse - Poster C Far-Field, Robustness and Adaptation POSTER Mon-P-10-3-11 98 The Sheffield Wargame Corpus - Day Two and Day Three Yulan Liu, Charles Fox, Madina Hasan, Thomas Hain
Monday, 12 September 2016 13:30 Pacific Concourse - Poster C Far-Field, Robustness and Adaptation POSTER Mon-P-10-3-12 326 Recurrent Models for Auditory Attention in Multi-Microphone Distant Speech Recognition Suyoun Kim, Ian Lane
Monday, 12 September 2016 13:30 Pacific Concourse - Poster C Far-Field, Robustness and Adaptation POSTER Mon-P-10-3-13 1625 Semi-Supervised Speaker Adaptation for In-Vehicle Speech Recognition with Deep Neural Networks Wonkyum Lee, Kyu J. Han, Ian Lane
Monday, 12 September 2016 13:30 Pacific Concourse - Poster D Low Resource Speech Recognition POSTER Mon-P-10-4-1 1596 Semi-Supervised Training in Deep Learning Acoustic Model Yan Huang, Yongqiang Wang, Yifan Gong
Monday, 12 September 2016 13:30 Pacific Concourse - Poster D Low Resource Speech Recognition POSTER Mon-P-10-4-2 598 Multilingual Data Selection for Low Resource Speech Recognition Samuel Thomas, Kartik Audhkhasi, Jia Cui, Brian Kingsbury, Bhuvana Ramabhadran
Monday, 12 September 2016 13:30 Pacific Concourse - Poster D Low Resource Speech Recognition POSTER Mon-P-10-4-3 655 An Investigation on Training Deep Neural Networks Using Probabilistic Transcriptions Amit Das, Mark Hasegawa-Johnson
Monday, 12 September 2016 13:30 Pacific Concourse - Poster D Low Resource Speech Recognition POSTER Mon-P-10-4-4 736 Analysis of Mismatched Transcriptions Generated by Humans and Machines for Under-Resourced Languages Van Hai Do, Nancy F. Chen, Boon Pang Lim, Mark Hasegawa-Johnson
Monday, 12 September 2016 13:30 Pacific Concourse - Poster D Low Resource Speech Recognition POSTER Mon-P-10-4-5 747 ASR for South Slavic Languages Developed in Almost Automated Way Jan Nouza, Radek Safarik, Petr Cerva
Monday, 12 September 2016 13:30 Pacific Concourse - Poster D Low Resource Speech Recognition POSTER Mon-P-10-4-6 1010 Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery Marzieh Razavi, Mathew Magimai-Doss
Monday, 12 September 2016 13:30 Pacific Concourse - Poster D Low Resource Speech Recognition POSTER Mon-P-10-4-7 1143 Language Adaptive DNNs for Improved Low Resource Speech Recognition Markus Müller, Sebastian Stüker, Alex Waibel
Monday, 12 September 2016 13:30 Pacific Concourse - Poster D Low Resource Speech Recognition POSTER Mon-P-10-4-8 1426 Improved Multilingual Training of Stacked Neural Network Acoustic Models for Low Resource Languages Tanel Alumäe, Stavros Tsakalidis, Richard Schwartz