An empirical study of the effect of acoustic-prosodic entrainment on the perceived trustworthiness of conversational avatars
Publication date: Available online 30 July 2020Source: Speech CommunicationAuthor(s): Ramiro H. Gálvez, Agustín Gravano, Štefan Beňuš, Rivka Levitan, Marian Trnka, Julia Hirschberg (Source: Speech Communication)
Source: Speech Communication - August 1, 2020 Category: Speech-Language Pathology Source Type: research

Multimodal Perception of Prominence in Spontaneous Speech: A Methodological Proposal using Mixed Models and AIC
Publication date: Available online 31 July 2020Source: Speech CommunicationAuthor(s): Miguel Jiménez-Bravo, Victoria Marrero-Aguiar (Source: Speech Communication)
Source: Speech Communication - August 1, 2020 Category: Speech-Language Pathology Source Type: research

Parallel Representation Learning for the Classification of Pathological Speech: Studies on Parkinson’s Disease and Cleft Lip and Palate
Publication date: Available online 29 July 2020Source: Speech CommunicationAuthor(s): J.C. Vasquez-Correa, T. Arias-Vergara, M. Schuster, J.R. Orozco-Arroyave, E. Nöth (Source: Speech Communication)
Source: Speech Communication - July 30, 2020 Category: Speech-Language Pathology Source Type: research

Analysis of glottal inverse filtering in the presence of source-filter interaction
Publication date: October 2020Source: Speech Communication, Volume 123Author(s): Anil Palaparthi, Ingo R. Titze (Source: Speech Communication)
Source: Speech Communication - July 25, 2020 Category: Speech-Language Pathology Source Type: research

Accuracy, recording interference, and articulatory quality of headsets for ultrasound recordings
Publication date: Available online 11 July 2020Source: Speech CommunicationAuthor(s): Michael Pucher, Nicola Klingler, Jan Luttenberger, Lorenzo Spreafico (Source: Speech Communication)
Source: Speech Communication - July 12, 2020 Category: Speech-Language Pathology Source Type: research

Enhancement of cleft palate speech using temporal and spectral processing
Publication date: Available online 9 July 2020Source: Speech CommunicationAuthor(s): Protima Nomo Sudro, S. R. Mahadeva Prasanna (Source: Speech Communication)
Source: Speech Communication - July 10, 2020 Category: Speech-Language Pathology Source Type: research

Vowels and tones as acoustic cues in Chinese subregional dialect identification
Publication date: Available online 3 July 2020Source: Speech CommunicationAuthor(s): Huangmei Liu, Jie Liang, Vincent J. van Heuven, Wilbert Heeringa (Source: Speech Communication)
Source: Speech Communication - July 3, 2020 Category: Speech-Language Pathology Source Type: research

Automatic intelligibility assessment of dysarthric speech using glottal parameters
Publication date: Available online 1 July 2020Source: Speech CommunicationAuthor(s): NP Narendra, Paavo Alku (Source: Speech Communication)
Source: Speech Communication - July 3, 2020 Category: Speech-Language Pathology Source Type: research

An Iterative Graph Spectral Subtraction Method for Speech Enhancement
Publication date: Available online 30 June 2020Source: Speech CommunicationAuthor(s): Xue Yan, Zhen Yang, Tingting Wang, Haiyan Guo (Source: Speech Communication)
Source: Speech Communication - July 1, 2020 Category: Speech-Language Pathology Source Type: research

Aoustical and perceptual characteristics of Mandarin consonants produced with an electrolarynx
Publication date: Available online 30 June 2020Source: Speech CommunicationAuthor(s): Ke Xiao, Bo Zhang, Supin Wang, Mingxi Wan, Liang Wu (Source: Speech Communication)
Source: Speech Communication - July 1, 2020 Category: Speech-Language Pathology Source Type: research

Significance of Spectral Cues in Automatic Speech Segmentation for Indian Language Speech Synthesizers
Publication date: Available online 27 June 2020Source: Speech CommunicationAuthor(s): Arun Baby, Jeena J. Prakash, Aswin Shanmugam Subramanian, Hema A. Murthy (Source: Speech Communication)
Source: Speech Communication - June 28, 2020 Category: Speech-Language Pathology Source Type: research

GEDI: Gammachirp Envelope Distortion Index for Predicting Intelligibility of Enhanced Speech
Publication date: Available online 17 June 2020Source: Speech CommunicationAuthor(s): Katsuhiko Yamamoto, Toshio Irino, Shoko Araki, Keisuke Kinoshita, Tomohiro Nakatani (Source: Speech Communication)
Source: Speech Communication - June 18, 2020 Category: Speech-Language Pathology Source Type: research

Editorial Board
Publication date: August 2020Source: Speech Communication, Volume 121Author(s): (Source: Speech Communication)
Source: Speech Communication - June 8, 2020 Category: Speech-Language Pathology Source Type: research

Automatic accent identification as an analytical tool for accent robust automatic speech recognition
Publication date: Available online 4 June 2020Source: Speech CommunicationAuthor(s): Maryam Najafian, Martin Russell (Source: Speech Communication)
Source: Speech Communication - June 5, 2020 Category: Speech-Language Pathology Source Type: research

DeepConversion: Voice conversion with limited parallel training data
Publication date: Available online 4 June 2020Source: Speech CommunicationAuthor(s): Mingyang Zhang, Berrak Sisman, Li Zhao, Haizhou Li (Source: Speech Communication)
Source: Speech Communication - June 5, 2020 Category: Speech-Language Pathology Source Type: research

The Hearing-Aid Speech Perception Index (HASPI) Version 2
Publication date: Available online 24 May 2020Source: Speech CommunicationAuthor(s): James M. Kates, Kathryn H. Arehart (Source: Speech Communication)
Source: Speech Communication - June 2, 2020 Category: Speech-Language Pathology Source Type: research

The Hearing-Aid Speech Perception Index (HASPI)
Publication date: Available online 24 May 2020Source: Speech CommunicationAuthor(s): James M. Kates, Kathryn H. Arehart (Source: Speech Communication)
Source: Speech Communication - May 26, 2020 Category: Speech-Language Pathology Source Type: research

Egyptian Arabic Speech Emotion Recognition using Prosodic, Spectral and Wavelet Features
Publication date: Available online 22 May 2020Source: Speech CommunicationAuthor(s): Lamiaa Abdel-Hamid (Source: Speech Communication)
Source: Speech Communication - May 23, 2020 Category: Speech-Language Pathology Source Type: research

A Review of Multi-Objective Deep Learning Speech Denoising Methods
Publication date: Available online 21 May 2020Source: Speech CommunicationAuthor(s): Arian Azarang, Nasser Kehtarnavaz (Source: Speech Communication)
Source: Speech Communication - May 21, 2020 Category: Speech-Language Pathology Source Type: research

The effect of female voice on verbal processing
Publication date: Available online 20 May 2020Source: Speech CommunicationAuthor(s): Laura Smorenburg, Aoju Chen (Source: Speech Communication)
Source: Speech Communication - May 21, 2020 Category: Speech-Language Pathology Source Type: research

The interplay of prosodic cues in the L2: How intonation, rhythm, and speech rate in speech by Spanish learners of Dutch contribute to L1 Dutch perceptions of accentedness and comprehensibility
Publication date: Available online 18 May 2020Source: Speech CommunicationAuthor(s): Lieke van Maastricht, Tim Zee, Emiel Krahmer, Marc Swerts (Source: Speech Communication)
Source: Speech Communication - May 19, 2020 Category: Speech-Language Pathology Source Type: research

Editorial Board
Publication date: June 2020Source: Speech Communication, Volume 120Author(s): (Source: Speech Communication)
Source: Speech Communication - May 19, 2020 Category: Speech-Language Pathology Source Type: research

Single-Channel Speech Enhancement with Correlated Spectral Components: Limits-Potential
Publication date: Available online 16 May 2020Source: Speech CommunicationAuthor(s): Pejman Mowlaee, Johannes K.W. Stahl (Source: Speech Communication)
Source: Speech Communication - May 18, 2020 Category: Speech-Language Pathology Source Type: research

Integrating Lexical and Prosodic Features for Automatic Paragraph Segmentation
Publication date: Available online 11 May 2020Source: Speech CommunicationAuthor(s): Catherine Lai, Mireia Farrús, Johanna D. Moore (Source: Speech Communication)
Source: Speech Communication - May 12, 2020 Category: Speech-Language Pathology Source Type: research

HiLAM-state discriminative Multi-Task Deep Neural Network in Dynamic Time Warping framework for Text-Dependent Speaker Verification
Publication date: Available online 6 May 2020Source: Speech CommunicationAuthor(s): Mohammad Azharuddin Laskar, Rabul Hussain Laskar (Source: Speech Communication)
Source: Speech Communication - May 6, 2020 Category: Speech-Language Pathology Source Type: research

Editorial Board
Publication date: May 2020Source: Speech Communication, Volume 119Author(s): (Source: Speech Communication)
Source: Speech Communication - May 5, 2020 Category: Speech-Language Pathology Source Type: research

Analytic Phase Features for Dysarthric Speech Detection and Intelligibility Assessment
Publication date: Available online 1 May 2020Source: Speech CommunicationAuthor(s): Krishna Gurugubelli, Anil Kumar Vuppala (Source: Speech Communication)
Source: Speech Communication - May 2, 2020 Category: Speech-Language Pathology Source Type: research

Automatic Speaker Profiling from Short Duration Speech Data
Publication date: Available online 1 May 2020Source: Speech CommunicationAuthor(s): Shareef Babu Kalluri, Deepu Vijayasenan, Sriram Ganapathy (Source: Speech Communication)
Source: Speech Communication - May 2, 2020 Category: Speech-Language Pathology Source Type: research

Adaptive and Hybrid Kronecker Product Beamforming for Far-Field Speech Signals
Publication date: Available online 25 April 2020Source: Speech CommunicationAuthor(s): Rajib Sharma, Israel Cohen, Jacob Benesty (Source: Speech Communication)
Source: Speech Communication - April 29, 2020 Category: Speech-Language Pathology Source Type: research

Discriminative neural network pruning in a multiclass environment: a case study in spoken emotion recognition
Publication date: Available online 2 April 2020Source: Speech CommunicationAuthor(s): Máximo Sánchez-Gutiérrez, Pedro P. González-Pérez (Source: Speech Communication)
Source: Speech Communication - April 4, 2020 Category: Speech-Language Pathology Source Type: research

Speech Emotion Recognition Using Fusion of Three Multi-Task Learning-based Classifiers: HSF-DNN, MS-CNN and LLD-RNN
Publication date: Available online 28 March 2020Source: Speech CommunicationAuthor(s): Zengwei Yao, Zihao Wang, Weihuang Liu, Yaqian Liu, Jiahui Pan (Source: Speech Communication)
Source: Speech Communication - March 30, 2020 Category: Speech-Language Pathology Source Type: research

Single and multiple frame coding of LSF parameters using deep neural network and pyramid vector quantizer
Publication date: Available online 19 March 2020Source: Speech CommunicationAuthor(s): Yaxing Li, Ying Kang, Hao Wu, Yu Guo, Jin Meng (Source: Speech Communication)
Source: Speech Communication - March 21, 2020 Category: Speech-Language Pathology Source Type: research

Editorial Board
Publication date: April 2020Source: Speech Communication, Volume 118Author(s): (Source: Speech Communication)
Source: Speech Communication - March 14, 2020 Category: Speech-Language Pathology Source Type: research

On Quantifying the Quality of Acoustic Models in Hybrid DNN-HMM ASR
Publication date: Available online 10 March 2020Source: Speech CommunicationAuthor(s): Pranay Dighe, Afsaneh Asaei, Hervé Bourlard (Source: Speech Communication)
Source: Speech Communication - March 11, 2020 Category: Speech-Language Pathology Source Type: research

Automatic Classification of Infant Vocalization Sequences with Convolutional Neural Networks
Publication date: Available online 9 March 2020Source: Speech CommunicationAuthor(s): Franz Anders, Mario Hlawitschka, Mirco Fuchs (Source: Speech Communication)
Source: Speech Communication - March 10, 2020 Category: Speech-Language Pathology Source Type: research

A study of continuous space word and sentence representations applied to ASR error detection
Publication date: Available online 7 March 2020Source: Speech CommunicationAuthor(s): Sahar Ghannay, Yannick Estève, Nathalie Camelin (Source: Speech Communication)
Source: Speech Communication - March 8, 2020 Category: Speech-Language Pathology Source Type: research

Analysis and Classification of Phonation Types in Speech and Singing Voice
Publication date: Available online 25 February 2020Source: Speech CommunicationAuthor(s): Sudarsana Reddy Kadiri, Paavo Alku, B. Yegnanarayana (Source: Speech Communication)
Source: Speech Communication - February 27, 2020 Category: Speech-Language Pathology Source Type: research

Affective Synthesis and Animation of Arm Gestures from Speech Prosody
Publication date: Available online 25 February 2020Source: Speech CommunicationAuthor(s): Elif Bozkurt, Yücel Yemez, Engin Erzin (Source: Speech Communication)
Source: Speech Communication - February 27, 2020 Category: Speech-Language Pathology Source Type: research

Multilingual and Multimode Phone Recognition System for Indian Languages
Publication date: Available online 26 February 2020Source: Speech CommunicationAuthor(s): Kumud Tripathi, M. Kiran Reddy, K. Sreenivasa Rao (Source: Speech Communication)
Source: Speech Communication - February 27, 2020 Category: Speech-Language Pathology Source Type: research

Editorial Board
Publication date: February 2020Source: Speech Communication, Volume 117Author(s): (Source: Speech Communication)
Source: Speech Communication - February 23, 2020 Category: Speech-Language Pathology Source Type: research

Speaker discrimination: Citation tones vs. coarticulated tones
Publication date: February 2020Source: Speech Communication, Volume 117Author(s): Ricky KW ChanAbstractThe task of forensic voice comparison (FVC) often involves the comparison of a voice in an offender recording with that in a suspect recording, with the aim to assist the investigating authority or the court in determining the identity of the speaker. One of the main goals in FVC research is to identify speech variables that are useful for differentiating speakers. While French and Stevens (2013) stated that connected speech processes (CSPs) vary across speakers and thus CSPs may be included in the ‘toolbox’ f...
Source: Speech Communication - February 23, 2020 Category: Speech-Language Pathology Source Type: research

Cosine Metric Learning Based Speaker Verification
Publication date: Available online 20 February 2020Source: Speech CommunicationAuthor(s): Zhongxin Bai, Xiao-Lei Zhang, Jingdong ChenAbstractThe performance of speaker verification depends on the overlap region of the decision scores of true and imposter trials. Motivated by the fact that the overlap region can be reduced by maximizing the between-class distance while minimizing the within-class variance of the trials, we present in this paper two cosine metric learning (CML) back-end algorithms. The first one, named m-CML, aims to enlarge the between-class distance with a regularization term to control the within-class va...
Source: Speech Communication - February 21, 2020 Category: Speech-Language Pathology Source Type: research

Wh-question or wh-declarative? Prosody makes the difference
Publication date: Available online 13 February 2020Source: Speech CommunicationAuthor(s): Yang Yang, Stella Gryllia, Lisa Lai-Shen ChengAbstractMandarin wh-words can have question or non-question (e.g., existential, universal quantificational) interpretations. Their interpretations in a sentence are usually not ambiguous, as the distinct interpretations need to be licensed by particular items/contexts. The starting point of our study concerns a case which allows the wh-words to remain ambiguous in a sentence: wh-words such as shénme appearing with diǎnr. After empirically confirming that such sentences are indeed a...
Source: Speech Communication - February 14, 2020 Category: Speech-Language Pathology Source Type: research

Improving Generative Adversarial Networks for Speech Enhancement through Regularization of Latent Representations
Publication date: Available online 6 February 2020Source: Speech CommunicationAuthor(s): Fan Yang, Ziteng Wang, Junfeng Li, Risheng Xia, Yonghong YanAbstractSpeech enhancement aims to improve the quality and intelligibility of speech signals, which is a challenging task in adverse environments. Speech enhancement generative adversarial network (SEGAN) that adopted a generative adversarial network (GAN) for speech enhancement achieved promising results. In this paper, a new network architecture and loss function based on SEGAN are proposed for speech enhancement. Different from most network structures applied in this field,...
Source: Speech Communication - February 8, 2020 Category: Speech-Language Pathology Source Type: research

Subspace Gaussian mixture based language modeling for large vocabulary continuous speech recognition
Publication date: February 2020Source: Speech Communication, Volume 117Author(s): Ri Hyon Sun, Ri Jong CholAbstractThis paper focuses on adaptable continuous space language modeling approach of combining longer context information of recurrent neural network (RNN) with adaptation ability of subspace Gaussian mixture model (SGMM) which has been widely used in acoustic modeling for automatic speech recognition (ASR).In large vocabulary continuous speech recognition (LVCSR) it is a challenging problem to construct language models that can capture the longer context information of words and ensure generalization and adaptation...
Source: Speech Communication - January 30, 2020 Category: Speech-Language Pathology Source Type: research

nnnnSubspace Gaussian Mixture Based Language Modeling for Large Vocabulary Continuous Speech Recognition
Publication date: Available online 23 January 2020Source: Speech CommunicationAuthor(s): Ri Hyon Sun, Ri Jong CholAbstractThis paper focuses on adaptable continuous space language modeling approach of combining longer context information of recurrent neural network (RNN) with adaptation ability of subspace Gaussian mixture model (SGMM) which has been widely used in acoustic modeling for automatic speech recognition (ASR).In large vocabulary continuous speech recognition (LVCSR) it is a challenging problem to construct language models that can capture the longer context information of words and ensure generalization and ada...
Source: Speech Communication - January 24, 2020 Category: Speech-Language Pathology Source Type: research

Effect of articulatory and acoustic features on the intelligibility of speech in noise: an articulatory synthesis study
This study used an analysis-by-synthesis strategy to explore the contributions of multiple of these features. To this end, an articulatory speech synthesizer was used to synthesize the ten German digit words “Null” to “Neun”, for all 16 combinations of four binary features, i.e., modal vs. pressed phonation, normal vs. increased F1 and F2 formant frequencies, normal vs. increased f0 mean and range, and normal vs. increased duration of vowels. Subjects were asked to try to recognize the synthesized words in the presence of strong pink noise and babble noise. Compared to “pla...
Source: Speech Communication - January 22, 2020 Category: Speech-Language Pathology Source Type: research

Editorial Board
Publication date: January 2020Source: Speech Communication, Volume 116Author(s): (Source: Speech Communication)
Source: Speech Communication - January 17, 2020 Category: Speech-Language Pathology Source Type: research

Is segmental foreign accent perceived categorically?
Publication date: Available online 15 January 2020Source: Speech CommunicationAuthor(s): Rubén Pérez-Ramón, Martin Cooke, María Luisa García LecumberriAbstractThe second language learning process involves acquisition of sounds that differ to varying degrees from the sounds of a learner’s native language. Learners’ productions are strongly influenced by their native language particularly for sounds which are similar but non-identical in the two languages. However, foreign accent is typically investigated at the level of utterances and as a consequence the segmental basis of for...
Source: Speech Communication - January 15, 2020 Category: Speech-Language Pathology Source Type: research

Positioning Oneself in Different roles: Structural and Lexical Measures of Power Relations Between Speakers in Map Task Corpus
Publication date: Available online 10 January 2020Source: Speech CommunicationAuthor(s): Vered Silber-Varod, Sarit Malayev, Anat LernerAbstractThis paper focuses on the process whereby speakers position themselves in jointly produced conversations. The expected degree of dominancy (degree of power realization) in the dialogues is derived by the independent variable of the role of a participant – a leader or a follower – in a Map Task setting. We examine the participants’ dominancy as reflected by a set of structural and lexical features. We then observe how the features are realized in four sex pairings: ...
Source: Speech Communication - January 11, 2020 Category: Speech-Language Pathology Source Type: research