Pitch estimation using mutual information
Majid Mirbagheri, Yanbo Xu, Shihab Shamma
Establishing some principles of human speech production through two-dimensional computational models
Mauro Nicolao, Roger K. Moore
A spectral envelope estimation method based on F0-adaptive multi-frame integration analysis
Tomoyasu Nakano, Masataka Goto
Cochlear implant-like processing of speech signal for speaker verification
Cong-Thanh Do, Claude Barras
Evaluating speech intelligibility enhancement for HMM-based synthetic speech in noise
Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King
A generalized Stein’s estimation approach for speech enhancement based on perceptual criteria
Sunder Ram Krishnan, Chandra Sekhar Seelamantula
Non-stationary signal processing and its application in speech recognition
Zoltán Tüske, Friedhelm R. Drepper, Ralf Schlüter
Joint uncertainty decoding with unscented transform for noise robust subspace Gaussian mixture models
Liang Lu, Arnab Ghoshal, Steve Renals
Hierarchical hybrid language models for open vocabulary continuous speech recognition using WFST
M. Ali Basha Shaik, David Rybach, Stefan Hahn, Ralf Schlüter, Hermann Ney
Template-based ASR using posterior features and synthetic references: comparing different TTS systems
Serena Soldo, Mathew Magimai-Doss, Hervé Bourlard
Explicit duration modelling in HMM-based speech synthesis using a hybrid hidden Markov model-multilayer perceptron
Kalu U. Ogbureke, João P. Cabral, Julie Carson-Berndsen
Dimensionality reduction of large TDOA vectors for speaker diarization
Deepu Vijayasenan, Fabio Valente
Joint detection and localization of multiple speakers using a probabilistic interpretation of the steered response power
Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel, Dietrich Klakow
Structured sparse coding for microphone array location calibration
Afsaneh Asaei, Bhiksha Raj, Hervé Bourlard, Volkan Cevher
Log-normal matrix factorization with application to speech-music separation
Takuya Yoshioka, Daichi Sakaue
Multi-channel speech separation with soft time-frequency masking
Rahil Mahdian Toroghi, Friedrich Faubel, Dietrich Klakow
Smoothing speech trajectories by regularization
Heyun Huang, Louis ten Bosch, Bert Cranen, Lou Boves
Data-driven speech representations for NMF-based word learning
Joris Driesen, Jort F. Gemmeke, Hugo Van hamme
Spectro-temporal features with distribution equalization
Samuel K. Ngouoko M, Martin Heckmann, Britta Wrede
Language identification using spectro-temporal patch features
Kamal Sahni, Pranay Dighe, Rita Singh, Bhiksha Raj
Inharmonic speech: a tool for the study of speech perception and separation
Josh H. McDermott, Daniel P. W. Ellis, Hideki Kawahara