doi: 10.21437/Eurospeech.1995
ISSN: 1018-4074
Syllabic duration control for vocabulary-free speech recognition
Takatoshi Jitsuhiro, Tomokazu Yamada, Shigeki Sagayama
Effectiveness of pause information in the content word detection of spoken dialogues
Kazuyuki Takagi, Shuichi Itahashi
Connected Japanese digit recognition with pitch accent-dependent models
Kazuhiro Kondo
Temporal control and training selection for HMM-based system
C. Barras, M.-J. Caraty, C. Montacie
HMM-based tone recognition of Chinese trisyllables using double codebooks on fundamental frequency and waveform power
Keikichi Hirose, Xinhni Hu
Very low delay and high quality coding of 20 hz -15 khz speech at 64 kbit/S
C. Murgia, Gang Feng, C. Quinquis, A. Le Guyader
Wideband CELP coder at 16-kbit/s with 10-ms frame
Shigeaki Sasaki, Akitoshi Kataoka, Takehiro Moriya
High quality 14.1kb/s wideband speech coder
A. W. Black, I. A. Atkinson, A. M. Kondoz, B. G. Evans
A multipulse-deconvolution codec for wideband speech
V. Abreu-Sernandez, D. Docampo-Amoedo
Multi-channel linear predictive coding of audio signals
T. Chonavel, S. Saoudi
An advanced multi-DSP platform for speech technology integration in computer telephony applications
E. Rohwer
Voice processing architecture for computer-telephony integration
Rafael Ciria, Rafael Sarmiento de Sotomayor, Cristina Aguila, José Parera, Juan Santos
Fast convergent analog adaptive filter
L. Ortiz-Balbuena, H. Perez-Meana, A. Martinez-Gonzalez, L. Nino de Rivera, M. Nakano-Miyatake
Efficient isolated word recognition in Spanish based on static modeling
Manuel A. Leandro, Alvaro Villegas, José M. Pardo
Reliability in a multi-agent spoken language recognition system
Jean-Luc Cochard, Olivier Oppizzi
A study of speaker adaptation based on minimum classification error training
Tomoko Matsui, Sadaoki Furui
Maximum likelihood based discriminative training of acoustic models
Albino Nogueiras-Rodriguez, José B. Marino
Discriminant learning with minimum memory loss for improved non-vocabulary rejection
Hugues Leprieur, Patrick Haffner
Codebook weights adaptation for discriminative training of SCHMM-based speech recognition systems
Cesar Martin del Álamo, F. Javier Caminero-Gil, Celinda de la Torre-Munilla, Lúis Hernandez-Gomez
Discriminative training of hidden Markov models using overall risk criterion and reduced gradient method
Kyungmin Na, Bumki Jeon, Dong-Il Chang, Soo-Ik Chae, Souguil Ann
Discriminative training of HMM based speech recognizer with gradient projection method
Qiang Huo, Chorkin Chan
Optimization of speech parameter weighting for CDHMM word recognition
Javier Hernando, J. Ayarte, E. Monte
Discriminative utterance verification for connected digits recognition
Mazin G. Rahim, Chin-Hui Lee, Biing-Hwang Juang
MCE estimation of VQ parameters for MVQHMM speech recognition
Antonio M. Peinado, Antonio J. Rubio, José C. Segura, Victoria Sanchez, Jesus E. Diaz
Discriminative training for continuous speech recognition
Wolfgang Reichl, Günther Ruske
Minimum classification error training algorithm for feature extractor and pattern classifier in speech recognition
Kuldip K. Paliwal, M. Bacchiani, Yoshinori Sagisaka
Integrated optimization of feature transformation for speech recognition
Stephan Euler
Phoneme transition detection and broad classification using a simple model based on the function of onset detector cells found in the cochlear nucleus
Andrew C. Morris, José M. Pardo
Linear predictive coding of speech using an analogue cochlear model
Eric Fragniere, Andre van Schaik, Eric Vittoz
Pitch extraction of telephone bandwidth speech using a place-temporal approach
E. Jones, E. Ambikairajah
A binaural selectivity model for speech recognition
Markus Bodden, Timothy R. Anderson
Speech recognition experiments in a noisy environment using auditory system modelling
Cristina Dobrin, Petri Haavisto, Kari Laurila, Jaakko Astola
Source separation by a functional model of amplitude demodulation
Frédéric Berthommier, Georg F. Meyer
Real-time implementation of spectral subtraction algorithm for suppression of acoustic noise in speech
V. Davidek, P. Sovka, J. Sika
Speech synthesis for the new pan-european traffic message control system RDS-TMC
Bert Van Coile, Hans-Wilhelm Rühl, L. Vogten, M. Thoone, S. Goß, D. Delaey, E. Moons, Jacques M. B. Terken, Jan Roelof de Pijper, M. Kugler, P. Kaufholz, R. Krüger, S. Leys, S. Willems
Echo cancelling in speech recognition systems
R. Pacifici, G. Manca
Parallel implementation of an hybrid neural network used for speech recognition task
T. Calonge, L. Alonso, R. Ralha, A. L. Sanchez
Hardware design of LPC coding for speech feature extraction
M. Li, J. T. Proudfoot
Modularization in task-specific language modelling
Henning Bergmann, Hans-Hermann Hamer, Andreas Noll, Annedore Paeseler, Horst Tomaschewski
Beyond NYQUIST: towards the recovery of broad-bandwidth speech from narrow-bandwidth speech
Carlos Avendano, Hynek Hermansky, Eric A. Wan
Large vocabulary multilingual speech recognition using HTK
D. Pye, Phil C. Woodland, S. J. Young
Issues in large vocabulary, multilingual speech recognition
Lori Lamel, M. Adda-Decker, Jean-Luc Gauvain
Comparative performance in large-vocabulary isolated-word recognition in five european languages
James Barnett, Paul Bamberg, Martin Held, Juan Huerta, Linda Manganaro, Adam Weiss
French speech recognition in an automatic dictation system for translators: the transtalk project
Julie Brousseau, Caroline Drouin, George Foster, Pierre Isabelle, Roland Kuhn, Yves Normandin, Pierre Plamondon
The Philips large-vocabulary recognition system for american English, French, and German
Christian Dugast, Xavier Aubert, Reinhard Kneser
A syllable-based very-large-vocabulary voice retrieval system for Chinese databases with textual attributes
Sung-Chien Lin, Lee-Feng Chien, Keh-Jiann Chen, Lin-Shan Lee
The AT&t 60,000 word speech-to-text system
Michael Riley, Andrej Ljolje, Donald Hindle, Fernando Pereira
Fast and accurate continuous speech recognition for Chinese language with very large vocabulary
Tai-hsuan Ho, Hsin-min Wang, Lee-feng Chien, Keh-Jiann Chen, Lin-shan Lee
Methods towards the very large vocabulary Chinese speech recognition
Zuoying Wang, Jun Wu, Xi Xiao, Jin Quo
Utterance clustering for large vocabulary continuous speech recognition
G. D. Cook, A. J. Robinson
Vector quantization of glottal pulses
Thomas Eriksson, Jan Linden, Jan Skoglund
A speech coding algorithm based on prototypes interpolation with critical bands and phase coding
Michele Festa, Daniele Sereno
Very low-bitrate speech coding using perceptually-derived spectral data
D. Tsoukalas, Jiannis Mouropoulos, George Kokkinakis
A new very low bit rate speech coder: the step decomposition vocoder
Lorenzo Piazzo
Time envelope LP vocoder: a new coding technique at very low bit rates
I. A. Atkinson, A. M. Kondoz, B. G. Evans
Speech coding based on the discrete-time wavelet transform and human auditory system properties
Dan Stefanoiu, Radwan Kastantin, Gang Feng
Wavelets for low bit rate speech coding applications
F. J. Ancin, M. L. Larreategui, B. L. Burrows, R. A. Carrasco
Adaptive speech vector coding with a multiresolution hierarchical codebook
E. Mandridake, R. Atay, M. Najim
Subband analysis-by-synthesis coding
Andrei Popescu, Nicolas Moreau
A robust 2.4kb/s LP-MBE with iterative LP modelling
Clifford I. Parris, Danny Wong, Francois Chambon
Improved transient representation and quantization for sinusoidal speech coders
M. S. Torres-Guijarro, F. J. Casajus-Quiros
Efficient multiband excitation linear predictive coding of speech at 1.6 kbps
W. M. E. Yu, Cheung-Fat Chan
Voice coding in the MSBN satellite communication system
Bruno Wery, Stephane Deketelaere
Spectral envelope estimation for low bit-rate sinusoidal speech coders
B. M. G. Cheetham, X. Q. Sun, W. T. K. Wong
Shift-invariant adaptive local trigonometric decomposition
Israel Cohen, Shalom Raz, David Malah
Spectral envelope of speech using wavelets
Paul Micallef, Edward Chilton
Multiresolution speech analysis using fast time-varying orthogonal wavelet packet transform algorithms
Andrzej Drygajlo, Nicolas Thevoz
Second- and third-order wigner distributions in hierarchical recognition of speech phonemes
Maria Rangoussi, Flemming Pedersen
The use of maximum a posteriori parameters in linear prediction of speech
G. M. K. Saleh, M. Niranjan, W. J. Fitzgerald
Observed long-term changes in customer calling patterns in a telephone application using automatic speech recognition
William C. G. Ortel
Combining speech algorithms into a "natural" application of speech technology for telephone network services
Ayman Asadi, David Lubensky, L. Madhavrao, Jayant Naik, Vijay Raman, George Vysotsky
Intelligent answering machine-secretary
Ye. Ludovik, V. Sibirtsev
Verbal-gestural behaviors in multimodal spoken language interpreting telecommunications
Kyung-ho Loken-Kim, Young-duk Park, Suguru Mizunashi, Laurel Fais, Tsyuoshi Morimoto
Large vocabulary, word-based Mandarin dictation system
Jung-Kuei Chen, Lin-Shan Lee, Frank K. Soong
Read my lips... and my jaw! how intelligible are the components of a speaker's face?
Bertrand Le Goff, Thierry Guiard-Marigny, Christian Benoît
Mcgurk effect in Spanish and German listeners: influences of visual cues in the perception of Spanish and German conflicting audio-visual stimuli
Angela Fuster Duran
Rule-based visual speech synthesis
Jonas Beskow
A new algorithm for visual synthesis of speech
Fabio Lavagetto, Paolo Lavagetto
Audiovisual speech recognition using the fuzzy shape filters model
H. Kabre
On the use of features from prediction residual signals in speaker identification
Jialong He, Li Liu, Günther Palm
Some nonparametric distance measures in speaker verification
Kai Tat Ng, Haizhou Li, Jean-Paul Haton
Adaptive transforms for speaker recognition
Michael J. Carey, Graham D. Tattersall, Eluned S. Parris
Speaker recognition with discriminative speaker VQ models
Kai Tat Ng, Jian Su, Bingzheng Xu
Parametric speaker recognition over large population of telephonic voices
A. Federico, Andrea Paoloni
Speaker recognition experiments in Estonian using multi-layer feed-forward neural nets
Toomas Altosaar, Einar Meister
Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods
Ivan Magrin-Chagnolleau, Jean-Frangois Bonastre, Frédéric Bimbot
Automatic speaker recognition using formants-based nearest-neighbour distance measure
Pavel V. Labulin, Sergey L. Koval, Andrej N. Raev
Discrimination of voices of twins and siblings for speaker verification
M. Mehdi Homayounpour, Gerard Chollet
Theoretical error prediction for a language identification system using optimal phoneme clustering
Kay M. Berkling, Etienne Barnard
Separation of speakers in audio data
Jesper O. Olsen
Text-dependent speaker verification using dynamic time warping and vector quantization of LSF
J.-L. Bonifas, I. Hernaez Rioja, B. Etxebarria Gonzalez, S. Saoudi
On MMI learning of Gaussian mixture for speaker models
Haizhou Li, Jean-Paul Haton, Yifan Gong
Evaluation of Bayes decision approach to automatic determination of thresholds for speaker verification
Yifan Gong
Comparison of different HMM based methods for speaker verification
Daniele Falavigna
Speaker classification by neural network for short utteranses using phoneme groups in Farsi
J. Sheikhzadegan, M. Tebiani, M. Lotfizad, M. R. Roohani
Speaker recognition experiments on the NTIMIT database
J.-L. Le Floch, C. Montacie, M.-J. Caraty
Speaker identification using vector quantisation with codeword-specific derivative coding
Michael Wagner, John S. Mason, J. Bruce Millar
Speaker recognition with temporal transition models
Haizhou Li, Jean-Paul Haton, Jian Su, Yifan Gong
Speaker recognition using HMM composition in noisy environments
Tomoko Matsui, Tomohito Kanno, Sadaoki Furui
Speaker recognition using HMM with experiments on the yoho database
ChiWei Che, Qiguang Lin
Speaker recognition models
Kin Yu, John S. Mason, John Oglesby
Multi-state predictive neural networks for text-independent speaker recognition
T. Artieres, Patrick Gallinari
A voiced/unvoiced speech discrimination technique based on fuzzy logic
Francesco Beritelli, Salvatore Casale, Marco Russo
Evaluation of a periodic/aperiodic speech decomposition algorithm
V. Darsinos, Christophe d'Alessandro, B. Yegnanarayana
A pitch determination and voiced/unvoiced decision algorithm for noisy speech
Jean Rouat, Yong Chun Liu, Daniel Morissette
Modulated Gaussian wavelet transform based speech analyser (MGWTSA) pitch detection algorithm (PDA)
Leonard Janer
A time-frequency approach to epoch detection
Juan L. Navarro-Mesa, Ignasi Esquerra-Llucia
An improved epoch detection algorithm based on sinusoidal modelling of speech
M. L. Larreategui, F. J. Ancin, R. A. Carrasco
A method for fully automatic analysis and modelling of voice source characteristics
V. Darsinos, D. Galanis, George Kokkinakis
Dynamic vowel quality: a new determination formalism based on perceptual experiments
Hartmut R. Pfitzinger
A method for quantitative analysis of the local speech rate
Sumio Ohno, Hiroya Fujisaki
Voice personality transformation using an orthogonal vector space conversion
Ki Seung Lee, Dae Hee Youn, Il Whan Cha
Spectral mapping for voice conversion using speaker selection and vector field smoothing
Makoto Hashimoto, Norio Higuchi
Analysis of acoustic features affecting speaker identification
Norio Higuchi, Makoto Hashimoto
Speaker individualities in fundamental frequency contours and its control
Masato Akagi, Taw Ienaga
Interpolating MBE v/UV mixture function for high quality synthesis of speech
King-fai Lam, Cheung-fat Chan
Statistical methods for voice quality transformation
Yannis Stylianou, Olivier Cappe, Eric Moulines
High-quality speech modification based on a harmonic + noise model
Yannis Stylianou, Jean Laroche, Eric Moulines
Source generator based stressed speech perturbation
Sahar E. Bou-Ghazah, John H. L. Hansen
Improved algorithms for speech recognition in noise using lateral inhibition and SNR weighting
Nestor Becerra Yoma, Fergus R. McInnes, Mervyn A. Jack
Noise adaptation using linear regression for continuous noisy speech recognition
Olivier Siohan, Yifan Gong, Jean-Paul Haton
Dynamic parameter compensation for speech recognition in noise
Ruikang Yang, Markku Majaniemi, Petri Haavisto
Robust speech recognition in noise using speech enhancement based on masking properties of the auditory system and adaptive HMM
Andrzej Drygajlo, Nathalie Virag, Gregoire Cosendai
Canonical correlation based compensation approach for robust speech recognition in noisy environment
Dong Yu, Taiyi Huang
A unified approach for robust speech recognition
Pedro J. Moreno, Bhiksha Raj, Richard M. Stern
Effect of rasta-type processing for speech recognition with speaking-rate mismatches
Harald Singer, Kuldip K. Paliwal, Tomohiko Beppu, Yoshinori Sagisaka
Fast speakers in large vocabulary continuous speech recognition: analysis & antidotes
Nikki Mirghafori, Eric Foster, Nelson Morgan
Signal conditioned minimum error rate training
Wu Chou, Mazin G. Rahim, Eric Buhrke
Non-uniform unit HMMS for speech recognition
Takeshi Matsumura, Shoichi Matsunaga
Training data clustering for improved speech recognition
Ananth Sankar, Frangoise Beaufays, Vassilios Digalakis
On the use of bi-directional contextual dependence in acoustic modeling for speech recognition
Qiang Huo, Chorkin Chan
Spontaneous speech recognition using dynamic CEPSTRA incorporating forward and backward masking effect
Tomohiko Beppu, Kiyoaki Aikawa
Stochastic trajectory models for speech recognition: an extension to modelling time correlation
Mohamed Afify, Yifan Gong, Jean-Paul Haton
An analysis of cepstral-time matrices for noise and channel robust speech recognition
Ben P. Milner, Saeed V. Vaseghi
A confidence measure for acoustic likelihood scores
Ze'ev Rivlin
A discriminative filter bank model for speech recognition
Alain Biem, Erik McDermott, Shigeru Katagiri
A stochastic grammar for isolated representation of syntactic and semantic knowledge
Holger Stahl, Johannes Müller
Concept-based spontaneous speech understanding system
Esther Levin, Roberto Pieraccini
Semantic decoding of speech in constrained domains
Antonio Bonafonte, José B. Marino, Eduardo Lleida
A speech understanding architecture for an information query system
Marcello Federico, Fabrizio Vernesoni
A one-pass search algorithm for understanding natural spoken time utterances by stochastic models
Josef G. Bauer, Holger Stahl, Johannes Müller
Improvements in an HMM-based speech synthesiser
R. E. Donovan, Phil C. Woodland
Sub-phonemic optimal path search for concatenative speech synthesis
Yoshiharu Itoh, Makoto Hashimoto, Norio Higuchi
Optimising selection of units from speech databases for concatenative synthesis
Alan W. Black, Nick Campbell
Automatic data-driven prosodic modeling for text-to-speech
E. Lopez-Gonzalo, Lúis Hernandez-Gomez
Text-to-speech oriented automatic learning of Italian prosody
F. Mana, Silvia Quazza
A simple method of predicting the duration of syllables
Andrew P. Breen
A neural-network-based model of segmental duration for speech synthesis
Marcel Riedi
Generating French intonation at different speaking rates
Frédéric Beaugendre
Segmental duration in French text-to-speech synthesis
Evelyne Tzoukermann, Olivier Soumoy
Developing the prosodic component for Swedish speech synthesis
M. Home, M. Filipsson
Results of a speaker verification service trial using HMM models
Anand Setlur, Thomas Jacobs
Application of phonetic weighting to the neural tree network based speaker recognition system
Han-Sheng Liou, Richard J. Mammone
Experiments with speaker verification over the telephone
Jean-Luc Gauvain, Lori Lamel, B. Prouts
Improved CELP algorithm suited for various speech coding applications
Sofia Moreno Perez, Ramon Garcia Gomez
Comparative study of two codecs for an enhanced GSM system
S. A. Atungsiri, A. M. Kondoz, B. G. Evans
Fast low-delay CELP coding of speech at 8kbps
Siu-pun Chui, Cheung-fat Chan
A low bit-rate speech coder using the perceptual properties of the human ear
Hong Goo Kang, Jeong Tae Seo, Il Whan Cha, Dae Hee Youn
Very fast CELP coding using stochastic innovations
Silvio Cucchi, Marco Fratti
Fast codebook search algorithm based on hamming ECC for algebraic CELP speech coding
M. Bouraoui, W. Glass, Gang Feng
Implementation aspects of the GSM half-rate speech codec
Tim Fingscheidt, Thomas Wiechers, Eckhard Delfs
Low and variable bit-rate speech coding for ATM networks
S. D. Watson, B. M. G. Cheetham, W. T. K. Wong, A. V. Lewis
A differential encoding method for the LTP delay in CELP coders
Andrei Popescu, Nicolas Moreau, Claude Lamblin
Robust, n-best formant tracking
Philipp Schmid, Etienne Barnard
Formant tracking using reassigned spectrum
F. Plante, William A. Ainsworth
Direct calculation of the vocal tract area function from measured formant frequencies
Jean Schoentgen, S. Ciocca
Robust estimation of spectral center-of-gravity trajectories using mixture spline models
Don X. Sun
Bound for minkowski metric based on LP distortion measure
J. S. Pan, Fergus R. McInnes, Mervyn A. Jack
An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features
Keiichi Tokuda, Takashi Masuko, Tetsuya Yamada, Takao Kobayashi, Satoshi Imai
Deriving articulatory representations of speech
Hywel B. Richards, John S. Mason, Melvyn J. Hunt, John S. Bridle
Improved acoustic-phonetic modeling in philips' dictation system by handling liaisons and multiple pronunciations
Xavier Aubert, Christian Dugast
Digit recognition with stochastic perceptual speech models
Nelson Morgan, Su-Lin Wu, Hervé Bourlard
On incorporating phonemic constraints in hidden Markov models for speech recognition
R. N. V. Sitaram, Thippur Sreenivas
An improvement on syllable-based continuous Mandarin speech recognition via using inter-syllable boundary models
Saga Chang, Sin-Horng Chen
Optimizing baseforms for HMM-based speech recognition
Torbjorn Svendsen, Frank K. Soong, Heiko Purnhagen
Acoustic modeling of context dependent units, for large vocabulary speech recognition in Spanish
J. Alvarez-Cercadillo, Chin-Hui Lee, Luis Hernandez-Gomez
Estimation of statistical phoneme center and its application to accurate phoneme modelling
Shigeki Okawa, Katsuhiko Shirai
Hybrid hidden Markov models in speech recognition
Z. Li, P. Kenny, Douglas O'Shaughnessy
Acoustic-phonetic modeling for flexible vocabulary speech recognition
L. Fissore, F. Ravera, Pietro Laface
Segmental duration and HMM modeling
Pierre Dumouchel, Douglas O'Shaughnessy
A knowledge-based model for speaker-independent, acoustic-phonetic decoding
A. Ghio, Mario Rossi
A database for microphone array experimentation
Ea-Ee Jan, Piergiorgio Svaizer, James L. Flanagan
The OGI 22 language telephone speech corpus
T. Lander, Ronald A. Cole, B. T. Oshika, M. Noel
New telephone speech corpora at CSLU
Ronald A. Cole, M. Noel, T. Lander, T. Durham
The Dutch polyphone corpus
E. A. den Os, T. I. Boogaart, Lou Boves, Esther Klabbers
The waxholm application database
J. Bertenstam, Mats Blomberg, Rolf Carlson, Kjell Elenius, Björn Granström, Joakim Gustafson, Sheri Hunnicutt, J. Hogberg, R. Lindell, L. Neovius, Lennart Nord, Antonio de Serpa-Leitao, N. Strom
A pitch extraction reference database
F. Plante, Georg F. Meyer, William A. Ainsworth
Eagles spoken language working group: overview and results
Richard Winski, Roger K. Moore, Dafydd Gibbon
CEUDEX: a data base oriented to context-dependent units training in Spanish for continuous speech recognition
Celinda de la Torre-Munilla, Luis Hernandez-Gomez, Daniel Tapias
Design of a phonetic corpus for a speech database in basque language
K. Lopez de Ipina, I. Torres, L. Onederra
you'd better say nothing than say something wrong: analogy, accuracy and text-to-speech applications
V. Pirrelli, S. Federici
Bulgarian speech database: a pilot study
A. Misheva, S. Dimitrova, V. Filipov, E. Grigoreva, M. Nikov, Peter Roach, S. Arnfield
The Phondat-verbmobil speech corpus
Wolfgang J. Hess, Klaus J. Kohler, Hans-Günther Tillmann
EUROM - a spoken language resource for the EU - the SAM projects
Dominic Chan, Adrian Fourcin, Dafydd Gibbon, Björn Granström, Mark Huckvale, George Kokkinakis, Knut Kvale, Lori Lamel, Borge Lindberg, Asunción Moreno, Jiannis Mouropoulos, Franco Senia, Isabel Trancoso, Corin 't Veld, Jerome Zeiliger
A flexible formal language for the orthographic transcription of spontaneous spoken dialogues
Gernot A. Fink, Michaela Johanntokrax, Brigitte Schaffranietz
Design and implementation of Mandarin speech database in taiwan
Hsiao-Chuan Wang
Word hypothesizer based on reliably detected phoneme similarity regions
Philippe Morin, Ted H. Applebaum
Experimental analysis of the search space for 20 000-word speech recognition
S. Ortmanns, Hermann Ney
Speech parsing by downward request search based on the divide and conquer method
Ming-Sheng Wang, Satoshi Imai
Fast match based on decision tree
Claire Waast, Lalit Bahl, Marc El-Beze
Fast and accurate beam search using forward heuristic functions in HMM-LR speech recognition
Yoshiaki Noda, Shigeki Sagayama
Utterance verification improves closed-set recognition and out-of-vocabulary rejection
Don Colton, Mark Fanty, Ronald A. Cole
A comparison of two exact algorithms for finding the n-best sentence hypotheses in continuous speech recognition
V. M. Jimenez, A. Marzal, J. Monné
Top-down speech detection and n-best meaning search in a voice activated telephone extension system
Kazuya Takeda, Shingo Kuroiwa, Masaki Naito, Seiichi Yamamoto
Fast likelihood computation for continuous-mixture densities using a tree-based nearest neighbor search
Frank Seide
Hamming distance approximation for a fast log-likelihood computation for mixture densities
Peter Beyerlein, Meinhard Ullrich
An efficient output probability computation for continuous HMM using rough and detail models
Yasuhiro Komori, Masayuki Yamada, Hiroki Yamamoto, Yasunori Ohora
Speeding up the score computation of HMM speech regognizers with the bucket voronoi intersection algorithm
J. Fritsch, I. Rogina, Tilo Sloboda, Alex Waibel
On the speech feature selection problem: are dynamic features more important than the static ones?
Jan Nouza
Filtering the time sequence of spectral parameters for speaker-independent CDHMM word recognition
Climent Nadeu, Pau Paches-Leal, Biing-Hwang Juang
Robust phoneme prototype extraction for speech recognition
Dimitris Tambakas, Nikos Fakotakis, George Kokkinakis
The even transform: a variance-equalizing orthogonal transformation and its application to speech recognition
Melvyn J. Hunt
Using segmental coefficients in HMM speech recognition
Kai Hübener
On the use of the derivative of the pole trajectories of the LPC analysis parameter sequence as an alternative to delta parameters
F. Freitag, E. Monte, Javier Hernando
On the dual role of sequence directionality and coherence in a spectral predictive discrimination model
P. V. S. Rao, R. Raveendran
On the decorrelation of filter-bank energies in speech recognition
Climent Nadeu, Javier Hernando, Monica Gorricho
Time derivatives, cepstrai normaiization, and spectral parameter filtering for continuously spelled names over the telephone
Jean-Claude Junqua, Dominique Fohr, J.-F. Mari, Ted H. Applebaum, Brian A. Hanson
Some new considerations about the spectral form of French stop bursts
Linda Djezzar
Characterization of spectral transition region by various prediction approaches for discriminating stop consonants
P. V. S. Rao, R. Raveendran
Fast automatic segmentation and labeling: results on TIMIT and EUROMO
A. Vorstermanst, Jean-Pierre Martens, Bert Van Coile
Maximum-likelihood estimation for articulatory speech recognition using a stochastic target model
Gordon Ramsay, Li Deng
A comparison of several speech parameters for speaker independent speech recognition and speaker recognition
J. Sirigos, Nikos Fakotakis, George Kokkinakis
Speech parameterization based on phonetic features: application to speech recognition
Nabil N. Bitar, Carol Y. Espy-Wilson
Experiments with linear feature extraction in speech recognition
K. Beulen, L. Welling, Hermann Ney
Nonlinear Feature Transformation Based On Statistical Phoneme Modeling
Christian-M. Westendorf
Skewness and nonstationarity measures applied to reliable speech endpoint detection
Juan L. Navarro-Mesa, Asunción Moreno
The distance set representation of speech segments
Ramesh R. Sarukkai, Dana H. Bollard
Multi-variate mixture probability density modelling of VQ codebook using gradient descent algorithm
S. Dobrisek, R. Mihelic, N. Pavesic
Intensity and vocal effort as cues in the perception of stress
Agaath M. C. Sluijter, Vincent J. van Heuven
The effect of voice pitch on perception of synthetic Polish vowels
Mariusz Owsianny
Perception of prepausal tonal contours: implications for automatic stylization of intonation
David House
Experimental study on perception of the glottal explosive of the Japanese ryukyu dialect
Tomio Takara
Measurement of pitch perception for F0 glides
Christophe d'Alessandro, S. Rosset, O. Piot
Interferences between phonemes: evidence for "perceptual domains" in continuous speech perception
S. Wauquier-Gravelines
The influence of local context on the identification of vowels and consonants
Rob J. J. H. van Son, Louis C. W. Pols
Effect of preceding noise duration on the perception of voiced plosives and vowels
William A. Ainsworth
Influence of a prior knowledge of the vocalic context on stop burst perception
Anne Bonneau, Linda Djezzar, Yves Laprie
Listeners' use of the 'information-accentuation' interdependence in processing implicit and explicit references
Wilma van Donselaar
Analysis and modeling of fundamental frequency contours of English utterances
Hiroya Fujisaki, Sumio Ohno
Symbolic coding of higher-level characteristics of fundamental frequency curves
P. Nicolas, Daniel J. Hirst
A dynamical system model for recognizing intonation patterns
Ken Ross, Mari Ostendorf
Syntactic influence on prosodic phrasing in the framework of the link grammar
Andrew J. Hunt
Effects of time pressure on the choice of accent-lending and boundary-marking pitch configurations in dutch
Johanneke Caspers, Vincent J. van Heuven
Analysis and synthesis of prosodic features in spoken dialogue of Japanese
Mayumi Sakata, Keikichi Hirose
Prosodic influence on segmental quality
Nick Campbell
Towards voice-interactive telephone services in slovenia: on prosody of digits using the sociolinguistic framework
Bojan Petek
Test environment for the two level model of Germanic prominence
Gregor Möhler, Grzegorz Dogil
Linguistic and acoustic characteristics of pause intervals in spontaneous speech
Nancy A. Daly-Kelly
Modeling the contextual effects on prosody in dialog
Y. Yamashita, R. Mizoguchi
Prosodic scoring of word hypotheses graphs
Ralf Kompe, Andreas Kießling, Heinrich Niemann, Elmar Nöth, Ernst Günter Schukat-Talamazzini, A. Zottmann, Anton Batliner
Robust pitch period detection using dynamic programming with an ANN cost function
S. Harbeck, Andreas Kießling, Ralf Kompe, Heinrich Niemann, Elmar Nöth
Automatic detection of major phrase boundaries using statistical properties of superpositional F0 control model parameters
Toshio Hirai, Norio Higuchi, Yoshinori Sagisaka
Using neural networks to locate pitch accents
Paul Taylor
The relation between physiological signals and F0: a quantitative analysis method
Helmer Strik
Analysis of prosodic characteristics in speech advisories and their application to speech output
Masanobu Abe
Pitch and elocution rate of diverts speech
M. GuittonF. Javier Caminero-Gil, Joel Crestel, Laure Charonnat
Detection of accents, phrase boundaries and sentence modality in German with prosodic features
Volker Strom
Synthesis and evaluation of intonation with a superposition model
Yann Morlec, Gérard Bailly, Véronique Aubergé
Pitch accent classification of fundamental frequency contours by hidden Markov models
Marcus Fach, Wolfgang Wokurek
Measuring the perceptual similarity of pitch contours
Dik J. Hermes
Microprosodic study of isolated French word corpora
Philippe Langlais
Interpolation properties of linear prediction parametric representations
Kuldip K. Paliwal
Interpolation of spectral information for low bit rate speech coding
H. B. Choi, W. T. K. Wong, B. M. G. Cheetham, C. C. Goodyear
Adaptive flux interpolation, flow-based prediction, delta or delta-delta coefficients: which is best?
Ladan Baghai-Ravary, Steve Beet, Osman Tokhit
LSP Markov model for reducing the complexity of vector quantization
B. Kovesi, S. Saoudi, J. M. Boucher, Z. Reguly
Predictive delta adaptive scalar quantization: an efficient method for coding the short-term speech spectrum
H. R. Sadegh Mohammadi, W. H. Holmes
Conditional split vector quantization of LSP parameterswith multiple search
Dong-Il Chang, Kyungmin Na, Souguil Ann
Matrix Product Quantization For Very-low-rate Mobile Speech Communications
Stefan Bruhn
Robust vector quantization for low bit rate speech coding
U. Balss, Herbert Reininger, H. Schalk, Dietrich Wolf
Scalar quantization of LSF parameters at 28 bits/frame
H. C. Ng, S. H. Leung
Improvement of the quality of speech synthesis by analysis using segmentation and modeling of the excitation signal
J. M. Gutierrez Arriola, F. M. Gimenez de los Galanes, M. H. Savoji
A text-to-speech synthesizer for the Polish language
Przemyslaw Dymarski, Slawomir Kuklinski, Siawomir Kula
A comparison of different speech units for the German TTS-system tubsy
C. Jürgens, M. Wunderlich
Confusions among Italian consonants in good and in telephone conditions: differences between text-to-speech systems and natural speech with noise
Cristina Delogu, Andrea Paoloni, Paola Ridolfi
Text-to-speech synthesis for welsh and welsh English
Briony Williams
Multi-lingual testing of a self-learning approach to phonemic transcription of orthography
Ove Andersen, Paul Dalsgaard
An integrated multi-dialect speech recognition system with optional speaker adaptation
V. Beattie, S. Edmondson, D. Miller, Y. Patel, G. Talvola
A comparative study of speaker adaptation techniques
Leonardo Neumeyer, Ananth Sankar, Vassilios Digalakis
Adaptation algorithms for large scale HMM recognizers
G. Zavaliagkos, R. Schwartz, John McDonough, John Makhoul
Speaking-style and speaker adaptation for the recognition of spontaneous dialogue speech
Shoichi Matsunaga, Tetsuo Kosaka, Tohru Shimizu
Speaker adaptation for telephone based speech dialogue systems
S. Dobler, Hans-Wilhelm Rühl
Speaker adaptation with autonomous control using tree structure
Koichi Shinoda, Takao Watanabe
Speaker adaptation fitting training data size and contents
Masahiro Tonomura, Tetsuo Kosaka, Shoichi Matsunaga, Akito Monden
Direct and joint-space approaches to the use of spectral transformation for speaker adaptation in continuous speech recognition
H. C. Choi, R. W. King
Flexible speaker adaptation for large vocabulary speech recognition
C. J. Leggetter, Phil C. Woodland
Faust - a directory assistance demonstrator
B. Kaspar, G. Fries, K. Schuhmacher, Antje Wirth
Some signals of emotional arousal: analysis of conversations using a multimodal interaction database
Keiko Watanuki, Fumio Togawa
Speech synthesis in spoken dialogue research
Gösta Bruce, Björn Granström, M. Filipsson, Kjell Gustafson, Merle Horne, David House, B. Lastow, Paul Touati
Dynamically created dialogues for automated telephone answering using uncertain reasoning and linguistic theory
Mary Zajicek, Ken Brownsey, Simon Lippmann, Patrice Palau, Phyl Greenhead
Modeling dialogue control strategies to relieve speech recognition errors
Y. Niimi, Y. Kobayashi
Constraining of input media in a spoken dialogue system
Anders Baekgaard
AN Automatic Creation Of The Language Model For The Spontaneous Czech Speech Recognizer
Jana Kleckova, Vaclav Matousek, Jana Netrvalova
A grammar of conversational English
M. O'Kane, P. E. Kenne, H. G. Pearcy
Robust comprehension in a spoken dialog system
Eric Brison, Nadine Vigouroux
Korean-Japanese speech translation system for hotel reservation - Korean front desk side
Youngjik Lee, Young-Sum Kim, Jung-Chul Lee, Joon-Hyung Ryoo, Jae-Woo Yang
Unconstrained speech retrieval for Chinese document databases with very large vocabulary and unlimited domains
Sung-Chien Lin, Lee-Feng Chien, Keh-Jiann Chen, Lin-Shan Lee
Integrating partial syntactical analysis and plan recognition for understanding DB natural language queries
Cristina Delogu, Andrea Di Carlo, Rino Falcone
Communicative language model: structure and functioning
Yuri A. Kosarev
Integrating heuristic preferences into a neural understanding system
Adelaide Stevenur, Patrick Gallinari
A language interface to a polyphone-based speech synthesizer
Björn Gamback, Martin Eineborg, Mikael Eriksson, Barbro Ekholm, Bertil Lyberg, Tomas Svensson
Using two-level morphology as a generator-synthesizer interface for concept-to-speech
Georg Niklfeld, Hannes Pirker, Harald Trost
KT-STS: a speech translation system for hotel reservation and a continuous speech recognition system for speech translation
Myoung-Wan Koo, Il-Hyun Sohn, Woo-Sung Kim, Du-Seong Chang
Learning language translation in limited domains using finite-state models: some extensions and improvements
J. M. Vilar, A. Marzal, Enrique Vidal
The effect of context on the intelligibility of dialogue
David G. Novick, Karen Ward, Benjamin Corliss
Representation of a finite state grammar as bigram language model for continuous speech recognition
Ute Kilian, Fritz Class, Alfred Kaltenmeier, Peter Regel-Brietzmann
Extensions of absolute discounting for language modeling
M. Generet, Hermann Ney, F. Wessel
Automatic clustering of words for probabilistic language models
Loreia Moisa, Egidio Giachin
Algorithms for bigram and trigram word clustering
Sven Martin, Jörg Liermann, Hermann Ney
More efficient clustering of n-grams for statistical language modeling
Joerg P. Ueberla
Generation of language models using the results of image analysis
Uta Naeve, Gudrun Socher, Gernot A. Fink, Franz Kummert, Gerhard Sagerer
Modelling pronunciation variability for special domains
Gudrun Flach
On the use of pronunciation rules for improved word recognition
Nick Cremelie, Jean-Pierre Martens
Reducing memory requirements and computational costs for the baum-welch algorithm and application to automatic stochastic network grammar acquisition
Jin'ichi Murakami
Language model speaker adaptation
Stefan Besling, Hans-Günter Meier
Sentence hypothesisation using NG-gram models
Jerneja Gros, R. Mihelic, N. Pavesic
Optimizing lexical and N-gram coverage via judicious use of linguistic data
Ronald Rosenfeld
A language model for compound words in speech recognition
Marcus Spies
Collecting and analyzing spoken utterances for a speech controlled application
Johannes Müller, Holger Stahl
Speech intelligibility and loudness assessment in a wireless personal communication
Hiromi Nagabuchi, Akira Takahashi, Mineyoshi Ogawa
An experimental investigation of the input and error correction strategies used by subjects entering digits with the AURIX speech recogniser
K. S. Hone, R. W. Series, C. Baber
Goal-directed generation of intelligibility test vocabularies in the framework of names synthesis
Karim Belhoula
Human factors of a voice-controlled car stereo
Reinhold Haeb-Umbach, Stephan Gamm
Exploring the limits of system-directed dialogue, dialogue evaluation of the danish dialogue system
Niels Ole Bernsen, Hans Dybkjaer, Laila Dybkjaer
Human benchmarks for speaker independent large vocabulary recognition performance
David A. van Leeuwen, Leo-Geert van den Berg, Herman J. M. Steeneken
Predictive assesment for speaker independent isolated word recognisers
Alison Simons
Consistency of inter-transcribers' transcription
Kobayashi Satoshi, Kitazawa Shigeyoshi
Comparison of reference system approaches for the quality assessment of synthesized speech
H. Klaus, A. Niebank
Multi-lingual assessment of speaker independent large vocabulary speech-recognition systems: THE SQALE-PROJECT
Herman J. M. Steeneken, David A. van Leeuwen
Error analysis on field data and improved garbage HMM modelling
K. Bartkova, D. Dubois, D. Jouvet, J. Monné
Interference of speech recognition feedback during diagnostic tasks
E. J. A. Verheijen, F. L. van Nes, L. M. de Bruyn, A. Hasman, J. W. Arends
Geometric and temporal constraints in the production of French consonant sequences x-ray and acoustic data for French
Beatrice Vaxelaire
On the biomechanical control variables of the tongue during speech movements
Rafael Laboissiere, Vitpoxio Sanguineti, Yohan Payan
Multipulse LPC modeling of articulatory movements: analysis and interpretation
Soumya Bouabana, Shinji Maeda
Numerical simulations of fluid flow in the vocal tract
G. Richard, M. Liu, D. Snider, H. Duncan, Qiguang Lin, James L. Flanagan, Stephen Levinson, D. Davis, S. Slimon
3-d fem analysis of sound propagation in the nasal tract
Hisayoshi Suzuki, Takayoshi Nakai, Hiroshi Sakakibara
Effects of tonal clash on downstepped h* accents in Spanish
Pilar Prieto, Chilin Shih
The effect of two factors related to speaking tempo on vowel devoicing in Japanese
Mariko Kondo
Duration perception in subsyllabic constituents
Rob Goedemans, Vincent J. van Heuven
Experimental evidence for a comprehensive theory of vowel reduction
Dick R. van Bergem
Clear speech does not exaggerate phonemic contrast
John J. Ohala
The pronunciation of unfamiliar native and non-native town names
Susan Fitt
Using two-level morphology to transcribe Swedish names
Joakim Gustafson
An MRI study of French vowels
Didier Demolin, Jean-Marie Hombert, Véronique Lecuit, Christoph Segebarth, Alain Soquet
Connected speech processes: a cross-linguistic study
Marie-Josep Solé, E. Estebas
Variable-length sequence matching for phonetic transcription using joint multigrams
Sabine Deligne, Francois Yvon, Frédéric Bimbot
Building multiple pronunciation models for novel words using exploratory computational phonology
Gary Tajchman, Eric Foster, Daniel Jurafsky
Pragmatic factors affecting the phonetic properties of diphthongs
Lourdes Aguilar, Maria Machuca
The spatial and the temporal dimensions of consonant reduction in conversational Italian
Edda Farnetani
Neutralization of consonant length: the case of dutch intervocalic stops
S. Gillis, G. De Schutter, J. Verhoeven
Sound perception between two languages based on analyses of onomatopoeic expression
Manabu Kotani, Haruya Matsumoto
An approach to language identification with enhanced language model
Yonghong Yan, Etienne Barnard
The application of dynamic programming techniques to non-word based topic spotting
P. Nowell, Roger K. Moore
Language identification based on speech fundamental frequency
Itahashi Shuichi, Du Liang
Two novel language model estimation techniques for statistical language identification
Michael A. Lund, Herbert Gish
Recognized phoneme-based N-gram modeling in automatic language identification
HingKeung Kwan, Keikichi Hirose
Discriminative-transitional/steady units for Spanish continuous speech recognition
A. Varona, I. Torres, F. Casacuberta
An HMM with optimized segment-dependent observations for speech recognition
Ji Mingy, Peter O'Boyle, Jack Smith
Improving recognition performances on field data with an a-priori segmentation of the speech signal
T. Moudenc, D. Jouvet, J. Monné
Connected digit recognition using statistical template matching
L. Welling, Hermann Ney, A. Eiden, C. Forbrig
Calculation of distance measures between hidden Markov models
Markus Falkhausen, Herbert Reininger, Dietrich Wolf
A chernoff distance based segmental probability model (CD-SPM) approach for Mandarin syllable recognition
Jia-lin Shen, Lin-shan Lee
Geometric pattern recognition techniques for acoustic-phonetic decoding of Spanish continuous speech
M. J. Castro, F. Prat, P. Aibar, F. Casacuberta
State tying of triphone HMM's for the 1994 AT&t ARPA ATIS recognizer
Enrico Bocchieri, Giuseppe Riccardi
A shared-distribution approach in a hidden Markov model-based continuous speech recognition system
Azarshid Farhat, Douglas O'Shaughnessy
Preliminary experimentation of different methods for continuous speech recognition in Spanish
Javier Ferreiros, José M. Pardo
Computationally efficient speech enhancement by spectral minima tracking in subbands
Gerhard Doblinger
Robust hos-based techniques applied to speech recognition and enhancement
Josep M. Salavedra, Javier Hernando, Enrique Masgrau, Asunción Moreno
A maximum likelihood equalization technique for robust speech recognition in adverse environments
Kuldip K. Paliwal
Joint system for acoustic echo cancellation and noise reduction
G. Faucon, R. Le Bouquin Jeannes
A study of speech recognition system robustness to microphone variations
Jane Chang, Victor Zue
A feature-space transformation for telephone based speech recognition
Altxandros Potamianos, Li Lee, Richard C. Rose
Development and improvement of a real-time ASR system for isolated digits in Spanish over the telephone line
Ricardo de Cordoba, Xavier Menendez-Pidal, Javier Macias-Guarasa, Ascension Gallardo, José M. Pardo
Channel estimation for reference model adaptation in telephone speech recognition
Jen-Tzung Chien, Lee-Min Lee, Hsiao-Chuan Wang
Enhancement of telephone speech quality by simple spectrum extrapolation method
Hiroshi Yasukawa
Low-distortion spectral subtraction for speech enhancement
Peter Handel
Transition-based feature extraction within frame-based recognition
Zhihong Hu, Etienne Barnard, Ronald A. Cole
Noisy speech enhancement with filters estimated from the speaker's lips
L. Girin, Gang Feng, Jean-Luc Schwartz
Audio-visual speech recognition compared across two architectures
A. Adjoudani, Christian Benoît
Noise effects on landmark detection in a speech recognition system
Sharlene A. Liu
AR identification of the vocal filter from noisy hyperbaric speech signals
Laure Charonnat, Joel Crestel, Michel Glutton, Herve Chuberre
The study of speech/pause detectors for speech enhancement methods
Pavel Sovka, Petr Pollak
Neural-fuzzy network for phonetic features recognition
Robert Sokol, Guy Mercier
A system for speech separation
A. Shamsoddini, P. N. Denbigh
Hierarchical mixture models and phonological rules in open-vocabulary speech recognition
Yunxin Zhao
Study of subword units for Spanish speech recognition
Antonio Bonafonte, Rafael Estany, Eugenio Vives
Speech recognition using a linear dynamic segmental HMM
Wendy J. Holmes, Martin J. Russell
Comparative evaluation of segmental unit input HMM and conditional density HMM
Kazumasa Yamamoto, Seiichi Nakagawa
Continuous speech recognition using non-uniform unit based acoustic and language models
Shoichi Matsunaga, Takeshi Matsumura, Harald Singer
Towards improved speech recognition using a speech production model
C. S. Blackburn, S. J. Young
A vocabulary independent discriminatively trained method for rejection of non-keywords in sub word based speech recognition
Rafid A. Sukkar, Chin-Hui Lee, Bling-Hwang Juang
Rejection techniques based on context independent subword units
Juan Carlos Torrecilla, Daniel Tapias, F. Javier Caminero-Gil, Luis Villarrubia
Detection of unknown words in spontaneous speech
Pablo Fetter, Fritz Class, Udo Haiber, Alfred Kaltenmeier, Ute Kilian, Peter Regel-Brietzmann
A minimum error training of garbage model for keyword spotter with artificially generated training data
Atsushi Nakamura
New words: effect on recognition performance and incorporation issues
I. Lee Hetherington
Improving speech recognition using speaker classification
David O. Baldwin, Georg F. Meyer
Modular neural networks with task-specific input parameters for speakerindependent speech recognition
Axel Glaeser
Large vocabulary speaker-independent continuous speech recognition with a new hybrid system based on MMI-neural networks
Gerhard Rigoll, Ch. Neukirchen, J. Rottland
REMAP: recursive estimation and maximization of a posteriori probabilities in connectionist speech recognition
Hervé Bourlard, Yochai Konig, Nelson Morgan
An RNN based speech recognition system with discriminative training
Tan Lee, P. C. Ching, L. W. Chan
Preliminary experiments for automatic speech understanding through simple recurrent networks
M. A. Castano, Enrique Vidal, F. Casacuberta
A neural network using non-uniform units for continuous speech recognition
Ha-Jin Yu, Yung-Hwan Oh
Temporal correlation modeling in a hybrid neural network/hidden Markov model speech recognizer
Horatio Franco, Vassilios Digalakis
Continuous speech segmentation with the gamma memory model
Laurent Buniet, Dominique Fohr
Incorporating fuzzy modelling in a hybrid HMM-ANNs system for CSR tasks
Xavier Menendez-Pidal, Ricardo de Cordoba, Javier Ferreiros, José M. Pardo
Neural networks for nonlinear discriminant analysis in continuous speech recognition
Wolfgang Reichl, S. Harengel, F. Wolfertstetter, Günther Ruske
Speech recognition experiments with a new multilayer LVQ network (MLVQ)
Gerhard Rigoll
Speaker-adaptation for hybrid HMM-ANN continuous speech recognition system
Joao Neto, Luis Almeida, Mike Hochberg, Ciro Martins, Luis Nunes, Steve Renals, Tony Robinson
Distributed binary representations for word recognition by TDNN-DTW hybrid systems
Premysl Puzrla, Frédéric Bimbot, Christoph Windheuser
A robust discrimination method based on selectively trained neural networks for confusable words in noisy conditions
Yolande Anglade
Connectionist speaker normalization and adaptation
Victor Abrash, Horacio Franco, Ananth Sankar, Michael Cohen
A competitive algorithm for training HMM for speech recognition
Pedro L. Galindo
Predictive connectionist speech recognition with a new discriminant learning algorithm
Martin Paping, Hans Marti, Mark Renfer
Preliminary results on speech signal segmentation with recurrent neural networks
Antonio J. Rubio, Ronan G. Reilly
Text independent neural network/rule based hybrid, continuous speech recognition
Klara Vicsi, Attila Vig
Automatic recognition of Cantonese lexical tones in connected speech by multi-layer perceptron
Ying Pang Ng, P. C. Ching, L. W. Chan
Combining HMM processing and formant measurements in automatic speech recognition
Dave Abberley, Phil Green
Recurrent neural prediction models for speech recognition
Kyungmin Na, Jekwan Ryu, Dong-Il Chang, Soo-Ik Chae, Souguil Ann
Exploiting acoustic-phonetic knowledge and neural networks for stop recognition
Linda Djezzar, Jean-Paul Haton
Estimation of speech formant-dynamics using neural networks
P. Gomez, V. Rodellar, A. Alvarez, J. Bobadilla, J. Bernal, V. Nieto, M. Perez
The role of linguistic stress in the time course of word recognition in stress-accent languages
Willy Jongenburger, Vincent J. van Heuven
Inter-language differences in the mcgurk effect for dutch and Cantonese listeners
Beatrice de Gelder, Paul Bertelson, Jean Vroomen, Hsuan Chin Chen
Listeners representations of within-word structure: a cross-linguistic and cross-dialectal investigation
Takashi Otake, Sally M. Davis, Anne Cutler
The use of phonotactic constraints in the segmentation of dutch
James M. McQueen, Ethan Cox
Lexical inhibition in spoken word recognition
Jean Vroomen, Beatrice de Gelder
Methodological aspects in a multimedia database of vocal fold pathologies
Maurilio Nunes Vieira, Fergus R. McInnes, Mervyn A. Jack, Arnold Maran, Colin Watson, Moira Little
The application of volterra LMS adaptive filtering to speech enhancement for the hearing impaired
V. Udayashankara, A. P. Shivaprasad
CAPDA: managing intelligibility in children and young adults with down's syndrome or speech disorders
P. Rosso, J. H. Wright, M. Smith
Evaluation of a system for segmental speech quality assessment: voiceless fricatives
Alan A. Wrench, Mary S. Jackson, David S. Soutar, A. Gerry Robertson, Janet MacKenzie Beck
A diagnostic and rehabilitation aid workstation for speech and voice pathologies
B. Teston, B. Galindo
Improvement, evaluation and testing of a low cost multilingual portable speaking aid for the speech impaired
Geza Nemeth, Gabor Olaszy, Laszlo Pataki, Luis Hernandez Gomez, Diamantino Freitas
Empirical study to test the independence of different acoustic voice parameters on a large voice database
Dirk Michaelis, Hans Werner Strube
The spectral analysis of infant cry: an initial approximation
Sergio D. Cano Ortiz, Daniel Escobedo Beceiro, Manuel Socarras Reyes
Portable speech rate conversion system
N. Seiyama, A. Nakamura, A. Imai, T. Takagi, E. Miyasaka
A field test of sivo aid in China
Jialu Zhang
Analysis for palatalized articulation of [s] sounds using synthetic speech
Takayuki Arai, Keiko Okazaki, Setsuko Imatomi, Yuichi Yoshida
An attempt to classify LX signals
Krzysztof Marasek
Time series analysis of glottal cycle lengths of healthy and dysphonic speakers
Jean Schoentgen, Raoul de Guchteneere
Permugram language models
Ernst-Günter Schukat-Talamazzini, R. Hendrych, Ralf Kompe, Heinrich Niemann
GLR-parsing of word lattices using a beam search method
Steffen Staab
Integrating natural language into the word graph search for simultaneous speech recognition and understanding
Stephanie Seneff, Michael McCandless, Victor Zue
A statistical approach to language modelling for the ATIS task
Joshua Koppelman, Stephen Delia Pietra, Mark Epstein, Salim Roukos, Todd Ward
Lattice parsing and application of integrated language models for speech recognition
G. J. F. Jones, H. Lloyd-Thomas, J. H. Wright
Robust parsing of n-best speech hypothesis lists using a general grammar-based language model
Manny Rayner, Peter Wyard
Improvements in tree-based language model representation
Fabio Brugnara, Mauro Cettolo
Language modeling of spontaneous speech in a court context
P. E. Kenne, M. O'Kane, H. G. Pearcy
Study of vowel variations for a Mandarin speech synthesizer
Chilin Shih
Using statistical models to predict phrase boundaries for speech synthesis
Eric Sanders, Paul Taylor
Naturalness in a high-level synthetic speech system
Mark Tatham, Eric Lewis
Pragmatic effects in speech synthesis
Katherine Morton, Mark Tatham
A scheme for a model-based synthesis by rule of F0 contours of German utterances
Hansjörg Mixdorff, Hiroya Fujisaki
Speech synthesis by rule based on synthesis units considering prosodic features
Yasushi Ishikawa, Kunio Nakajima
The generation of prosody in the nijmegen rule oriented speech synthesis system
J. Kerkhoff, T. Rietveld
Phonological rules modelling style variations of 'e' caduc in French parisian spontaneous speech for text-to-speech synthesis
Pean Vincent, Lacheret-Dujour Anne
Voiced diphone synthesis using a parametric model and formant based mapping
Dongbing Wei, J. W. Devaney, C. C. Goodyear
Robust automatic extraction of diphones with variable boundaries
Debra Yarrington, H. Timothy Bunnell, Gene Ball
Generation of articulatory synthesiser parameters from formant frequencies using a cubic mapping function
A. R. Greenwood
Two-mass models for speech synthesis
R. N. J. Veldhuis, I. J. M. Bogaert, N. J. C. Lous
Introducing a parametric consonantal model to the articulatory speech synthesiser
Mats Bavegard
High-quality Japanese text-to-speech system: NARSYS
Nobuyuki Katae, Tatsuro Matsumoto, Shinta Kimura, Mitsuko Kaseda, Takayuki Ohyama
An investigation of locus equations as a source of information for consonantal place
Mohamed Yeou
Automatic labelling of multi-sensor speech database: issues and perspectives
Nathalie Parlangeau, Regine Andre-Obrecht, Alain Marchal
What does consonant reduction look like, if it exists?
Rob J. J. H. van Son, Louis C. W. Pols
Articulatori-acoustic vowel prototypes for speech production
Gérard Bailly, Louls-Jean Boe, N. Vallee, Pierre Badin
Experimental study of the target theory of vowel production
M. Pitermann, Jean Schoentgen
Comparing tongue kinematic and acoustic phasing patterns for vowel quantity contrasts in WOLOF
R. Sock, Pascal Perrier, Anders Löfqvist
Evaluation of a vowel normalisation procedure based on speech production knowledge
Pascal Perrier, Lian Apostol, Yohan Payan
Dialect specific features of australian English diphthongs in spontaneous speech
Santha Sampath
Adaptation of a two-mass model of the vocal cords to a particular speaker
Christophe Vescovi, Eric Castelli, Xavier Pelorson
The money talks interactive speech technology assessment: a report from the field
Stephen Springer, Sara Basson, Ashok Kalyanswamy, Edward Man, Dina Yashchin
Speech and tactile-based georal system
J. Siroux, M. Guyomard, Y. Jolly, F. Multon, C. Remondeau
Vocalist: a robust, portable spoken language dialogue system for telephone applications
Norman M. Fraser, J. H. Simon Thornton
A prototype of a Japanese-Korean realtime speech translation system
Masami Suzuki, Naomi Inoue, Fumihiro Yato, Kazuya Takeda, Seiichi Yamamoto
Query-response relationships in the oasis speech-recognition system
B. L. Zeigler, B. Mazor
Development of spoken language corpora for travel information
Lori Lamel, S. Rosset, S. Bennacef, H. Bonneau-Maynard, L. Devillers, Jean-Luc Gauvain
Empirical evaluation of human performance and agreement in parsing discourse constituents in spoken dialogue
Giovanni Flammia, Victor Zue
Multimodal spoken dialogue systems and rapid-prototyping
Tsuneo Nitta, Mika Amamiya, Hiroyuki Kamio, Hiroshi Matsu'ura, Arisa Uchiyama, Masafumi Tamura
Development and evaluation of a spoken dialogue for a telephone based transaction system
Lars Bo Larsen
Integrating spelling into spoken dialogue recognition
Hermann Hild, Alex Waibel
The application of parallel model combination to a large vocabulary dictation task
M. J. F. Gales, S. J. Young
Blind equalization using adaptive filtering for improving speech recognition over telephone
C. Mokbel, D. Jouvet, J. Monné
Speech-seeking microphone array with multi-stage processing
Yuchang Cao, Sridha Sridharan, Miles Moody
Recognition of noisy speech using an auditory model
Halewijn Vereecken, Jean-Pierre Martens
Stress independent robust HMM speech recognition using neural network stress classification
Brian D. Womack, John H. L. Hansen
Speech enhancement using two versions of the noisy speech signal
Klaus Linhard
Design and optimization of a two microphone speech enhancement system
Rainer Martin
Time delay estimation for microphone array speech enhancement systems
Martin Drews
Speech enhancement by eigen decomposition with two-channel observations
Yuchang Cao, Sridha Sridharan, Miles Moody
Robust continuous speech recognition using a microphone array
D. Giuliani, M. Matassoni, M. Omologo, P. Svaizer
Factors affecting F0 peak displacement in Spanish
Joaquim Llisterri, Rafael Mann, Carme de la Mota, Antonio Rios
Generation of intonation: a global approach
Véronique Aubergé, Gérard Bailly
Prevocalic consonant duration in Swedish: effects of vowel quality and postvocalic place of articulation
Peter E. Czigler, Dawn M. Behne
Intonation gesture of slovene: first indications
P. Vitez, Véronique Aubergé
Intrinsic prosodic values and segmental context
Barbara Heuft, Thomas Portele
Describing speech styles using prosody: a pilot study
Andy Tams, Mark Tatham, Julian H. Page
Stress and intonation in Spanish for affirmative and interrogative sentences
C. Franchon Cabrera
Statistical methods for the automatic labelling of German prosody
Michael Lehning
Investigation on unknown word processing and strategies for spontaneous speech understanding
Atsuhiko Kai, Seiichi Nakagawa
New n-best based rejection techniques for improving a real-time telephonic connected word recognition system
F. Javier Caminero-Gil, Celinda de la Torre-Munilla, Lúis Hernandez-Gomez, Cesar Martin del Álamo
Detection of unknown words using garbage cluster models for continuous speech recognition
Hiroyuki Sakamoto, Shoichi Matsunaga
Detection of unknown words and its evaluation
A. Jusek, Gernot A. Fink, Franz Kummert, H. Rautenstrauch, Gerhard Sagerer
Minimum duration constrained non-keyword modeling and rejection for word spotting
Seung-Bae Lee, Lag-Yong Kim, Min-Seong Kim, Jong-Seok Lee, Shin-Wook Kang
Rejection capabilities for HMM-based speech recognizers
Sari Accaino, Bart D'hoore, Johan Vantieghem, Dirk Van Compernolle
Multi-lingual connected digits recognition
Joan Salavedra, Claus Jacobsen, Mazin G. Rahim, Ilija Zeljkovic, Jay G. Wilpon
Recognition of spontaneously spoken connected numbers in Spanish over the telephone line
Celinda de la Torre-Munilla, Lúis Hernandez-Gomez, F. Javier Caminero-Gil, Cesar Martin del Álamo
Lexical fillers for task-independent-training based keyword spotting and detection of new words
R. El Meliani, Douglas O'Shaughnessy
Topic spotting with task independent models
Michael J. Carey, Eluned S. Parris
Phrase spotting using pitch pattern information
Toshiyuki Hanazawa, Yoshiharu Abe, Kunio Nakajima
Speech understanding and speech retrieval for TV news by using connected word spotting
Yoshiaki Itoh, Jiro Kiyama, Ryuichi Oka
Talker-independent keyword spotting for information retrieval
J. T. Foote, G. J. F. Jones, K. Sparck Jones, S. J. Young
Large vocabulary word scoring as a basis for transcription generation
P. Jeanrenaud, M. Siu, Herbert Gish
New word spotting algorithm based on forward decoding
Jin'ichi Murakami
Word- and phrase spotting with syllable-based garbage modelling
H. Klemm, Fritz Class, Ute Kilian
Influence of short-time phase on the perception of stop consonants
Li Liu, Jialong He, Günther Palm
Enhancing the perceptual salience of information-rich regions of natural intervocalic consonants
Andrew Simpson, Valerie Hazan
A method to quantify the error distribution in confusion matrices
Rob J. J. H. van Son
Speech and nonspeech signal densities for the perception of temporal order
Luis E. Lopez-Bascuas
Researching the processing structures of human phoneme recognition by analysis of natural stop-consonant-vowel utterances that elicit correct recognition through unusual acoustic patterns
Eduardo Sa Marta, Fernando Perdigao, Luis Vieira de Sa
The perception of voicing in Spanish sibilants
Kirk A. Widdison
Development of the perception of initial prevocalic [r] and [l] by English children
Elzbieta B. Slawinski
On phonetic boundaries across categories for synthetic and natural vocalic speech sounds
Francesco Cutugno, Renata Savy
Effects of syllabic position in the perception of spoken English
Athanassios Protopapas, Steven Finney, Peter D. Eimas
Article |
---|