doi: 10.21437/Eurospeech.1993
ISSN: 1018-4074
Dictation, directories, and data bases; emerging PC applications forlarge vocabulary speech recognition
Janet M. Baker
Speech database annotation. the importance of a multi-lingual approach
William Barry, Paul Dalsgaard
Identifying non-linguistic speech features
Lori F. Lamel, Jean-Luc Gauvain
A new generation of spoken dialogue systems: results and lessons from the sundial project
Jeremy Peckham
Whither a theory of speech pattern processing?
Roger K. Moore
Speech coding for communications
Peter Noll
Modeling and search in continuous speech recognition
Hermann Ney
Trends in speaking styles research
Maxine Eskenazi
Models of speech recognition; personal perspectives on particular approaches
John S. Bridle
The conversational computer: an apple perspective
Kai-Fu Lee
Speech quality assessment and evaluation
Ute Jekosch
Timing in text-to-speech systems
Jan P. H. van Santen
Learning how to understand language
Roberto Pieraccini, Esther Levin, Enrique Vidal
M-LCELP speech coding at bit-rates below 4kbps
Kazunori Ozawa, Masahiro Serizawa, Toshiki Miyano, Toshiyuki Nomura
Fast vector quantization using neural maps for CELP at 2400bps
Eduardo Lopez-Gonzalo, Luis A. Hernandez-Gomez
Improving the speech quality of CELP-coders by optimizing the long-term delay determination
U. Balss, U. Kipper, Herbert Reininger, Dietrich Wolf
A stochastic speech coder with multi-band long-term prediction
Carmen Garcia-Mateo, J. L. Alba-Castro, Luis A. Hernandez-Gomez
Intelligibility evaluation of 4-5 kbps CELP and MBE vocoders: the hermes program experiment
B. W. M. Wery, Herman J. M. Steeneken
Algorithms for the CELP coder with ternary excitation
P. Dymarski, N. Moreau
Complexity reduction for federal standard 1016 CELP coder
M. Mauc, G. Baudoin, M. Jelinek
Objective analysis of the GSM half rate speech codec candidates
F. Wuppermann, Christiane Antweiler, M. Kappelan
A 5600 BPS VSELP speech coder candidate for half-rate GSM
Ira A. Gerson, Mark A. Jasiuk
A speech coder for TV programme description
A. M. Kondoz, B. G. Evans, M. R. Suddle
Pitch synchronous innovation CELP (PSI-CELP)
Satoshi Miki, Kazunori Mano, Hitoshi Ohmuro, Takehiro Moriya
Vocoder design based on HOS
Asunción Moreno, José A. R. Fonollosa, Josep Vidal
Emulation of a formant vocoder at 600 and 800 bps
Nigel Sedgwick
A pitch synchronized synthesizer for the IMBE vocoder
W. Ma, A. M. Kondoz, B. G. Evans
An analysis of the performances of the MBE model when used in the context of a text-to-speech system
Thierry Dutoit, Henri Leich
High-quality synthesis of LPC speech using multiband excitation model
C. F. Chan
High-quality speech coding at 2.4 kbps based on time-frequency interpolation
Yair Shoham
Coding of speech signal by fractal techniques
Luca Marcato, Enzo Mumolo
A new reference signal for evaluating the quality of speech coded at low bit rates
Naomi Asanuma, Hiromi Nagabuchi
A psychophysical study of fourier phase and amplitude coding of speech
Changxue Ma, Douglas O'Shaughnessy
Recovery of vocal tract midsagittal and area functions from speech signal for vowels and fricative consonants
Denis Beautemps, Pierre Badin, Rafael Laboissiere
Strange attractors and chaotic dynamics in the production of voiced and voiceless fricatives
Shrikanth S. Narayanan, Abeer A. Alwan
Frequency variations of the lowest main spectral peak in sibilant clusters
Noel Nguyen, Philip Hoole
Vocalic reduction : prediction of acoustic and articulatory variabilities with invariant motor commands
Helene Loevenbruck, Pascal Perrier
Compensating for labial perturbation in a rounded vowel: an acoustic and articulatory study
C. Savariaux, Pascal Perrier, J. P. Orliaguet
Resistance of bilabials /p, b/ to anticipatory labial and mandibular coarticulation from vowel types /i, a, u/
Rudolph Sock, Anders Löfqvist
Jaw phasings and velocity profiles in arabic
Mounir Jomaa, Christian Abry
Derivation of the transfer function for a speech production model including the nasal cavity
Morten Olesen
Using artificial neural nets to compare different vocal tract models
Mats Bavegard, Jesper Högberg
A time-evolving three-dimensional vocal tract model by means of magnetic resonance imaging (MRI)
Arne Kjell Foldvik, Ulf Kristiansen, Jorn Kvaerness
Physiologically-motivated modeling of the voice source in articulatory analysis/synthesis
Juergen Schroeter, Bert Cranen
Estimation of source parameters by frequency analysis
Luis C. Oliveira
Fitting a LF-model to inverse filter signals
Helmer Strik, Bert Cranen, Lou Boves
Modelling the glottal pulse with a self-excited threshold auto-regressive model
Jean Schoentgen
Going back to the source: inverse filtering of the speech signal with ANNs
J. Denzler, Ralf Kompe, Andreas Kießling, Heinrich Niemann, Elmar Nöth
Low cost speaker dependent isolated word speech preselection system using static phoneme pattern recognition
Manuel A. Leandro, Jose M. Pardo
High performance speaker-independent phone recognition using CDHMM
Lori F. Lamel, Jean-Luc Gauvain
Speaker-independent continuous speech dictation
Jean-Luc Gauvain, Lori F. Lamel, Gilles Adda, M. Adda-Decker
Automatic speech recognition without phonemes
Ernst G. Schukat-Talamazzini, Heinrich Niemann, Wieland Eckert, T. Kuhn, S. Rieck
Spoken language identification using ergodic HMM with emphasized state transition
Takashi Seino, Seiichi Nakagawa
Neural time warping
Bruno Apolloni, Dario Crivelli, Marco Amato
Speaker independent small vocabulary speech recognition using MLPs for phonetic labeling
Philippe Le Cerf, Dirk Van Compernolle
Multiresolution time-sequency speech processing based on orthogonal wavelet packet pulse forms
Andrzej Drygajlo
The application of the wavelet transform for speech processing
Eliathamby Ambikairajah, M. Keane, L. Kilmartin, G. Tattersall
Duration modelling with multiple split regression
Naoto Iwahashi, Yoshinori Sagisaka
Factors affecting adaptation to time-compressed speech
Gerry T. M. Altmann, Duncan Young
Waveform similarity based overlap-add (WSOLA) for time-scale modification of speech: structures and evaluation
Marc Roelands, Werner Verhelst
A study on the weighting factors of two-dimensional cepstral distance measure
Hsiao-Chuan Wang, Hsiao-Fen Pai
Connection between weighted LPC and higher-order statistics for AR model estimation
Yves Kamp, Changxue Ma
Integration of acoustic and visual speech for speaker recognition
C. C. Chibelushi, J. S. Mason, R. Deravi
Discriminant AR-vector models for free-text speaker verification
Claude Montacie, Jean-Luc Le Floch
Within class optimization of cepstra for speaker recognition
J. Thompson, J. S. Mason
Text-free speaker recognition using an arithmetic-harmonic sphericity measure
Frédéric Bimbot, Luc Mathan
Albayzin speech database: design of the phonetic corpus
Asunción Moreno, Dolors Poch, Antonio Bonafonte, Eduardo Lleida, Joaquim Llisterri, Jose B. Marino, Climent Nadeu
A software tool for speech collection, recognition and reproduction
Carlos Ribeiro, Isabel Trancoso, Antonio Serralheiro
An object-oriented database for speech processing
Matti Karjalainen, Toomas Altosaar
Automatic annotation using multi-sensor data
Dominic S. F. Chan, Adrian J. Fourcin
Prolog tools for accessing the phondat database of spoken German
Christoph Draxler, Hans G. Tillmann, Barbara Eisen
Cluster-similarity: a useful database for speech processing
Ute Jekosch
SIRVA - a large speech database collected on the Italian telephone network
G. Castagneri, G. Di Fabbrizio, A. Massone, M. Oreglia
Objective assessment of speech communication systems; introduction of a software based procedure
Herman J. M. Steeneken, J. A. Verhave, Tammo Houtgast
Enhanced direct assessment of speech input systems within the SAM-a esprit project
Sven W. Danielsen
Evaluation of prosody in the French version of multilingual text-to-speech synthesis: neutralising segmental information in preliminary tests
Pascale Nicolas, Pascal Romeas
A clinical voice evaluation system
Sokol Saliu, Hideki Kasuya, Yasuo Endo, Yoshinobu Kikuchi
A speech therapy workstation for the assessment of segmental quality: voiceless fricatives
Alan A. Wrench, M. S. Jackson, Mervyn A. Jack, D. S. Soutar, A. G. Robertson, J. MacKenzie, John Laver
A speech enhancement system using higher order ar estimation in real environments
Josep M. Salavedra, Enrique Masgrau, Asunción Moreno, Xavier Jove
Proposal of a composite measure for the evaluation of noise cancelling methods in speech processing
R. Le Bouquin, G. Faucon, A. Akbariazirani
The use of linear prediction and spectral scaling for improving speech enhancement
P. M. Crozier, B. M. G. Cheetham, C. Holt, E. Munday
Robust speaker-independent speech recognition using non-linear spectral subtraction based IMELDA
Helge B. D. Sorensen, Uwe Hartmann
Intra- and interspeaker variation of /r/ in dutch
Willem H. Vieregge, A. P. A. Broeders
An acoustic approach to fricatives in Japanese and German
Mechtild Tronnier, Masatake Dantsuji
The relationship between spelled and spoken portuguese: implications for speech synthesis and recognition
M. Céu Viana, Isabel Trancoso, Carlos Ribeiro, Amalia Andrade, Ernesto d'Andrade
Phonetic transcription standards for european names (ONOMASTICA)
M. Schmidt, S. Fitt, C. Scott, Mervyn A. Jack
Data-driven identification of poly- and mono-phonemes for four european languages
Ove Andersen, Paul Dalsgaard, William Barry
Reversible letter-to-sound sound-to-letter generation based on parsing word morphology
Sheri Hunnicutt, Helen Meng, Stephanie Seneff, Victor W. Zue
The role of context in the automatic recognition of stressed syllables
Jan Moore, Peter Roach
Metrical structure and the perception of time-compressed speech
Duncan Young, Gerry T. M. Altmann, Anne Cutler, Dennis Norris
Are stress and phonemic string processed separately? evidence from speech illusions
Valerie Pasdeloup, José Morais, Régine Kolinsky
Vowel identification as influenced by vowel duration and formant track shape
Rob J. J. H. van Son, Louis C. W. Pols
Modelling spectral dynamics for vowel classification
William D. Goldenthal, James R. Glass
Perceptive and spectral volumes of synthesized and natural vowels
Milan Stamenkovic, Juraj Bakran, Peter Tancig, Marijan Miletic
Labeller - a system for automatic labelling of speech continuous signal
Ryszard Gubrynowicz, Adam Wrzoskowicz
Towards automatic speech-to-text alignment
Ake Andersson, Holger Broman
Sound duration modelling and time-variable speaking rate in a speech recognition system
Nelly Suaudeau, Regine Andre-Obrecht
Using relative duration in large vocabulary speech recognition
M. Jones, Phil C. Woodland
Duration of phones as function of utterance length and its use in automatic speech recognition
Yifan Gong, William C. Treurniet
Duration modelling and multiple codebooks in semi-continuous HMMs for speaker verification
M. E. Forsyth, Mervyn A. Jack
Constraining model duration variance in HMM-based connected-speech recognition
Michael M. Hochberg, Harvey F. Silverman
A new frequency shift function for reducing inter-speaker variance
Christine Tuerk, Tony Robinson
Speaker normalization using constrained spectra shifts in auditory filter domain
Yoshio Ono, Hisashi Wakita, Yunxin Zhao
Self-learning speaker adaptation based on spectral variation source decomposition
Yunxin Zhao
A dynamic approach to speaker adaptation of hidden Markov networks for speech recognition
Tetsuo Kosaka, Edward Willems, Jun-Ichi Takami, Shigeki Sagayama
Speaker normalization and adaptation based on feature-map projection
Lars Knohl, Ansgar Rinscheid
Pitch synchronous calculation of acoustic cues using a cochlea model
Marcel de Leeuw, Jean Caelen
Nonlinear dynamical systems concepts in speech analysis
Stephen McLaughlin, Andrew Lowry
Grouping of acoustical events using cable neurons and the theory of neuronal group selection
Arno J. Klaassen
Computationally efficient methods of calculating instantaneous frequency for auditory analysis
I. R. Gransden, S. W. Beet
Analysing connected speech with wavelets: some Italian data
Francesco Cutugno, Pietro Maturi
Speech transients analysis using AR-smoothed wigner-ville distribution
Krzysztof Marasek
Comparison of the variability of formants and formant targets using dynamic modeling
Michel Pitermann, Jean Caelen
Pitch-synchronous formant extraction by means of a compound auto-regressive model
Jean Schoentgen, Zoubir Azami
A new air flowmeter design for the investigation of speech production
Bernard Teston
Articulatory dynamics of lips in Italian /'vpv/ and /'vbv/ sequences
Emanuela Magno Caldognetto, Kyriaki Vagges, Giancarlo Ferrigno, Claudio Zmarich
Restricted distribution of pharyngeal segments: acoustical or mechanical constraints?
Ahmed M. Elgendy
Vowel normalization by articulatory normalization first attemps for vowel transitions
Yohan Payan, Pascal Perrier
Synthesis and analysis of vocal source with vibration of larynx
Nobuhiro Miki, Naohisa Kamiyama, Nobuo Nagai
Towards an acoustic-phonetic classification of modern standard arabic vowels
Imad Znagui, Sami Boudelaa
Divers' speech: variable encoding strategies
Alain Marchal, Christine Meunier
Phonetic reduction processes in spontaneous speech
L. Aguilar, B. Blecua, M. Machuca, R. Mann
Spectral characteristics of fricative sound
N. R. Ganguli
Automatic speaker recognition and analytic process
Jean-Francois Bonastre, Henri Meloni
Second formant locus-nucleus patterns in French and Swedish
Danielle Duez
Temporal organisation of segments and sub-segments in consonant clusters.
Christine Meunier
Automatic recognition of arabic stop consonants
Abdelkader Betari, Remy Bulot
Acoustic-phonetic decoding of Spanish occlusive consonants
I. Torres, P. Iparraguirre
Normalized vowel system representation for comparative phonetic studies
Philip Christov
Influence of prevocalic consonant on vowel duration in French CV[p] utterances
Cécile Thilly
Temporal variation in consonant clusters in Swedish
Peter Czigler
Discriminant analysis of continuous consonantal spectra
Wiktor Jassem
Training consonants in a computer-aided system for pronunciation teaching
Edmund Rooney, Miriam Eckert, Steven Hiller, Rebecca Vaughan, John Laver
Rhythm analysis of speech and music signals
Andrej Miksic, Bogomir Horvat
The contribution of pitch contour, phoneme durations and spectral features to the character of spontaneous and read aloud speech
Gitta P.M. Laan, Dick R. van Bergem
Prosodic differences in reading style: isolated vs. contextualized sentences
Juan M. Garrido, Joaquim Llisterri, Carme de la Mota, Antonio Rios
Duration and intonation in emotional speech
Jean Vroomen, Rene Collier, Sylvie Mozziconacci
A discriminatively derived linear transform for improved speech recognition
C. M. Ayer, Melvyn J. Hunt, D. M. Brookes
Hidden Markov models assuming a continuous-time dynamic emission of acoustic vectors
Marco Saerens
Speech modelling using cepstral-time feature matrices
Saeed V. Vaseghi, P. N. Conner, Ben P. Milner
A bounded transition hidden Markov model for continuous speech recognition
Yoshiharu Abe, Kunio Nakajima
Speaker independent phoneme recognition using a heuristic search
Ami Moyal, Arnon Cohen
Optimization of an HMM - based continuous speech recognizer
F. Class, A. Kaltenmeier, Peter Regel-Brietzmann
Linear and nonlinear prediction for speech recognition with hidden Markov models
Marco S. Aerens, Hervé Bourlard
Segmental post-processing of the n-best solutions in a speech recognition system
M. N. Lokbani, D. Jouvet, J. Monne
A study of on-line Bayesian adaptation for HMM-based speech recognition
Tatsuo Matsuoka, Chin-Hui Lee
Hidden Markov models using shared vector linear predictors
B. A. Maxwell, Phil C. Woodland
Talker localization and speech enhancement in a noisy environment using a microphone array based acquisition system
M. Omologo, P. Svaizer
Generalized cepstral modeling of speech degraded by additive noise
Takao Kobayashi, Toshio Kanno, Satoshi Imai
Noise quality improvement through SVD equalization
Stylianos Bakamidis, George Carayannis
Speech enhancement by nonlinear spectral estimation - a unifying approach
Fei Xie, Dirk Van Compernolle
Subband array processing for speech enhancement
Kristian Kroschel, Keld Lange
The design and recording of icy, a corpus for the study of intraspeaker variability and the characterisation of speaking styles#
Vincent Pean, Sheila Williams, Maxine Eskenazi
Speaker clustering for improved speech recognition
Andrej Ljolje
Speaker-variability in spectral bands of dutch vowel segments
Henk van den Heuvel, Bert Cranen, A. C. M. Rietveld
A method of classification among Japanese dialects
Shuichi Itahashi, Kimihito Tanaka
Measuring similarities among speakers by means of neural networks
J. A. Hernandez-Mendez, Anibal R. Figueiras-Vidal
Robust endpoint detection of speech in the presence of noise
Maria Rangoussi, Stylianos Bakamidis, George Carayannis
Automatic segmentation and labeling of English and Italian speech databases
B. Angelini, F. Brugnara, D. Falavigna, D. Giuliani, R. Gretter, M. Omologo
A segmental approach versus a centisecond one for automatic phonetic time-alignment
Azarshid Farhat, Guy Perennou, Regine Andre-Obrecht
A segmentation algorithm based on acoustical features using a self organizing neural network
I. Heroaez, J. Barandiaran, E. Monte, B. Etxebarria
SLAM: segmentation and labelling automatic module
Piero Cosi
Phone and syllable segmentation by concurrent window modules
Christian Heise, Hans-H. Bothe
Reliability of speech segmentation and labelling at different levels of transcription
Barbara Eisen
On the perception of acoustic and lexical vowel reduction
Dick R. van Bergem
Click detection in Italian and English
Brit van Ooyen, Anne Cutler, Pier Marco Bertinetto
Phonological variation and mismatch in lexical access
Andrew Nix, Gareth Gaskell, William Marslen-Wilson
Perception of word boundaries by dutch listeners
Monique van Zon, Beatrice de Gelder
Perception of French stop bursts, implications for stop identification
Anne Bonneau, Linda Djezzar, Yves Laprie
Using isofrequency neural column for harmonic sound scene decomposition
Zdravko Kacic, Bogomir Horvat
Do ear perceive vowel through formants?
A. K. Datta
Speech recognition using auditory models and neural networks
Trupti Vyas, Michael J. Pont, Seyed J. Mashari
The influence of temporal processes on spectral masking patterns of harmonic complex tones and vowels
Changxue Ma, Armin Kohlrausch
Temporal effect on the perception of continuous speech and a possible mechanism in the human auditory system
Hisao Kuwabara
Comparison of various adaptation mechanisms in an auditory model for the purpose of speech processing
Edward Jones, Eliathamby Ambikairajah
Sensory-motor manifestations of speech-hearing interaction
I. A. Vartanian, T. V. Chernigovskaya
Syllable perception: lateralization of native and foreign languages
T. V. Chernigovskaya, I. A. Vartanian, T. I. Tokareva
Simulation of short-latency auditory evoked potentials: a pilot study
Michael J. Pont
Intermediate representations in spoken word recognition: a cross-linguistic study of word illusions
Regine Kolinsky, Jose Morais
Time - varing manner on formant trajectories of Chinese diphthongs
Jianfen Cao
Iterative transformation and alignment for speech labeling
Yifan Gong, Jean-Paul Haton
Controlling search in segmentation lattices of speech signals
Kai Hübener, Andreas Hauenstein
Accent phrase segmentation using transition probabilities between pitch pattern templates
Hiroshi Shimodaira, Mitsuru Nakai
Syllable segmentation of continuous speech with artificial neural networks
W. Reichl, Günther Ruske
Labelling of speech given its text representation
Mats Blomberg, Rolf Carlson
On the automatic classification of pitch movements
Louis F. M. ten Bosch
Modelling of intonation contours at the sentence level using CHMMs and the 1961 o'connor and arnold scheme
U. Jensen, Roger K. Moore, Paul Dalsgaard, Borge Lindberg
Automatic recognition of intonation from F0 contours using the rise/fall/connection model
Paul Taylor
A pitch contour analysis guided by prosodic event detection
Edouard Geoffrois
Analysis and synthesis of pitch movements in a read polish text
Grazyna Demenko, Ignacy Nowak, Janusz Imiolczyk
Noise adaptation: speech recognition by auditory models and human listeners
William A. Ainsworth, G. F. Meyer
Adapting a HMM-based recogniser for noisy speech enhanced by spectral subtraction
J. A. Nolazco Flores, Steve J. Young
Speech recognition under the unstationary noise based on the noise Markov model and spectral-subtraction
Tetsunori Kobayashi, Ryuji Mine, Katsuhiko Shirai
HMM recognition in noise using parallel model combination
M. J. F. Gales, Steve J. Young
Selectively trained neural networks for connected word recognition in noisy environments
Laurent Buniet, Dominique Fohr, Yolande Anglade, Jean-Claude Junqua, Jean-Marie Pierrel
A baseline of a speaker independent continuous speech recognizer of Italian
B. Angelini, F. Brugnara, D. Falavigna, D. Giuliani, R. Gretter, M. Omologo
Word lookahead scheme for cross-word right context models in a stack decoder
L. R. Bahl, P. V. de Souza, P. S. Gopalakrishnan, D. Nahamoo, Michael Picheny
Recognition of obstruent phonemes in speaker-independent fluent speech using a hierarchical approach
David B. Grayden, Michael S. Scordilis
A continuous speech recognition system using phonotactic constraints
Bernd Plannerer, Günther Ruske
Joint arabic-hebrew speech synthesis system
M. Ouadou, A. Rajouani, M. Zyoute, J. Rosenfeld, M. Najim
Improvements of the Spanish version of the multivox text-to-speech system
Eduardo Lopez-Gonzalo, Gabor Olaszy, Geza Nemeth
Generating intonation for Swedish text-to-speech conversion using a quantitative model for the F0 contour
Mats Ljungqvist, Hiroya Fujisaki
PHRITTS - a text-to-speech synthesizer for the German language
P. Meyer, Hans-Wilhelm Rühl, R. Krüger, M. Kugler, L. L. M. Vogten, A. Dirksen, Karim Belhoula
Rule-based grapheme-to-phoneme conversion of names
Karim Belhoula
A prototype text-to-speech system for scottish gaelic
Iain R. Murray, Morag M. Black
A text-to-speech system for polish
Janusz Imiolczyk, Ignacy Nowak, Grazyna Demenko
Intelligibility as a function of speech coding method for template-based speech synthesis
Marian Macchi, Mary Jo Altom, Dan Kahn, Sharad Singhal, Murray F. Spiegel
Pronunciation and text normalisation in applied text-to-speech systems
Maggie Gaved
Evaluating synthesised prosody in simulations of an automated telephone enquiry service
Jill House, Catriona MacDermid, Scott McGlashan, Andrew Simpson, Nick Youd
Speech synthesis in dialogue systems
Katherine Morton, Marcel Tatham
Applying analysis of human emotional speech to enhance synthetic speech
Elissaveta Abadjieva, Iain R. Murray, John L. Arnott
A generic front end for text-to-speech synthesis systems
Eric Lewis, Marcel Tatham
Experiments with silent-e and affix correspondences in stochastic phonographic transduction
Robert W. P. Luk, Robert I. Damper
Phoneme-dependent speech synthesis in the time and frequency domains
Georg Fries
Speech synthesis experiments with the glove synthesiser
Inger Karlsson, Lennart Neovius
Auditory detection of discontinuities in synthesis-by-concatenation
Volker Kraft
Effects of the phase jitters on naturalness of synthesized speech
Yun-Keun Lee, Seung-Kwon Ahn
Letter-to-sound rules for the welsh language
Briony Williams
Dialogue design principles - key for usability of voice processing
Christel Müller, Fred Runge
Wizard-of-oz and the trade-off between naturalness and recogniser constraints
Hans Dybkjaer, Niels Ole Bernsen, Laila Dybkjaer
Dialogue analysis and generation: a theory for modelling natural English dialogue
Cerian Jones, Roberto Garigliano
Features of naive callers' dialogues with a simulated speech understanding and dialogue system
Catriona MacDermid
Refering to actions in man-machine command dialogues
Fabrice Duermael, Bertrand Gaiffe
Next utterance prediction based on two kinds of dialog models
Yoichi Yamashita, Riichiro Mizoguchi
The design of a real world wizard of oz experiment for a speech driven telephone directory information system
T. Andemach, G. Deville, L. Mortier
Dialog structure and plan recognition in spontaneous spoken dialog
Sheryl R. Young
A speech-first model for repair identification in spoken language systems
Julia Hirschberg, Christine Nakatani
Recognition confidence measures for spontaneous spoken dialog
Sheryl R. Young, Wayne Ward
Issues in large scale statistical language modeling
R. Zhao, P. Kenny, P. Labute, Douglas O'Shaughnessy
A data-driven case for a spontaneous speech grammar
Roberto Garigliano, Kevin Johnson, Russell J. Collingham
Improved clustering techniques for class-based statistical language modelling
Reinhard Kneser, Hermann Ney
A consolidated language model for speech recognition
J. H. Wright, G. J. F. Jones, H. Lloyd-Thomas
Empirical acquisition of word and phrase classes in the atis domain
Michael K. McCandless, James R. Glass
The effects of parameter smoothing on robust learning in syntactic ambiguity resolution
Tung-Hui Chiang, Keh-Yih Su
Learning associations between grammars: a new approach to natural language understanding
Enrique Vidal, Roberto Pieraccini, Esther Levin
Language modelling for CSR of large corpus using automatic classification of words
Michele Jardino, Gilles Adda
Inference of stochastic context-free grammar rules from example data using the theory of Bayesian belief propagation
Helmut Lucke
Constructing linguistic oriented language models for large vocabulary speech recognition
Petra Witschel
New frequency domain prosodic modification techniques
Eduardo R. Banga, Carmen Garcia-Mateo
A prosody modification approach for auditory user feedback in the spell pronunciation teaching system
H. D. Wang, D. Degryse, Fabrizio Carrara
A speech prosody conversion system with a high quality speech analysis-synthesis method
Tohru Takagi, Eiichi Miyasaka
On the perceived serial position of discourse units
Marc Swerts, Rene Collier
Enhanced pitch tracking and the processing of F0 contours for computer aided intonation teaching
Paul C. Bagshaw, Steven Hiller, Mervyn A. Jack
Improved DVQ algorithm for speech recognition: a new adaptive learning rule with neurons annihilation
Chakib Tadj, Franck Poirier
Speaker-independent 212 word recognition using combNET-II
Taro Sasaki, Tadashi Kitamura, Akira Iwata
Learning direct acoustic-to-semantic mappings through simple recurrent networks
M. A. Castano, Enrique Vidal, Francicso Casacuberta
Noise-adaptive hidden Markov models based on wiener filters
Saeed V. Vaseghi, Ben P. Milner
Noisy speech recognition using singular value decomposition and two-sided linear prediction
K. F. Wong, S. H. Leung, H. C. Ng
Recognition of noisy speech by composition of hidden Markov models
Franck Martin, Kiyohiro Shikano, Yasuhiro Minami
Noise reduction and speech recognition in noise conditions tested on LPNN-based continuous speech recognition system
Yuqing Gao, Jean-Paul Haton
Combination of distortion-robust feature extraction and neural noise reduction for ASR
Michael Trompf, Ralf Richter, Harald Eckhardt, Heidi Hackbarth
On-line adaptation of a speech recognizer to variations in telephone line conditions
C. Mokbel, J. Monné, D. Jouvet
Online channel compensation for robust speech recognition
Matthias Wittmann, Otto Schmidbauer, Abdulmesih Aktas
Evaluation of car noise reduction/compensation techniques for digit recognition in a speaker-independent context
Patrice Alexandre, Jerome Boudy, Philip Lockwood
Experiments on noise reduction techniques with robust voice detector in car environment
A. Brancaccio, C. Pelaez
Robust word spotting in adverse car environments
Satoshi Nakamura, Toshio Akabane, Seiji Hamaguchi
Definition of subword acoustic units for wordspotting
Richard C. Rose
Spontaneous speech recognition by sentence spotting
Jiro Kiyama, Yoshiaki Itoh, Ryuichi Oka
Phonetic-based word spotter: various configurations and application to event spotting
P. Jeanrenaud, K. Ng, M. Siu, J.R. Rohlicek, H. Gish
An application of word-spotting in a voice activated service entry system
Akihiro Imamura, Mikio Kitai
Out-of-vocabulary word modelling and rejection for keyword spotting
Eduardo Lleida, Jose B. Marino, Josep M. Salavedra, Antonio Bonafonte, E. Monte, A. Martinez
Word and phrase spotting with limited training
M. J. O'Kane, P. E. Kenne
A new approach towards keyword spotting
Jean-Marc Boite, Hervé Bourlard, Bart D'hoore, Marc Haesen
Grammar learning and word spotting using recurrent neural networks
J. Alvarez-Cercadillo, Luis A. Hernandez-Gomez
Word spotting in conversational speech based on phonemic unit likelihood by mutual information criterion
Shigeki Okawa, Tetsunori Kobayashi, Katsuhiko Shirai
Generalized frequency domain adaptive filter for acoustic echo canceller
F. Dohnal
Estimation of speech signal classification features in a simulated hyperbaric environment
J. Crestel, M. Guitton
Noise suppression system for a car
Petr Pollak, Pavel Sovka, Jan Uhlir
Adaptive gain control and echo cancellation for hands-free telephone systems
Peter Heitkamper, Michael Walker
Predicting segmental durations for accommodation within a syllable-level timing framework
W. Nick Campbell
A filtersank based on physiologically measured characteristics in an auditory model for speech signal processing
Tore Fjallbrant, Fisseha Mekuria, Shahrokh Amirijoo
Spectral sensitivity weighted transform coding for LSP parameters
Fu-Rong Jean, Chih-Chung Kuo, Hsiao-Chuan Wang
An efficient algorithm to estimate the instantaneous SNR of speech signals
Rainer Martin
Speech/non-speech detection for voice response systems
L. Mauuary, J. Monne
Time-spectral approach to compiling speech reconstruction
Alexander Osipov, Vladimir Zentsov
A voice activity detector based on cepstral analysis
J. A. Haigh, J. S. Mason
High quality coding of wideband speech at 24 kbit/s
Jürgen Paulus, Christiane Antweiler, Christian G. Gerlach
A 32 kbit/s wideband speech coder based on transform coding
H. Dia, Gang Feng, Y. Mahieux
Realtime implementation of high-quality 32 kbps wideband LD-CELP coder
Oded Gottesman, Yair Shoham
A fixed-point implementation of the 16 kb/s LD-CELP speech coding algorithm
A. Popescu, D. Vicard, F. Druilhe
Optimality of sequential quantization in analysis-by-synthesis speech codecs
Christian G. Gerlach
A sub-band MPLPC coder for high quality speech coding at 16 kbit/s
Radwan Kastantin, Gang Feng
Optimal multepulse excitation determination by simulated annealing
Enzo Mumolo, Alessio Rebelli
Split vector quantization of the LPC parameters using weighted lattice structure
K. W. Law, C. F. Chan
A new approach to noiseless interframe coding of LPC parameters in vector quantizer applications
Stefan Bruhn
Efficient quantization of speech spectral information
Torbjörn Svendsen
Enhancing robustness of coded LPC-spectra to channel errors by use of residual redundancy
Stefan Feldes
Multi-rate source and channel coding for mobile communication systems
S. A. Atungsiri, A. M. Kondoz, B. G. Evans
Training method of the excitation codebook for CELP
Takehiro Moriya, Satoshi Miki, Kazunori Mano, Hitoshi Ohmuro
Phrasing strategies in prosodic parsing and speech synthesis
Gösta Bruce, Björn Granström, Kjell Gustafson, David House
Prosody in the perception of syntactic boundaries
Eva Strangert, Bo Strangert
Prosodic cues to the perception of constituent boundaries
Jan Roelof de Pijper, Angelien Sanderman
Acoustic cues to syntactic structure - evidence from prosodic and segmental effects
Esther Grabe, Tara Hoist, Francis Nolan, Paul Warren
Automatic generation of French intonation based on a perceptual study and morpho-syntactic information
Frédéric Beaugendre, Anne Lacheret-Dujour
A partitioned neural network approach for vowel classification using smoothed time/frequency features
Stephen A. Zahorian, Zaki B. Nossair, Claude A. Norton III
Speaker-independent 100 word recognition using dynamic spectral features of speech and a neural network
Tadashi Kitamura
Speaker independent isolated word recognition using vector quantization and neural networks
Ming Zhu, Klaus Fellbaum
Multi-layer perceptrons and probabilistic neural networks for phoneme recognition
Kjell O. E. Elenius, Hans G. C. Traven
Automatic accent classification using artificial neural networks
C. S. Blackburn, Julie P. Vonwiller, R. W. King
The benefits of tiered segmentation for the recognition of phonetic properties
Mark Huckvale
Generalized context-dependent phone modeling using artificial neural networks
David M. Lubensky
Speaker-independent connected letter recognition with a multi-state time delay neural network
Hermann Hild, Alex Waibel
Tuning by doing: flexibility through automatic structure optimization
Ulrich Bodenhausen, Alex Waibel
Phonetic features for spelled letter recognition with a time delay neural network
Christoph Windheuser, Frédéric Bimbot
Training of a time-delay neural network for speech recognition by solving stiff differential equations
Veronika Bappert, Matthias Jobst
ATREUS: a speech recognition front-end for a speech translation system
Shigeki Sagayama, Jun-ichi Takami, Akito Nagai, Harald Singer, Kouichi Yamaguchi, Kzumi Ohkura, Kenji Kita, Akira Kurematsu
ATR's speech translation system: ASURA
Tsuyoshi Morimoto, Toshiyuki Takezawa, Fumihiro Yato, Shigeki Sagayama, Toshihisa Tashiro, Masaaki Nagata, Akira Kurematsu
Recent advances in JANUS: a speech translation system
Monika Woszczyna, N. Coccaro, A. Eisele, A. Lavie, A. McNair, T. Polzin, Ivica Rogina, C. P. Rose, Tilo Sloboda, M. Tomita, J. Tsutsumi, N. Aoki-Waibel, Alex Waibel, Wayne Ward
Spoken language translation with MID-90's technology: a case study
Manny Rayner, Ivan Bretan, David Carter, Michael Collins, Vassilios Digalakis, Bjorn Gamback, Jaan Kaja, Jussi Karlgren, Bertil Lyberg, Stephen Pulman, Patti Price, Christer Samuelsson
Automatic language identification using a segment-based approach
Timothy J. Hazen, Victor W. Zue
A comparison of approaches to automatic language identification using telephone speech
Yeshwant Muthusamy, Kay Berkling, Takayuki Arai, Ronald Cole, Etienne Barnard
Integration of neural networks and robust parsers in natural language understanding
Ying Cheng, Yves Normandin, Paul Fortier
Joint speech and gesture analysis some experimental results on multimodal interface
Pierre Dauchy, Christophe Mignot, Claude Valot
Generation of speech reply in the speech response system
Keikichi Hirose, Yasuharu Asano
A fast multilingual probabilistic tagger
Evangelos Dermatas, George Kokkinakis
The possibility for acquisition of statistical network grammar using ergodic HMM
Jin'ichi Murakami, Hiroki Yamatomo, Shigeki Sagayama
A robust analyzer for spoken language understanding
Evelyne Millien, Roland Kuhn
Identifying usability attributes of automated telephone services
R. T. Dutton, John C. Foster, Mervyn A. Jack, F. W. Stentiford
Utilising prosody to perform syntactic disambiguation
Andrew Hunt
Spell: an automated system for computer-aided pronunciation teaching
Steven Hiller, Edmund Rooney, Jean-Paul Leffevre, Mervyn A. Jack
Training vowel pronunciation using a computer-aided teaching system
Edmund Rooney, Rebecca Vaughan, Steven Hiller, Fabrizio Carraro, John Laver
Methods for traversing a pre-recorded speech message network to optimise dialogue in telephone answering systems
Mary Zajicek, Ken Brownsey
Service creation tools for creating speech interactive services
Roger Hanes, Jo Salter, Paul Popay, Frances Hedley
Deaccentuation and persistence of grammatical function and surface position
Julia Hirschberg, Jacques Terken
Design and implementation of a speech server for unix based multimedia applications
Stefan Euler, K. Riedel
Romaine: a lattice based approach to lexical access
David Goodine, Victor W. Zue
A system for clustering spoken documents
Toffee A. Albina, Erica G. Bernstein, David M. Goblirsch, Douglas E. Lake
Spoken dialogues for human-computer interaction over the telephone: complexity measures
Nathalie A. Vergeynst, Keith Edwards, John C. Foster, Mervyn A. Jack
The cost of errors in a spoken language system
Lynette Hirschman, Christine Pao
Black box and glass box evaluation of the SUNDIAL system
Andrew Simpson, Norman M. Eraser
A methodology for evaluating human-machine spoken language interaction
Cristina Delogu, Andrea Di Carlo, Ciro Sementina, Silvia Stecconi
Error correction and ambiguity resolution in multimodal man-machine dialogue
Philippe Morin, Jean-claude Junqua
Analysis of the speaker and operator behaviours
Marie-Franoise Castaing, Dominique True-Martini
Multi-level transcription of speech corpora from orthographic forms
Alix de Ginestel-Mailland, Martine de Calmes, Guy Perennou
Automatic segmentation of speech for TTS
Andrej Ljolje, Michael D. Riley
Automatic segmentation and quality evaluation of speech unit inventories for concatenation-based, multilingual PSOLA text-to-speech systems
O. Boeffard, B. Cherbonnel, F. Emerard, S. White
On the development of pronunciation rules for text-to-speech synthesis
Bert Van Coile
Tabtalk: reusability in data-oriented grapheme-to-phoneme conversion
Walter Daelemans, Antal van den Bosch
A modular architecture supporting multiple hypotheses for conversion of text to phonetic and linguistic entities
Anders Lindström, Mats Ljungqvist, Kjell Gustafson
The use of a non-linear model for text-to-speech conversion
Jon Iles, William Edmondson
The perceptual relevance of CV- and VC- transitions in identifying stop consonants: cross-language results
Astrid van Wieringen, John K. Cullen, Louis C. W. Pols
Perceptual effects of place and voicing assimilation in dutch consonants
Vincent J. van Heuven, Willy Jongenburger
Detection of vowels and consonants by human listeners: effects of minimising auditory memory load
Brit van Ooyen
Resonances as possible representation of speech in the auditory-to-articulatory transform
Gerard Bailly
A perceptual explanation of the weightlessness of the syllable onset
Rob Goedemans, Vincent J. van Heuven
A study of the beam-search algorithm for large vocabulary continuous speech recognition and methods for improved efficiency
Enrico Bocchieri
Using grammars in forward and backward search
L. Fissore, E. Giachin, Pietro Laface, P. Massafra
Robust interpretation of speech
Gernot A. Fink, Franz Kummert, Gerhard Sagerer, Bernd Seestaedt
A* word network search for continuous speech recognition
I. Lee Hetherington, Michael S. Phillips, James R. Glass, Victor W. Zue
Efficient lexical access strategies
Roxane Lacouture, Yves Normandin
Multiple codebook Spanish phone recognition using semicontinuous hidden Markov models
I. Torres, Francisco Casacuberta
An efficient algorithm to find the best state sequence in HSMM
Antonio Bonafonte, Xavier Ros, Jose B. Marifio
Robust HMM-based endpoint detector
Alex Acero, C. Crespo, C. de la Torre, J. C. Torrecilla
Experiments on Spanish phone recognition using automatically derived phonemic baseforms
Isabel Galiano, Francisco Casacuberta
Evaluation of VQ-distortion based HMM
Seiichi Nakagawa, Hideyuki Suzuki, Li Zhao
Continuous HMM for word spotting and rejection of non vocabulary word in speech recognition over telephone networks
Jianming Song
Bayesian learning of the parameters of discrete and tied mixture HMMs for speech recognition
Qiang Huo, Chorkin Chaw, Chin-Hui Lee
Speech recognition using semantic hidden Markov networks
Gernot A. Fink, Franz Kummert, Gerhard Sagerer, Ernst G. Schukat-Talamazzini
Experiments in vocabulary independent speech recognition using phoneme decision trees
Simon Downey, Martin Russell, Peter Nowell, David Bijl, Kirsta Galloway, Keith Ponting
Segmental hidden Markov models
M. J. F. Gales, Steve J. Young
Impact of dimensionality and correlation of observation vectors in HMM-based speech recognition
Xue Wang, Louis F. M. ten Bosch, Louis C. W. Pols
Evaluation of an HMM speech recognizer with various continuous speech databases
F. Class, A. Kaltenmeier, Peter Regel-Brietzmann
Hidden Markov models for noisy speech recognition
Adam Wrzoskowicz
Neural network speech enhancer utilizing masking properties
D. E. Tsoukalas, J. Mourjopoulos, George Kokkinakis
Comparison of geometric, connections and structural techniques on a difficult isolated word recognition task
Maria J. Castro, Juan C. Perez
Prediction and discrimination in neural networks for continuous speech recognition
A. Mellouk, P. Gallinari, F. Rauscher
Two schemes of phonetic feature extraction using artificial neural networks
Shuping Ran, J. Bruce Millar
On use of discriminant analysis in predictive connectionist speech recognition
Bojan Petek, Anuska Ferligoj
Non-linear time compression for lexical access
N. H. Russell, Frank Fallside, R. W. Prager
Talker enrollment for speech recognition by synthesis
Richard Brierton, Nigel Sedgwick
Improving robustness of network grammar by using class HMM
Kauzya Takeda, Naomi Inoue, Shingo Kuroiwa, Tomohiro Konuma, Seiichi Yamamoto
Parallelising k-means clustering on distributed memory MIMD computers
J. A. Elliott, M. E. Forsyth, F. R. McInnes, N. W. Ramsey
On the proper sub-word unit inventory for CSR
P. Berenyi, Klára Vicsi
Speech recognition using the atomic speech units constructed from overlapping articulatory features
Li Deng, Don Sun
A Bayesian approach to phone duration adaptation for lombard speech recognition
Olivier Siohan, Yifan Gong, Jean-Paul Haton
Multiple multilabeling to improve HMM-based speech recognition in noise
J. Hernando, Jose B. Marino, Climent Nadeu
Discrimination of polish stop consonants based on mapped techniques
Lutoslawa Richter, Piotr Domagaia
Managing spoken dialogues for information services
Wieland Eckert, Scott McGlashan
Ambiguity and uncertainty in spoken dialogue
Paul Heisterkamp
Managing dialogue in a continuous speech understanding system
Elisabetta Gerbino, Morena Danieli
Speaking with computers: a multimodal approach
P. Lefebvre, G. Duncan, Franck Poirier
Habitable interaction in goal-oriented multimodal dialogue systems
Philippe Morin, Jean-Claude Junqua
Test of voice quality on ATM based equipment
Jorn Stern Nielsen, Bo Baungaard
An evaluation system for ascertaining the quality of synthetic speech based on subjective category rating tests
Harald Klaus, H. Klix, Jochem Sotscheck, Klaus Fellbaum
A global framework for the assessment of synthetic speech without subjects
Arnd Mariniak
Comprehension of KTH text-to-speech with "listening speed" paradigm
Lennart Neovius, Parimala Raghavendra
Theoretical principles concerning segmentation, labelling strategies and levels of categorical annotation for spoken language database systems
Hans G. Tillmann, Bernd Pompino-Marschall
The comparative assessment of commercial speech recognisers
Peter Wyard
Reliable assessment of speech recognisers for telephone environment
A. Riccio, F. Ceglie, A. Brancaccio
Evaluation of a rule-based text-to-speech system for French at the segmental level
Martine Garnier-Rizet
Intelligibility of speech produced by text-to-speech synthesizers over the orthophonic and telephonic channel
Cristina Delogu, Andrea Paoloni, P. Ridolfi, Kyriaki Vagges
Using the ORATOR® synthesizer for a public reverse-directory service: design, lessons, and recommendations
Murray F. Spiegel
A speech formant synthesizer based on harmonic + random formant-waveforms representations
Sophie Grau, Christophe d'Alessandro, Gael Richard
SPEAKEZ: a first experiment in concatenation synthesis from a large corpus
Alexander G. Hauptmann
Designing control rules for a serial pole-zero vocal tract model
J. Kerkhoff, Lou Boves
English speech synthesis based on multi-layered context oriented clustering; towards multi-lingual speech synthesis
Shin'ya Nakajima
Speech synthesis using artificial neural networks trained on cepstral coefficients
Christine Tuerk, Tony Robinson
Bayesian regularisation methods in a hybrid MLP-HMM system
Steve Renals, David MacKay
Real-time, neural network-based, French alphabet recognition with telephone speech
P. Schmid, Ronald Cole, M. Fanty, Hervé Bourlard, M. Haessen
Joint optimization of multiple neural codebooks in a hybrid connectionist-HMM speech recognition system
Gerhard Rigoll
Using LVQ to enhance semi-continuous hidden Markov models for phonemes
Mikko Kurimo
An improvement of the two-level DP matching algorithm using k-NN techniques for acoustic-phonetic decoding
Pablo Aibar, Francisco Casacuberta
Performance comparison of hidden Markov models and neural networks for task dependent and independent isolated word recognition
Hervé Bourlard, Jean-Marc Boite, Bart D'Hoore, Marco Saerens
Connectionist speech recognition with a global MMI algorithm
Patrick Haffner
Connectionist segmental post-processing of the n-best solutions in isolated and connected word recognition task
Denys Boiteau, Patrick Haffner
A new dynamic programming/multi-layer perceptron hybrid for continuous speech recognition
Jean-Pierre Martens, Annemie Vorstermans, Nick Cremelie
A neural network based, speaker independent, large vocabulary, continuous speech recognition system: the WERNICKE project
Tony Robinson, L. Almeida, Jean-Marc Boite, Hervé Bourlard, Frank Fallside, Michael M. Hochberg, D. Kershaw, P. Kohn, Y. Konig, Nelson Morgan, J. P. Neto, Steve Renals, Marco Saerens, C. Wooters
Visual coarticulation effects in syllable environment
Hans-H. Bothe, Frauke Rieger, Robert Tackmann
Depth measurement of face and palate by structured light
Christine H. Shadle, J. N. Carter, T. P. Monks, J. Field
Visiolab: a multimedia environment for the study of bimodal speech perception
Louis-Jean Boe, Sonia Kandel, Annie Chappelet, Tahar Lallouache
Integrating auditory and visual representations for audiovisual vowel recognition
Jordi Robert-Ribes, Tahar Lallouache, Pierre Escudier, Jean-Luc Schwartz
Speech recognition over packetized voice systems
Bo Baungaard, Jorn Stern Nielsen
Voice applications on BT's derived services network
I. W. G. Jenkins
A French oral dialogue system for flight reservations over the telephone
Jean-Yves Magadur, Frédéric Gavignet, Francois Andry, Francis Charpentier
A voice-activated extension telephone exchange system
Shingo Kuroiwa, Kazuya Takeda, Naomi Inoue, Izuru Nogaito, Seiichi Yamamoto, Makoto Shouzakai, Kunihiko Owa, Masahiko Takahashi, Ryuuji Matsumoto
The VOIS project in retrospect
William C. G. Ortel, Dina Yashchin
TELEMACO - a real time keyword spotting application for voice dialling
Eduardo Lleida, Jose B. Marino, Arturo Moreno
The relative importance of the factors affecting recogniser performance with telephone speech
Peter Wyard
A robust acoustic echo canceller for a hands-free voice-controlled telecommunication terminal
Thomas Burger, Ulrich Schultheiß
Polyphase allpass IIR structures for sub-band acoustic echo cancellation
J. E. Hart, P. A. Naylor, O. Tanrikulu
Speech input systems and their effect on written language skills
James Monaghan, Christine Cheepen
Voxaid: an interactive speaking communication aid software for the speech impaired
Gabor Olaszy, Geza Nemeth
Feature extraction for profoundly deaf people
U. Hartmann, K. Hermansen, F. K. Fink
Architecture of a 10,000 word real time speech recognizer
Alfred Hauenstein
A noise-robust real-time word recognition hardware module
Thomas Hermann, Harald Eckhardt, Michael Trompf, Heidi Hackbarth
KARS: a speaker-independent, vocabulary-independent speech recognition system
Myoung-Wan Koo
A parallel processing keyword recogniser for police national computer enquiries
F. R. McInnes, J. A. Elliott, N. W. Ramsey, M. E. Forsyth, A. M. Sutherland, Mervyn A. Jack
Cost232: speech recognition over the telephone line
Andrea Paoloni, Torbjörn Svendsen, B. Kaspar, Denis Johnston, Gunnar Hult
Individual variability in the perception of synthetic speech
Valerie Hazan, Bo Shi
Speech recognition system and its application for blind PC users
Ye. K. Ludovic, V. V. Pilipenko, G. E. Tseitlin, L. I. Nagornaya, T. Terzian
The NLP module of a spoken language dialogue system for Danish flight reservations
Bradley Music, Claus Povlsen
A man-machine dialogue system for speech access to train timetable information
D. Clementino, L. Fissore
An experimental dialogue system: waxholm
Mats Blomberg, Rolf Carlson, Kjell O. E. Elenius, Björn Granström, Joakim Gustafson, Sheri Hunnicutt, Roger Lindell, Lennart Neovius
A spoken dialogue system for German intercity train timetable inquiries
Wieland Eckert, T. Kuhn, Heinrich Niemann, S. Rieck, A. Scheuer, Ernst G. Schukat-Talamazzini
A telephone banking system based on HMM keyword recognition
Kyriaki Labropoulou, Nikos Fakotakis
A speech-based route enquiry system built from general-purpose components
Ian Lewin, Martin Russell, David Carter, Sue Browning, Keith Ponting, Stephen Pulman
The inks ATIS system and its n-best interface
Changwen Yang, Douglas O'Shaughnessy
A multimodal directory guidance system with an interactive mechanism
T. Nitta, Y. Masai, J. Iwasaki, S. Tanaka, Bi Karwo, H. Matsu'ura
A French version of the MIT-ATIS system: portability issues
H. Bonneau-Maynard, Jean-Luc Gauvain, David Goodine, Lori F. Lamel, Joseph Polifroni, Stephanie Seneff
A bilingual Voyager system
James R. Glass, David Goodine, Michael Phillips, Shinsuke Sakai, Stephanie Seneff, Victor W. Zue
A gestural approach for controlling an articulatory speech synthesizer
Bernd J. Kröger
An articulatory synthesizer for the simulation of consonants
Paul Boersma
Vowel dynamics in a text-to-speech system some considerations
Rolf Carlson, Lennart Nord
Improving the spectral balance of digital speech synthesis applied to a female, synthetic voice
Ida Frehr, Marianne Elmlund, Henrik Nielsen
A new model of excitation for text-to-speech synthesis
Yasushi Ishikawa, Tadashi Ebihara, Kunio Nakajima
A level-building top-down parsing algorithm for context-free grammars in continuous speech recognition
Francois Charpillet, Joseph Di Martino
Using anti-grammar and semantic categories for the recognition of spontaneous speech
Russell J. Collingham, Roberto Garigliano
Speech recognition using particle n-grams and content-word n-grams
Ryosuke Isotani, Shigeki Sagayama
Dynamic use of syntactical knowledge in continuous speech recognition
Pierre Dupont
Acoustic detection of laryngeal diseases in children
Fabrice Plante, Jocelyne Borel, Christian Berger-Vachon, Isabelle Kauffmann
Acoustic model and evaluation of pathological voice production
Dimitar D. Deliyski
Novel acoustic measurements of jitter and shimmer characteristics from pathological voice
Hideki Kasuya, Yasuo Endo, Sokol Saliu
An experiment involving the consistency and reliability of voice quality ratings for different types of speech fragments
Guus de Krom
Laryngectomee speech in noise - voice effort and intelligibility
Lennart Nord, Britta Hammarberg, Elisabet Lundstrom
Analysing prosody by means of a double tree structure
Berit Horvei, Georg Ottesen, Sveire Stensby
Prosody and discourse interpretation
Geneviève Caelen-Haumont
Duration modelling for the greek language
George Epitropakis, D. Tambakas, Nikos Fakotakis, George Kokkinakis
Prosody control of TTS-systems based on linguistic analysis
George Epitropakis, Nickolas Yiourgalis, George Kokkinakis
Prosody takes over: a prosodically guided dialog system
Ralf Kompe, Andreas Kießling, T. Kuhn, Marion Mast, Heinrich Niemann, Elmar Nöth, K. Ott, Anton Batliner
Integration of a prosodic component in an automatic speech recognition system
P. Langlais, Henri Meloni
Referent tracking in restricted texts using a lemmatized lexicon: implications for generation of intonation
Merle Horne, Marcus Filipsson, Mats Ljungqvist, Anders Lindström
Perceptual significance of focus accent in spoken Swedish
Robert Bannert
Pitch estimation of speech signal with the wavelet transform
Silvio Montresor, Marc Baudry
A spectral AMDF method for pitch extraction of noise-corrupted speech
Jae Yeol Rheem, Myung Jin Bae, Sou Guil Ann
A reliable postprocessor for pitch determination algorithms
Gao Yang, Henri Leich
Vowel pitch period extraction by models of neurones in the mammalian brain-stem
G. F. Meyer, William A. Ainsworth
Auto-regressive linear models of jitter
Jean Schoentgen, Raoul de Guchteneere
Larynx period detection methods in speech pattern hearing AIDS
Jianing Wei, David Howells, Andrew Faulkner, Adrian Fourcin
Fundamental frequency of dutch women: an evaluative study
Renee van Bezooijen
Proposal and implementation of a spoken word recognizer using utterance normalization and multiple templates on a single VLSI chip
Hiroya Fujisaki, Sumio Ohno, Hideki Nasuno, Keikichi Hirose
CASPER: a speech interface for the macintosh
Robert Strong
Dragon systems' experiences in small to large vocabulary multi-lingual speech recognition applications
Claudia Ellermann, Stijn Van Even, Caroline Huang, Linda Manganaro
Application of the n-best solutions algorithm to speaker-independent spelling recognition over the telephone
D. Jouvet, M. N. Lokbani, J. Monne
Language based approach to system control in speech recognition systems
Jerome Braun, Baruch Mazor
The CSELT system for Italian text-to-speech synthesis
Marcello Balestri, Stefano Lazzaretto, Pier Luigi Salza, Stefano Sandri
COMPOST: a client-server model for applications using text-to-speech systems
Mamoun Alissali, Gerard Bailly
Syntactic processing and prosody control in the SVOX TTS system for German
Christof Traber
Using context to specify intonation in speech synthesis
Scott Prevost, Mark Steedman
Statistical analysis of the acoustic and prosodic characteristics of different speaking styles
Masanobu Abe, Hirokazu Sato
Detection of unknown words in large vocabulary speech recognition
Satoru Hayamizu, Katunobu Itou, Kazuyo Tanaka
A very fast method for scoring phonetic transcriptions
P. Kenny, P. Labute, Z. Li, R. Hollan, M. Lennig, Douglas O'Shaughnessy
New words: implications for continuous speech recognition
I. Lee Hetherington, Victor W. Zue
The Philips research system for large-vocabulary continuous-speech recognition
Volker Steinbiss, Hermann Ney, Reinhold Haeb-Umbach, B.-H. Iran, U. Essen, Reinhard Kneser, M. Oerder, H.-G. Meier, X. Aubert, Christian Dugast, D. Geller, W. Hollerbauer, H. Bartosik
Very-large-vocabulary continuous speech recognition algorithm for telephone directory assistance
Yasuhiro Minami, Kiyohiro Shikano, Tomokazu Yamada, Tatsuo Matsuoka
Dictation system using inductively auto-generated syntax
Shoichi Matsunaga, Tomokazu Yamada, Kiyohiro Shikano
Syntax-semantics cooperation in micro: a multi-agent speech understanding system
Jean-Yves Antoene, Bertrand Caillaud, Jean Caelen
Senones, multi-pass search, and unified stochastic modeling in sphinx-II
M. Y. Hwang, F. Alleva, X. Huang
CMLPs robust spoken language understanding system
Sunil Issar, Wayne Ward
J-SUMMIT: Japanese spontaneous speech recognition
Shinsuke Sakai, Michael Phillips
A new interface paradigm: automatic recognition of integrated speech and handwriting information
Jerome R. Bellegarda, Dimitri Kanevsky
Factors affecting choice of speech over keyboard and mouse in a simple data-retrieval task
Alexander I. Rudnicky
Comparing synthesizers for name and address provision: field trial results
Sara Basson, Dina Yashchin, Ashok Kalyanswamy, Kim Silverman
Synthesiser intelligibility in the context of a name-and-address information service
Kim Silverman, Ashok Kalyanswamy, Julie Silverman, Sara Basson, Dina Yashchin
Enhancing user acceptance at the managerial workplace
Ruth Marzi
Detection and transcription of new words
B. Suhm, Monika Woszczyna, Alex Waibel
Efficient enumeration of sentence hypotheses in connected word recognition
Victor M. Jimenez, Andres Marzal, Enrique Vidal
Locating disfluencies in spontaneous speech: an acoustical analysis
Douglas O'Shaughnessy
Integration of phonological knowledge in a continuous speech recognition system
Roselyne Nguyen, Kamel Smaili, Jean-Paul Haton, Guy Perennou
Prosody and continuous speech recognition
Pierre Dumouchel, Douglas O'Shaughnessy
Spoken-language processing for restricted domains: a sublanguage approach
H. Bergmann, H.-H. Hamer, A. Noll, A. Paeseler, H. Tomaschewski
The use of state tying in continuous speech recognition
Steve J. Young, Phil C. Woodland
The HTK tied-state continuous speech recogniser
Phil C. Woodland, Steve J. Young
Combination of training criteria to improve continuous speech recognition
Laurence Devillers, Christian Dugast
Experiments with an articulatory speech recognizer
Igor Zlokarnik
Techniques for robust recognition in restricted domains
Giuliano Antoniol, Mauro Cettolo, Marcello Federico
Use of explicit context-dependent phonemic model in continuous speech recognition
Feriel Mouria, Yifan Gong, Jean-Paul Haton
Base transformation for environment adaptation in continuous speech recognition
Yifan Gong
Improved a-posteriori processing for keyword spotting
Baruch Mazor, Ming-Whei Feng
Single and multi-channel speech enhancement for a word spotting system
J. Ortega-Garcia, J. M. Paez-Borrallo, Luis A. Hernandez-Gomez
Estimating 'small' probabilities by leaving-one-out
Hermann Ney, Ute Essen
Semantic and pragmatically based re-recognition of spontaneous speech
Sheryl R. Young, Wayne Ward
Modeling of time constituents for speech understanding
Bernd Hildebrandt, Gernot A. Fink, Franz Kummert, Gerhard Sagerer
Phonetic segmentation method for the continuous czech speech recognition
Vaclav Matousek
Speech recognition applied to reading assistance for children: a baseline language model
Alexander G. Hauptmann, Lin L. Chase, Jack Mostow
Modelling speaker normalization by adapting the BIAS in a neural net
David J. M. Weenink, Louis C. W. Pols
Neural models for extracting speaker characteristics in speech modelization systems
T. Artieres, P. Gallinari
Influence of pattern compression on speaker verification
J. Zinke
A comparative study of speaker adaptation under realistic conditions
Florian Schiel
A comparison of speaker recognition techniques for telephone speech
D. A. Irvine, F. J. Owens
Speaker verification over telephone channels based on concatenated phonemic hidden Markov models
Johan de Veth, Guido Gallopyn, Hervé Bourlard
Speaker adaptation using a predictive model
Stephen Cox
Combining features via LDA in speaker recognition
Z. P. Sun, J. S. Mason
Neural networks for speech and speaker recognition through a digital telephone exchange
J. M. Elvira, R. A. Carrasco
Performance comparison of machine and human speaker verification
M. Mehdi Homayounpour, J. Philippe Goldman, Gérard Chollet, Jacqueline Vaissière
The effect of utterance length and content on speaker-verifier performance
M. I. Hannah, A. T. Sapeluk, Robert I. Damper, I. M. Roger
The use of pseudostationary segments for speaker identification
Antanas Lipeika, Joana Lipeikiene
Bayesian decision in the speaker recognition by acoustic parametrization of voice samples over telephone lines
A. Federico, Andrea Paoloni
Article |
---|