doi: 10.21437/ICSLP.1992
ISSN: 2958-1796
Knowing enough to analyze spoken languages
Peter Ladefoged
Speech understanding strategies based on string classification trees
Renato De Mori, R. Kuhn
Infants' perception and representation of speech: development of a new theory
Patricia K. Kuhl
The behavior of the larynx in spoken language production
Hajime Hirose
Syllabic fillers for Spanish HMM keyword spotting
Eduardo Lleida, José B. Marino, J. Salavedra, Antonio Bonafonte
Minimum error classification training for HMM-based keyword spotting
Yasuhiro Komori, David Rain Ton
A novel speech recognizer for keyword spotting
Gregory J. Clary, John H. L. Hansen
Secondary processing using speech segments for an HMM word spotting system
Herbert Gish, Kenney Ng, J. Robin Rohlicek
Continuous word spotting for applications in telecommunications
Ming-Whei Feng, Baruch Mazor
A low bit-rate CELP coder based on multi-path search methods
Maurizio Copperi
Fully vector quantized arm a analysis combined with glottal model for low bit rate coding
Katsushi Seza, Hirohisa Tasaki, Shinya Takahashi
Vector quantization of speech LSF parameters with generalized product codes
Erdal Paksoy, Wai-Yip Chan, Allen Gersho
Low-rate speech coding based on time-frequency interpolation
Yair Shoham
Improved CELP speech coding at 4 kbit/s and below
Tomohiko Taniguchi, Yoshinori Tanaka, Yasuji Ohta, Fumio Amano
Efficient integration of coarticulation and lexical information in a finite state grammar
Antonio Bonafonte, Jose B. Marino, Montse Pardas
Characteristics of nasalance in canadian speakers of English and French
H. A. Leeper, A. P. Rochet, I. R. A. MacKay
Ensemble averaging applied to the analysis of fricative consonants
Christine H. Shadle, Andre Moulinier, Christian U. Dobelke, Celia Scully
Effects of stress and vowel context on velar stops in british English
Andrew Slater, Sarah Hawkins
Lip rounding coarticulation in Italian
E. Magno Caldognetto, K. Vagges, G. Ferrigno, Maria Grazia Busa
Intelligibility of audio-visually desynchronised speech: asymmetrical effect of phoneme position
P. M. T. Smeele, A. C. Sittig, Vincent J. van Heuven
Speech analysis using complex orthogonal auditory transform (coat)
Unto K. Laine
Auditory model based speech processing
Yuqing Gao, Taiyi Huang, Shaoyan Chen, Jean-Paul Haton
Phonetic classification of timit segments preprocessed with lyon's cochlear model using a supervised/unsupervised hybrid neural network
Gary N. Tajchman, Nathan Intrator
Formant and pitch-pulse detection using models of auditory signal processing
Thomas Holton, Steven D. Love, Stephen P. Gill
Towards handling the acoustic environment in spoken language processing
Hynek Hermansky, Nelson Morgan
Real-time speaker-independent large-vocabulary CDHMM-based continuous telephonic speech recognizer
Alberto Ciaramella, Davide Clementino, Roberto Pacifici
Flexible vocabulary recognition of speech
Matthew Lennig, Douglas Sharp, Patrick Kenny, Vishwa Gupta, Kristin Precoda
The effects of signal representations, phonetic classification techniques, and the telephone network
Benjamin Chigier, Hong C. Leung
A lexicon for a text-to-speech system
Leon Gulikers, Rijk Willemse
Word class assignment in a text-to-speech system
Rijk Willemse, Leon Gulikers
Aspects of prosodic phrasing in Swedish
Gösta Bruce, Björn Granström, Kjell Gustafson, David House
Synthesis-by-analogy: a bilingual investigation using German and English
K. P. H. Sullivan, Robert I. Damper
Degas: a system for rule-based diphone speech synthesis
Leonard C. Manzara, David R. Hill
Towards synthesis of Hindi consonants using KLSYN88
Shyam S. Agrawal, Kenneth N. Stevens
Multi-lingual synthesis evaluation methods
Louis C. W. Pols, SAM Partners SAM Partners
The interaction of phonetics, phonology and morphology in an icelandic text-to-speech system
Björn Granström, Petur Helgason, Hoskuldur Thrainsson
Comparing methods for automatic extraction of voice source parameters from continuous speech
Helmer Strik, Joop Jansen, Louis Boves
The influence of linguistic variations on the voice source characteristics
Jacques Koreman, Louis Boves, Bert Cranen
Dynamic voice source changes in natural and synthetic speech
Sarah K. Palmer, Jill House
Acoustic and perceptual modelling of the voice quality caused by fundamental frequency perturbation
Satoshi Imaizumi, Jan Gauffin
Vocal cord vibration during consonants - high-speed digital imaging using a fiberscope
Shigeru Kiritani, H. Imagawa, Hajime Hirose
A "speech acts" approach to grounding in conversation
David R. Traum, James F. Allen
Antecedent activation by empty pronominals in Spanish
Sheila Meltzer
Multiple feature matching in pronoun resolution: a new look at parallel function
Ron Smyth
A discriminative approach for ambiguity resolution based on a semantic score function
Keh-Yih Su, Jing-Shin Chang, Yi-Chung Lin
The influence of semantic and syntactic information on spoken sentence recognition
Nobuaki Minematsu, Sumio Ohno, Keikichi Hirose, Hiroya Fujisaki
Effects of speaking rate and talker variability on the representation of spoken words in memory
Lynne C. Nygaard, Mitchell S. Sommers, David B. Pisoni
On the absence of word segmentation at "weak" syllables
Hugo Quene, Yvette Smits
Stimulus variability and the perception of spoken words: effects of variations in speaking rate and overall amplitude
Mitchell S. Sommers, Lynne C. Nygaard, David B. Pisoni
Words within words: lexical statistics and lexical access
James M. McQueen, Anne Cutler
Experiments on the use of the generalized probabilistic descent method in speech recognition
Stephan Euler, Joachim Zinke
Improving and optimizing speaker independent, 1000 words speech recognition in Spanish
Ricardo de Cordoba, José M. Pardo, Jose Colás
Multiple-level evaluation of speech recognition systems
John F. Pitrelli, David Lubensky, Benjamin Chigier, Hong C. Leung
Speaker independent word recognition using continuous matching of parameters in time-spectral form based on statistical measure
Tatsuya Kimura, Mitsuru Endo, Shoji Hiraoka, Katsuyuki Niyada
Automatic derivation of lexical models for a very large vocabulary speech recognition system
R. Roddeman, H. Drexler, Louis Boves
Response time as a metric for comparison of speech recognition by humans and machines
Anne Cutler, Tony Robinson
Characterization of directory assistance operator-customer dialogues in AGT limited
S. M. (Raj) Ulagaraj
Analysis of the effectiveness of system error messages in a human-machine travel planning task
Sheri Hunnicutt, Lynette Hirschman, Joseph Polifroni, Stephanie Seneff
Evaluating interactive spoken language systems
David Goodine, Lynette Hirschman, Joseph Polifroni, Stephanie Seneff, Victor Zue
The cluster-identification test
Ute Jekosch
Experiments in continuous speech recognition with a 60,000 word vocabulary
Patrick Kenny, R. Hollan, G. Boulianne, H. Garudadri, Yan-Ming Cheng, Matthew Lennig, Douglas O'Shaughnessy
HMM training on unconstrained speech for large vocabulary, continuous speech recognition
G. Boulianne, Patrick Kenny, Matthew Lennig, Douglas O'Shaughnessy, Paul Mermelstein
Appropriate error criterion selection for continuous speech HMM minimum error training
David Rainion, Shigeki Sagayama
Hardware implementation of realtime 1000-word HMM-LR continuous speech recognition
Akito Nagai, Kenji Kita, Toshiyuki Hanazawa, Tadashi Suzuki, Tomohiro Iwasaki, Tsuyoshi Kawabata, Kunio Nakajima, Kiyohiro Shikano, Tsuyoshi Morimoto, Shigeki Sagayama, Akira Kurematsu
Design and performance of HARC, the BBN spoken language understanding system
Madeleine Bates, Robert Bobrow, Pascale Fung, Robert Ingria, Francis Kubala, John Makhoul, Long Nguyen, Richard Schwartz, David Stallard
Performance of speaker-independent Japanese recognizer as a function of training set size and diversity
O. Shirotsuka, G. Kawai, Michael Cohen, J. Bernstein
Continuous mixture HMM-LR using the a* algorithm for continuous speech recognition
Kouichi Yamaguchi, Shigeki Sagayama, Kenji Kita, Frank K. Soong
Continuously spoken sentence recognition by HMM-LR
Kenji Kita, Tsuyoshi Morimoto, Kazumi Ohkura, Shigeki Sagayama
Word pre-selection using a redundant hash addressing method for continuous speech recognition
Akinori Ito, Shozo Makino
Optimal speech recognition using phone recognition and lexical access
Andrej Ljolje, Michael D. Riley
A trellis-based language model for speech recognition
Nick Waegner, Steve J. Young
PARSEC: a constraint-based framework for spoken language understanding
Carla B. Zoltowski, Mary P. Harper, Leah H. Jamieson, Randall A. Helzerman
The HMM interface with hybrid grammar-bigram language models for speech recognition
G. J. F. Jones, J. H. Wright, E. N. Wrigley
A frame-synchronous continuous speech recognition algorithm using a top-down parsing of context-free grammar
Atsuhiko Kai, Seiichi Nakagawa
Empirical properties of finite state approximations for phrase structure grammars
Fernando Pereira, David Roe
Language modelling for recognition and understanding using layered bigrams
Stephanie Seneff, Helen Meng, Victor Zue
Using probabilistic shift-reduce parsing in speech recognition systems
David Goddeau
Broca, an integrated parser for spoken language
Tim Howells, David Friedman, Mark Fanty
Blank slate language processor for speech recognition
P. V. S. Rao, Nandini Bondale
Integrating two complementary approaches to spoken language understanding
Eric Jackson
Learning compatibility coefficients for word-class disambiguation relaxation processes
Marcello Pelillo, Mario Refice
INTERTALKER: an experimental automatic interpretation system using conceptual representation
Kaichiro Hatazaki, Jun Noguohi, Akitoshi Okumura, Kazunaga Yoshida, Takao Watanabe
Enhancement of ATR's spoken language translation system: SL-TRANS2
Tsuyoshi Morimoto, Toshiyuki Takezawa, Kazumi Ohkura, Masaaki Nagata, Fumihiro Yato, Shigeki Sagayama, Akira Kurematsu
Continuous speech recognition using a combination of syntactic constraints and dependency relationship
Tsuyoshi Morimoto
Automatic learning in spoken language understanding
Roberto Pieraccini, Zakhar Gorelov, Esther Levin, Evelyne Tzoukermann
Prespeech and early speech coarticulation: american English and Japanese characteristics
Michael P. Robb, Harold R. Bauer
Formation of phonological concept structures from spoken word samples
Hiroaki Kojima, Kazuyo Tanaka, Satoru Hayamizu
Acquisition of the French VOT contrasts by adult speakers of Mandarin Chinese
Bernard L. Rochet, Fangxin Chen
Phonology as a byproduct of learning to recognize and produce words: a connectionist model
Michael Gasser
The development of lexical effects on children's phoneme identifications
Michael S. Hurlburt, Judith C. Goodman
Word recognition before production of first words?
P. A. Halle, B. de Boysson-Bardies
The effect of fundamental frequency for vowel perception in infants
Toshisada Deguchi, Shigeru Kiritani, Akiko Hayashi, Fumi Katoh
Objective measurement of phoneme similarity
William C. Treurniet
Recognizing phonemes vs. recognizing phones: a comparison
Michael D. Riley, Andrej Ljolje
On the role of the segment in speech processing by human listeners: evidence from speech perception and from global sound similarity judgments
B. L. Derwing, Terrance M. Nearey, R. A. Beinert, T. A. Bendrien
The syllabic status of postvocalic resonants in an unwritten low German dialect
Grace E. Wiebe, Bruce L. Derwing
The influence of focus distribution and lexical stress on the temporal organisation of the syllable
Agaath Sluijter, Vincent J. van Heuven, A. H. Neijt
The development and perceptive evaluation of a model for paragraph intonation in dutch
Agaath Sluijter, Jacques Terken
Pause characteristics and local phrase-dependency structure in Japanese
Nobuyoshi Kaiki, Yoshinori Sagisaka
F0 synthesis based on a quantitative model of German intonation
Bernd Möbius, Matthias Pätzold
Factors affecting pitch accent placement
K. Ross, Mari Ostendorf, Stefanie Shattuck-Hufnagel
Prosodic correlates of discourse units in spontaneous speech
Marc Swerts, Ronald Geluykens, Jacques Terken
Prosody as a cue for discourse structure
Shin'ya Nakajima, James F. Allen
Some intonational characteristics of discourse structure
Barbara Grosz, Julia Hirschberg
Prosody and syntax in spoken sentences of standard Chinese
Hiroya Fujisaki, Keikichi Hirose, Haitao Lei
Modeling sentential stress in the context of a large vocabulary continuous speech recognizer
Kathleen Bishop
Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs
Kazumi Ohkura, Masahide Sugiyama, Shigeki Sagayama
Speaker adaptation by modifying mixture coefficients of speaker-independent mixture Gaussian HMMs
Tatsuo Matsubka, Kiyohiro Shikano
Minimization of speech alignment error by iterative transformation for speaker adaptation
Yifan Gong, Olivier Siohan, Jean-Paul Haton
Vector field smoothing principle for speaker adaptation
Hiroaki Hattori, Shigeki Sagayama
Spectral mapping onto probabilistic domain using neural networks and its application to speaker adaptive phoneme recognition
Tetsunori Kobayashi, Katsuhiko Shirai
An interactive system for automated pronunciation improvement
Jean-Paul Lefevre, Mervyn A. Jack, Claudio Maggio, Mario Refice, Fabio Gabrieli, Michelina Saving, Luigi Santangelo
Prosodic features for automated pronunciation improvement in the spell system
Edmund Rooney, Steven M. Hiller, John Laver, Mervyn A. Jack
Vowels pronunciation assessment in the spell system
Maria-Gabriella Di Benedetto, Fabrizio Carraro, Steven M. Hiller, Edmund Rooney
Self-organizing map with supervision for speech recognition
Franck Poirier
Topology preservation for speech recognition
Gregory R. De Haan, Ömer Egecioglu
Towards the performance limits of connectionist feature detectors
Gary Bradshaw, Alan Bell
Context-dependent and -independent self-structuring hidden control models for speech recognition
Helge B. D. Sorensen
Integration of frequential and temporal structurations in a symbolic learning system
Marie-José Caraty, Claude Montacié, Claude Barras
Smoothing hidden Markov models ay means of a self organizing feature map
E. Monte, José B. Marino, Eduardo LLeida
LVQ-based speech recognition with high-dimensional context vectors
Jyri Mantysalo, Kari Torkkola, Teuvo Kohonen
Application of self-organizing maps and LVQ in training continuous density hidden Markov models for phonemes
Mikko Kurimo, Kari Torkkola
Identification of mono- and poly-phonemes using acoustic-phonetic features derived by a self-organising neural network
Paul Dalsgaard, Ove Andersen
Using phoneme group specific LVQ-codebooks with HMMs
Pekka Utela, Samuel Kaski, Kari Torkkola
Speech segment network approach for an optimal synthesis unit set
Naoto Iwahashi, Yoshinori Sagisaka
ATR μ-talk speech synthesis system
Yoshinori Sagisaka, Nobuyoshi Kaiki, Naoto Iwahashi, Katsuhiko Mimura
On the development of a name pronunciation system
Bert Van Coile, Steven Leys, Luc Mortier
Consonants for female speech synthesis
Inger Karlsson
Diagnostic perceptual experiments for text-to-speech system evaluation
Jan P. H. van Santen
Comparison of natural and synthetic speech intelligibility for a reverse telephone directory service
Marcello Balestri, Enzo Foti, Luciano Nebbia, Mario Oreglia, Pier Luigi Salza, Stefano Sandri
A corpus-based synthesizer
Richard Sproat, Julia Hirschberg, David Yarowsky
High quality speech synthesis based on wavelet compilation of phoneme segments
Tomohisa Hirokawa, Kenzo Itoh, Hirokazu Sato
Inventory of phonetic contrasts generated by high-level control of a formant synthesizer
David R. Williams, Corine A. Bickley, Kenneth N. Stevens
Is % overall error rate a valid measure of speech synthesiser and natural speech performance at the segmental level?
Mikael Goldstein, Ove Till
Text-to-speech conversion for dutch: comprehensibility and acceptability
Willy Jongenburger, Renee van Bezooijen
The rhythm rules in Japanese based on the centers of energy gravity of vowels
Masayo Katoh, Shin'ichiro Hashimoto
Segmental power control for Japanese speech synthesis
Kenzo Itoh, Tomohisa Hirokawa, Hirokazu Sato
Glottal waveform synthesis with volterra shapers
Jean Schoentgen
Yet another rule compiler for text-to-speech conversion?
Ken Ceder, Bertil Lyberg
Prosody generation models constructed by considering speech tempo influence on prosody
Kazuhiko Iwata, Yukio Mitome
Extracting microprosodic information from diphones - a simple way to model segmental effects on prosody for synthetic speech
Alex I. C. Monaghan
Generation of natural sounding speech stimuli by means of linear cepstral interpolation
Arjan van Hessen
Prosodic encoding of syntactic structure for speech synthesis
W. Nick Campbell, Colin Wightman
A nucleus-based timing model applied to multi-dialect speech synthesis by rule
Susan R. Hertz, Marie K. Huffman
Evaluating the prosody of synthesized utterances within a dialogue system
Jill House, Nick Youd
Prosodics in a syllable-based text-to-speech synthesis system
Marcel Tatham, Eric Lewis
From lexicon to rules: toward a descriptive method of French text-to-phonetics transcription
R. Belrhali, Véronique Aubergé, Louis-Jean Boe
Formant transformation from male to female synthetic voices
Marianne Elmlund, Ida Frehr, Niels Reinholt Petersen
Multilingual phoneme to grapheme conversion system based on HMM
P. A. Rentzepopoulos, George K. Kokkinakis
Fundamental frequency control using linguistic information
Noriyo Hara, Hisayoshi Tsubaki, Hisashi Wakita
A comparison of statistical and rule based methods of determining segmental durations
Andrew P. Breen
Generation and extraction of high quality synthesis units
J. R. Andrews, K. M. Curtis, Volker Kraft
Evaluating the overall comprehensibility of speech synthesizers
T. Boogaart, Kim Silverman
Automatic generation of optimized unit dictionaries for text to speech synthesis
Olivier Boeffard, Laurent Miclet, S. White
Relationships between syllable, word and sentence intelligibilities of synthetic speech
Hideki Kasuya, Seiki Kasuya
Unrestricted text-to-speech revisited: rhythm and intonation
David R. Hill, Craig-Richard Schock, Leonard C. Manzara
Wavelet speech synthesizer in the classroom and speech laboratory
Anton J. Rozsypal
HADIFIX - a speech synthesis system for German
Thomas Portele, Birgit Steffan, Rainer Preuß, Walter F. Sendlmeier, Wolfgang Hess
Two different methodologies for evaluating the comprehension of synthetic passages
Cristina Delogu, S. Conte, A. Paoloni, C. Sementina
A target-interpolation model for the intonation of dutch
Carlos Gussenhoven, Toni Rietveld
Best exemplars of English velar stops: a first report
Katharine Davis, Patricia K. Kuhl
Implementation of a model for lexical access based on features
Kenneth N. Stevens, Sharon Y. Manuel, Stefanie Shattuck-Hufnagel, Sharlene Liu
Perception of aperiodic speech signals
Dieter Huber
Acceptability and discrimination threshold for distortion of segmental duration in Japanese words
Hiroaki Kato, Minoru Tsuzaki, Yoshinori Sagisaka
Two level acoustic cues for consistent stop identification
Anne Bonneau, Sylvie Coste, Linda Djezzar, Yves Laprie
Vowel classification based on analysis-by-synthesis
Rolf Carlson, James Glass
Extrinsic normalization of vowel formant values based on cardinal vowels mapping
Maria-Gabriella Di Benedetto, Jean-Sylvain Lienard
Applications of generalized linear modeling to vowel data
Terrance M. Nearey
Some comments on invariance, variability and perceptual normalization in speech perception
David B. Pisoni
Words and voices: perceptual details are preserved in lexical representations
Stephen D. Goldinger, Thomas J. Palmeri, David B. Pisoni
Speech enhancement using a statistically derived filter mapping
Yan Ming Cheng, Douglas O'Shaughnessy, Peter Kabal
Hidden Markov model state-based cepstral noise compensation
V. L. Beattie, Steve J. Young
A computational model of auditory scene analysis
Guy J. Brown, Martin P. Cooke
A new dual-channel speech enhancement technique with application to CELP coding in noise
S. Nandkumar, John H. L. Hansen, Robert J. Stets
CUMULANT - based voicing decision in noise corrupted speech
Asunción Moreno, José A. R. Fonollosa
Selectively trained neural networks for the discrimination of normal and lombard speech
Yolande Anglade, Dominique Fohr, Jean-Claude Junqua
The use of cohort normalized scores for speaker verification
Aaron E. Rosenberg, Joel DeLong, Chin-Hui Lee, Biing-Hwang Juang, Frank K. Soong
Speaker recognition using concatenated phoneme models
Tomoko Matsui, Sadaoki Furui
Speaker identification through a modular connectionist architecture: evaluation on the timit database
Younes Bennani
AR-vector models for free-text speaker recognition
Claude Montacié, Jean-Luc Le Floch
Rapid non-supervised speaker adaptation of semicontinuous hidden Markov models
Florian Schiel
Rule-based recognition of phoneme classes
D. Ederveen, Louis Boves
A new method of speaker-independent speech recognition using multiphone HMM
Jie Yi, Kei Miki
A speaker adaptation based on corrective training and learning vector quantization
Myoung-Wan Koo, Chong-Kwan Un
Phoneme recognition in continuous speech based on mutual information considering phonemic duration and connectivity
Katsuhiko Shirai, Shigeki Okawa, Tetsunori Kobayashi
A real-time speaker-independent continuous speech recognition system based on demi-syllable units
Shinji Koga, Ryosuke Isotani, Satoshi Tsukada, Kazunaga Yoshida, Kaichiro Hatazaki, Takao Watanabe
Speech recognition in noisy environments
Saeed V. Vaseghi, Ben P. Milner
An enhanced interpolation technique for context-specific probability estimation in speech and language modelling
Fergus R. McInnes
Channel adaptation for a continuous speech recognizer
Lorenzo Fissore, Pietro Laface, G. Micca, G. Sperto
A new algorithm for connected digit recognition
S. Cifuentes, J. Colas, M. Savoji, José M. Pardo
Stochastic modeling of syllable-based units for continuous speech recognition
Günther Ruske, Bernd Plannerer, Tanja Schultz
HARK: an experimental speech recognition system
David M. Goblirsch, Toffee A. Albina
The SSS-LR continuous speech recognition system: integrating SSS-derived allophone models and a phoneme-context-dependent LR parser
Akito Nagai, Jun-Ichi Takami, Shigeki Sagayama
J-SUMMIT: a Japanese segment-based speech recognition system
Shinsuke Sakai, Michael Phillips
Optimal discriminative training for HMMs to recognize noisy speech
Shinobu Mizuta, Kunio Nakajima
Architecture and algorithms of a real-time word recognizer for telephone input
Shingo Kuroiwa, Kazuya Takeda, Fumihiro Yato, Seiichi Yamamoto, Kunihiko Owa, Makoto Shozakai, Ryuji Matsumoto
Speaker independent speech recognition method using word spotting technique and its application to VCR programming
Hiroyasu Kuwano, Kazuya Nomura, Atsushi Ookumo, Shoji Hiraoka, Taisuke Watanabe, Katsuyuki Niyada
Transputer implementation of front-end processors for speech recognition systems
S. Lennon, E. Ambikairajah
Phoneme HMM evaluation algorithm without phoneme labeling
Yasuhiro Minami, Tatsuo Matsuoka, Kiyohiro Shikano
Architecture of a configurable application interface for speech recognition systems
A. Noll, H. Bergmann, H. H. Hamer, Annedore Paeseler, H. Tomaschewski
An interactive environment for speech recognition research
Mark Fanty, John Pochmara, Ron Cole
An approach to unlimited vocabulary continuous speech recognition based on context-dependent phoneme modeling
Y. Abe, K. Nakajima
Acoustic subword models in the berkeley restaurant project
Chuck Wooters, Nelson Morgan
SIRtrain, an open standard environment for CHMM recognizer development
Claus Nedergaard Jacobsen
Segmented trellis algorithms for the continuous speech recognition
Yutaka Kobayashi, Yasuhisa Niimi
A. 46,500 word Chinese speech recognition system
Bo Xu, Z. W. Lin, Taiyi Huang, D. X. Xu, Y. Q. Gao
Study of the time extension flat net for speech recognition
Dao Wen Chen
A hidden Markov model structure for the acquisition of speech by machine, ASM
Frank Fallside
Speaker-independent keyword recognition based on SMQ/HMM
Yasuyuki Masai, Shin'ichi Tanaka, Tsuneo Nitta
CRIM's spontaneous speech recognition system for the ATIS task
Regis Cardin, Diane Goupil, Roxane Lacouture, Evelyne Millien, Charles Snow, Yves Normandin
Improved connected digit recognition using spectral variation functions
F. Brugnara, Renato De Mori, D. Giuliani, Maurizio Omologo
Alternative preprocessing techniques for discrete hidden Markov model phoneme recognition
Andrew Tridgell, Bruce Millar, Kim-Anh Do
Linguistic modelling in the context of oral dialogue
Gerhard Th. Niedermair
Static and dynamic predictions : a method to improve speech understanding in cooperative dialogues
Frangois Andry
Dialogue semantics for an oral dialogue system
Paul Heisterkamp, Scott McGlashan, Nick Youd
Using pragmatics to rule out recognition errors in cooperative task-oriented dialogues
Masaaki Nagata
A real-time speech dialogue system using spontaneous speech understanding
Yoichi Takebayashi, Hiroyuki Tsubo, Yoichi Sadamoto, Hideki Hashimoto, Hideaki Shinchi
A semantic and pragmatic analysis of tone and intonation in Mandarin Chinese
Li-chiung Yang
On prosodic features in speech - comparative studies between Japanese and standard Chinese
Yoshimasa Tsukuma
Prosodic encoding of English speech
W. Nick Campbell
Prediction of syllable duration, speech rate and tempo
Gunnar Fant, Anita Kruckenberg, Lennart Nord
Experiments with emotive speech - acted utterances and synthesized replicas
Rolf Carlson, Björn Granström, Lennart Nord
Phonetic properties of dutch accent lending pitch movements under time pressure
J. Caspars, Vincent J. van Heuven
Judgments of relative prominence for adjacent and non-adjacent accents
Jacques Terken, Karin van den Hombergh
A perceptual study of French intonation
F. Beaugendre, Christophe d'Alessandro, Anne Lacheret-Dujour, Jacques Terken
The phonetics of IGBO tone
Mark Liberman, J. Michael Schultz, Soonhyun Hong, Vincent Okeke
Stress shift as pitch accent placement: within-word early accent placement in american English
Stefanie Shattuck-Hufnagel
Adding emotion to synthetic speech dialogue systems
Katherine Morton
Emotional modalities and intonation in spoken language
Cari Spring, Donna Erickson, Thomas Call
Are any "press-conferences", "interviews" or "dialogues" true dialogues?
Tatiana Slama-Cazacu
The categorization of the dialects and speech styles of north american English
Arthur J. Bronstein
Phonetic differences between read and spontaneous speech
Eleonora Blaauw
Changing speech styles: strategies in read speech and casual and careful spontaneous speech
Maxine Eskenazi
Usage of words and sentence structures in spontaneous versus text material
Noriko Umeda, Karen Wallace, Josephine Horna
Statistical and linguistic analyses of F0 in read and spontaneous speech
Nancy A. Daly, Victor Zue
Spontaneous speech in English and Italian
Linda Shockey, Edda Farnetani
Further optimisation of a robust IMELDA speech recogniser for applications with severely degraded speech
Claude Lefebvre, Dariusz A. Zwierzyriski, David R. Starks, Gary Birch
Multiple approaches to robust speech recognition
Richard M. Stern, Fu-Hua Liu, Yoshiaki Ohshima, Thomas M. Sullivan, Alejandro Acero
Speaker-independent spoken digit recognition in noisy environments using dynamic spectral features and neural networks
Tadashi Kitamura, Satoshi Ando, Etsuro Hayahara
ICARUS: an mwave-based real-time speech recognition system in noise and lombard effect
Douglas A. Cairns, John H. L. Hansen
Word recognition in the car: adapting recognizers to new environments
C. Mokbel, L. Barbier, Y. Kerlou, Gérard Chollet
German announcements using synthetic speech the Gauss system
P. Meyer, Hans-Wilhelm Rühl, L. L. M. Vogten
Intelligent dialogues in automated telephone services
Mervyn A. Jack, J. C. Foster, F. W. Stentiford
Experience with a dialogue description formalism for realistic applications
Palle Bach Nielsen, Anders Baekgaard
Compensating for additive-noise in automatic speech recognition
Solomon Lerner, Baruch Mazor
Continuous speech recognition for medical diagnoses using a character trigram model
Sho-ichi Matsunaga, Toshiaki Tsuboi, Tomokazu Yamada, Kiyohiro Shikano
A three-dimensional FEM simulation of the effects of the vocal tract shape on the transfer function
Chengxiang Lu, Takayoshi Nakai, Hisayoshi Suzuki
Mandibular contributions to speech production
Kiyoshi Oshimat, Vincent L. Gracco
Measurement of three-dimensional shapes of vocal tract and nasal cavity using magnetic resonance imaging technique
Masafumi Matsumura
Electromyographie studies on the production of pitch contour in accentless dialects in Japanese
Shigeru Kiritani, Hajime Hirose, Kikuo Maekawa, Tsutomu Sato
Improvements of magnetometer sensing system for monitoring tongue point movements during speech
Yorinobu Sonoda, Kohichi Ogata
Inverse filtering of the glottal waveform using the Itakura-saito distortion measure
Paavo Alku
Measurement of intraoral sound pressure distributions of Japanese vowels
Kunitoshi Motoki, Nobuhiro Miki
Non-linear annotation of multi-channel speech data
Alain Marchal, William J. Hardcastle, K. Nicolaidis, N. Nguyen, F. Gibbon
A phoneme labelling workbench using HMM and spectrogram reading knowledge
Shingo Fujiwara, Yasuhiro Komori, Masahide Sugiyama
Automatic discovery of acoustic measurements for phonetic classification
Michael Phillips, Victor Zue
Detection of unknown words and automatic estimation of their transcriptions in continuous speech recognition
Itou Katunobu, Hayamizu Satoru, Tanaka Hozumi
A HMM-based system for automatic segmentation and labeling of speech
F. Brugnara, D. Falavigna, Maurizio Omologo
A modification of the viterbi algorithm for stochastic phonographic transduction
Robert W. P. Luk, Robert I. Damper
Criteria for labelling prosodic aspects of English speech
Paul C. Bagshaw, Briony J. Williams
DTW-based phonetic labeling using explicit phoneme duration constraints
Yifan Gong, Jean-Paul Haton
TOBI: a standard for labeling English prosody
Kim Silverman, Mary Beckman, John Pitrelli, Mori Ostendorf, Colin Wightman, Patti Price, Janet Pierrehumbert, Julia Hirschberg
Consistency of judgements in manual labelling of phonetic segments: the distinction between clear and unclear cases
Barbara Eisen, Hans-Günther Tillmann, Christoph Draxler
Vocal tract area functions of Swedish vowels and a new three-parameter model
Gunnar Fant
Acoustic and production pilot studies of speech vowels produced in noise
Jean-Claude Junqua
Active models for regularizing formant trajectories
Yves Laprie, Marie-Odile Berger
Vowel-consonant-vowel transitions: analysis, modeling, and synthesis
Rene Carré, Samir Chennoukh, Mohamad Mrayati
Representing the tongue surface with curve fits
Maureen Stone, Subhash Lele
Muscle forces in vowel vocal tract formation
Katherine S. Harris, Eric Vatikiotis-Bateson, Peter J. Alfonso
Neural network modeling of speech motor control
Makoto Hirayama, Eric Vatikiotis-Bateson, Mitsuo Kawato, Kiyoshi Honda
The articulatory dynamics of running speech: gestures from phonemes?
Eric Vatikiotis-Bateson, Makoto Hirayama, Kiyoshi Honda, Mitsuo Kawato
Phonetic analyses of the TIMIT corpus of american English
Patricia Keating, B. Blankenship, D. Byrd, E. Flemming, Y. Todaka
Sex, dialects, and reduction
Dani Byrd
Phonetic universals and hindi segment duration
Manjari Ohala, John J. Ohala
Acoustic and articulatory correlates of contrastive emphasis in repeated corrections
Donna Erickson, Osamu Fujimura
Effects of context and redundancy in the perception of naturally produced English vowels
Gary N. Tajchman, Marcia A. Bush
A telephone speech database of spelled and spoken names
Ronald Cole, Krist Roginski, Mark Fanty
The OGI multi-language telephone speech corpus
Yeshwant K. Muthusamy, Ronald A. Cole, Beatrice T. Oshika
The design for the wall street journal-based CSR corpus
Douglas B. Paul, Janet M. Baker
Multi-site data collection for a spoken language corpus - MAD COW
Lynette Hirschman
Collection and analyses of WSJ-CSR corpus at MIT
Michael Phillips, James Glass, Joseph Polifroni, Victor Zue
Connectionist gender adaptation in a hybrid neural network / hidden Markov model speech recognition system
Victor Abrash, Horacio Franco, Michael Cohen, Nelson Morgan, Yochai Konig
Hybrid neural network/hidden Markov model continuous-speech recognition
Michael Cohen, Horacio Franco, Nelson Morgan, David Rumelhart, Victor Abrash
Semantic hidden Markov networks
Gernot A. Fink, Franz Kummert, Gerhard Sagerer, Ernst-Günter Schukat-Talamazzini, Heinrich Niemann
Hesitation sounds: is there coarticulation across pause?
Use Lehiste, Donna Erickson
Acoustic analysis of laughter
Corine A. Bickley, Sheri Hunnicutt
Analysis of false starts in spontaneous speech
Douglas O'Shaughnessy
Processing disfluent speech: recognising disfluency before lexical access
Robin J. Lickley, Ellen G. Bard
A flexible multimodal dialogue architecture independent of the application
Philippe Morin, Jean-Claude Junqua, Jean-Marie Pierrel
Familiarity with the language transcribed and context as determinants of intratranscriber agreement
Catia Cucchiarini, Renee van Bezooijen
Intonation of clause-internal filled pauses
Elizabeth E. Shriberg, Robin J. Lickley
User behaviors affecting speech recognition
Elizabeth Wade, Elizabeth Shriberg, Patti Price
The lip benefit: auditory and visual intelligibility of French speech in noise
Christian Benoît, Tayeb Mohamadi
Phonological assessment of deaf children's productive knowledge as a basis for speech-training
Anne-Marie Öster
Factors affecting voicing distinction of stops for the hearing impaired
Hideaki Seki, Akiko Hayashi, Satoshi Imaizumi, Takehiko Harada, Hiroshi Hosoi
Investigations into the auditory F0 speechreading enhancement effect using a sinusoidal replica of the F0 contour
Arthur Boothroyd, Robin S. Waldstein, Eddy Yeung
Some considerations on pitch and timing control in deaf children
Francesco Cutugno
Rate of speech effects in aphasia: an acoustic analysis of voice onset time
Shari R. Baum
Fundamental frequency attributes following unilateral left or right temporal lobe lesion
Parth M. Bhatt
Cue extraction and integration in speech perception for the hearing impaired
Hiroshi Hosoi, Satoshi Imaizumi, Akiko Hayashi, Takehiko Harada, Hideaki Seki
The relationship between spectral details in naturally produced vowels and identification errors in noise and reverberation
Anna K. Nabelek
Speech processing effects on intelligibility for hearing-impaired listeners
Donald G. Jamieson, Leonard Cornelisse
Chinese recognition and synthesis system based on Chinese syllables
Mo Fuyuan, Li Changli, Chen Tao
Accelerated stochastic approximation method based parameter estimation of monosyllables and their recognition using a neural network
Hirofumi Yogo, Naoki Inagaki
Diphone-based speech recognition using time-event neural networks
Toomas Altosaar, Matti Karjalainen
Segment based variable frame rate speech analysis and recognition using a spectral variation function
Giovanni Flammia, Paul Dalsgaard, Ove Andersen, Borge Lindberg
Intelligibility of the French spoken in France compared across listeners from France and from the Ivory Coast
Christian Benoît
Dialect-dependent speech recognizers for canadian and european French
Julie Brousseau, Sally Anne Fox
Automatic segmentation and identification of ten languages using telephone speech
Yeshwant K. Muthusamy, Ronald A. Cole
Speaker-independent, text-independent language identification by HMM
Seiichi Nakagawa, Yoshio Ueda, Takashi Seino
A discrimination method between Japanese dialects
Shuichi Itahashi, Tsutomu Yamashita
Pathological voice analysis using cepstra, bispectra and group delay functions
B. Boyanov, Gérard Chollet
Lateralization of speech sounds by binaural distributing processing
Qianje Fu, Peyu Xia, Ren Hua Wang
Timing of pitch movements and perceived vowel duration
H. H. Rump
Studies of glottal excitation and vocal tract parameters using inverse filtering and a parameterized input model
J. P. Liu, G. Baudoin, Gérard Chollet
Speeded detection of vowels and steady-state consonants
Dennis Norris, Brit van Ooyen, Anne Cutler
Temporal factors in the perception of consonants for different age and hearing impairment groups
Elzbieta B. Slawinski
The role of F3 and F4 in identifying place of articulation for stop consonants
Abeer Alwan
A new measure for perceptual weight of acoustic cues: an experiment on voicing in French intervocalic [t,d]
Thomas R. Sawallis
Objective speech quality assessment in patients with intra-oral cancers: voiceless fricatives
Alan A. Wrench, Mervyn A. Jack, John Laver, M. S. Jackson, D. S. Soutar, A. G. Robertson, J. MacKenzie
Tongue contact, active articulators, and coarticulation
Bruce Connell
Cross-languages differences in the identification of intervocalic stop consonants by Japanese and dutch listeners
Makio Kashino, Astrid van Wieringen, Louis C. W. Pols
Effects of typicality and interstimulus interval on the discrimination of speech stimuli: within-subject comparison
Minoru Tsuzaki
Perceptual studies on vowels excised from continuous speech
Ronald A. Cole, Yeshwant K. Muthusamy
The relative perceptual salience of spectral and durational differences
Raymond S. Weitzman
Can 'level words' from one speaking style become teaks' when spliced into another speaking style?
Florien J. Koopmans-van Beinum
Speech errors and task demand
Beverley Gable, Helen Nemeth, Martin Haran
Analysis of phonation type using laryngographic techniques
John H. Esling, B. Craig Dickson, Roy C. Snell
Effect of prototypes of vowels on speech perception in Japanese and English
Sumi Shigeno
Characteristics of voice picked up from outer skin of larynx
Tomo-o Morohashi, Tetsuya Shimamura, Hiroyuki Yashima, Jouji Suzuki
Coding of voicing in whispered plosives
Igor V. Nabelek
Performance on a nonsense syllable test using the articulation index
Margaret F. Cheesman, Shelly Lawrence, Allison Appleyard
CSRE: a speech research environment
Donald G. Jamieson, Ketan Ramji, Issam Kheirallah, Terrance M. Nearey
A study of F0 reset in naturally-read utterances in Japanese
Kazue Hata, Yoko Hasegawa
On the nature of tone sandhi rules in taiwanese
H. Samuel Wang, Fu-Dong Chiu
How shallow is phonology: declarative phonologies meet fast speech
Geoffrey S. Nathan
Analyzing postposition drops in spoken Japanese
Junko Hosaka, Toshiyuki Takezawa, Noriyoshi Uratani
Fundamental frequency patterns of Chinese in different speech modes
Jialu Zhang, Xinghui Hu
The multifarious r-sound
Knut Kvale, Ante Kjell Foldvik
The role of preaspiration duration in the voicing contrast in skolt sami
Zita McRobbie-Utasi
Parameter setting for abstract stress in tokyo Japanese
Eiji Yamada
A method for studying prosody in texts read aloud
Georg E. Ottesen
Linguistic versus phonetic explanation of consonant lengthening after short vowels: a contrastive study of dutch and English
Vincent J. van Heuven
Comparing phoneme and feature based speech recognition using artificial neural networks
Kjell Elenius, Mats Blomberg
Prosodic cues to the perception of syntactic boundaries
Eva Strangert
A new model of intonation for use with speech synthesis and recognition
Paul Taylor, Stephen Isard
Computerized error detection/correction in teaching German sounds: some problems and solutions
Rudolf Weiss
Velum and epiglottis behavior during the production of Arabic pharyngeals and laryngeals: a fiberscopic study
Ahmed M. Elgendy
A prosodic comparison of spontaneous speech and read speech
Kim Silverman, Eleonora Blaauw, Judith Spitz, John F. Pitrelli
Phonological and psychological evidence that listeners normalize the speech signal
John J. Ohala, Maria Grazia Busa, Karen Harrison
Intonation and the request/question distinction
Elizabeth A. Hinkelman
The English voicing contrast as velocity perturbation
Robert F. Port, Fred Cummins
How many phonologies are there in one speaker? some experimental evidence
Michael S. Ziolkowski, Mayumi Usami, Karen L. Landahl, Brenda K. Tunnock
Decomposition into syllable complexes and the accenting of Japanese loanwords
Hirokazu Sato
Temporal structure in bisyllabic word frame: an evidence for relational invariance and variability from standard Chinese
Jianfen Cao
The integration of phonetics and phonology: a case study of taiwanese "gemination" and syllable structure
Shih-ping Wang
Towards a robust speech interface for teleoperation systems
James H. Bradford
Phonetic recognition experiments with recurrent neural networks
Piero Cosi, P. Frasconi, M. Gori, N. Griggio
Some aspects on context and response range effects when assessing naturalness of Swedish sentences generated by 4 synthesiser systems
Mikael Goldstein, Björn Lindström, Ove Till
Probabilistic prediction of parts-of-speech from word spelling using decision trees
Marcello Pelillo, Franca Moro, Mario Refice
Single word detection system with a neural classifier for recognizing speech at variable levels of background noise
D. Barschdorff, U. Gartner
A rapid semi-automatic simulation technique for investigating interactive speech and handwriting
Sharon Oviatt, Philip Cohen, Martin Fong, Michael Frank
Speech understanding on a massively parallel computer
Sang-Hwa Chung, Dan Moldovan
Rationale for "performance phonology"
Chan-Do Lee
The effect of information feedback on the performance of a phoneme recognizer using kohonen map
Takuya Koizumi, Jyoji Urata, Shuji Taniguchi
A method of dialogue management for the speech response system
Yasuharu Asano, Keikichi Hirose, Hiroya Fujisaki
Syllable duration prediction for speech recognition
Yumi Takizawa, Eiichi Tsuboka
Comparison between two methodologies of testing isolated word speech recognizers
F. Canavesio, G. Castagneri, G. Di Fabbrizio, F. Senia
Extracting fuzzy features from MLP for recognition of speech
He Jun, Henri Leich
A fuzzy partition model (FPM) neural network architecture for speaker-independent continuous speech recognition
Keiji Fukuzawa, Yoshinaga Kato, Masahide Sugiyama
Conception of speech filters based on a neural network
A. Ennaji, Jean Rouat
Speaker set identification through speaker group modeling
Jeff Kuo, Chin-Hui Lee, Aaron E. Rosenberg
Identification of principal ergonomic requirements for interactive spoken language systems
Stephen Springer, Sara Basson, Judith Spitz
Performance of the united kingdom intelligent network automatic speech recognition system
Thomas E. Jacobs, Eric R. Buhrke
Evaluation of parsing strategies in natural language spoken man-machine dialogue
Guy Deville, Pierre Mousel
An information retrieval system with a speech interface
Yasuhisa Niimi, Yutaka Kobayashi
Phoneme performance in speaker recognition
J. P. Eatock, J. S. D. Mason
Natural language processing in the chronus system
Evelyne Tzoukermann, Roberto Pieraccini, Zakhar Gorelov
Contribution of neural networks for phoneme identification in the APHODEX expert system
Dominique Francois, Dominique Fohr
A CSR-NL interface architecture
Douglas B. Paul
Speech interface for a man-machine dialog with the unix operating system
R. Lefebvre, F. Poirier, G. Duncan
Transformation of databases for the evaluation of speech recognizers
P. Bardaud, F. Capman, C. Mokbel, C. Tadj, Gérard Chollet
Dialog management for speech output from concept representation
Yoichi Yamashita, Riichiro Mizoguchi
Speaker verification using locations and sizes of multipulses on neural networks
Seiichiro Hangai, Shigetoshi Sugiyama, Kazuhiro Miyauchi
Word rejection using multiple sink models
Carlos J. Teixeira, Isabel M. Trancoso
Verification of language specific performance factors from recogniser testing on EUROM.1 CVC material
Boerge Lindberg
Modeling task driven oral dialogue
Alain Cozannet
Introducing neural predictor to hidden Markov model for speech recognition
Wei-ying Li, Kechu Yi, Zheng Hu
A neural network based on subnets - SNN
Feng Liu, Jianxin Jiang, Jun Cheng, Kechu Yi
Syntactic anaphora resolution in a speech understanding system
Ute Ziegenhain
The dialog module of the speech recognition and dialog system EVAR
Marion Mast, Ralf Kompe, Franz Kummert, Heinrich Niemann, Elmar Noth
Statistical recovery of wideband speech from narrowband speech
Yan Ming Cheng, Douglas O'Shaughnessy, Paul Mermelstein
Speaker related variability in cepstral representations of dutch speech segments
Henk van den Heuvel, Toni Rietveld
Experiences from a real-world telephone application: teledialogue
Per Rosenbeck, Bo Baungaard
Robust estimation of time-varying LP parameters on speech
K. Y. Lee, P. Ha, J. Rheem, S. Ann, I. Song
On the AR modelling of the one-sided autocorrelation sequence for noisy speech recognition
Javier Hernando, Climent Nadeu, Eduardo Lleida
Robust pitch detection by narrow band spectrum analysis
Hiroshi Shimodaira, Mitsuru Nakai
A microcomputer-based system for real-time analysis and display of laryngograph signals
S. Eady, B. Craig Dickson, Roy C. Snell, J. Woolsey, P. Ollek, A. Wynrib, J. Clayards
Parse scoring with prosodic information
N. M. Veilleux, Mari Ostendorf, Colin Wightman
Topic identification using a neural network with a keyword-spotting preprocessor
Ying Cheng, Paul Fortier, Yves Normandin
Frequency domain speech coding
Shane Switzer, Tim Anderson, Matthew Kabrisky, Steven K. Rogers, Bruce Suter
MEDIATEX-TASF: a closed captioning real-time service in French
Raymond Descout, Robert Bergeron, Bernard Meriald
The wavelet transform for speech analysis
S. A. Wilde, K. M. Curtis
Problems and algorithms in optimal linguistic decoding: a unified formulation
Pablo Aibar, Andres Marzal, Enrique Vidal, Francisco Casacuberta
A spectro-temporal analysis of speech based on nonlinear operators
Jean Rouat, Sylvain Lemieux, Alain Migneault
A PC graphic tool for speech research based on a DSP board
Miguel A. Berrojo, Javier Corrales, Jesus Macias, Santiago Aguilera
A spoken language dialogue system for automatic collection of spontaneous speech
Satoru Hayamizu, Katunobu Itou, Masafumi Tamoto, Kazuyo Tanaka
A powerful disambiguating mechanism for speech understanding systems based on ATMs
Shingo Nishioka, Yoichi Yamashita, Riichiro Mizoguchi
A mixed Gaussian-stochastic code book for CELP coder in LSP speech coding
Najib Naja, Jean Marc Boucher, Samir Saoudi
A method to estimate the transfer function of ARMA model of speech wave using prony method and homomorphic analysis
Hiroyuki Kamata, Yoshihisa Ishida
An integrated dialogue design and continuous speech recognition system environment
Boerge Lindberg, Bjarne Andersen, Anders Baekgaard, Tom Broendsted, Paul Dalsgaard, Jan Kristiansen
The PSH/DISPE helium speech cdrom
Alain Marchal, C. Meunier, P. Gavarry
Article |
---|