doi: 10.21437/ICSLP.1998
ISSN: 2958-1796
Unsupervised training of phone duration and energy models for text-to-speech synthesis
Paul C. Bagshaw
Improved duration modeling of English phonemes using a root sinusoidal transformation
Jerome R. Bellegarda, Kim E. A. Silverman
Efficient adaptation of TTS duration model to new speakers
Chilin Shih, Wentao Gu, Jan P. H. van Santen
Duration modeling for HMM-based speech synthesis
Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura
Prosody prediction for speech synthesis using transformational rule-based learning
Cameron S. Fordyce, Mari Ostendorf
Representing the environments for phonological processes in an accent-independent lexicon for synthesis of English
Susan Fitt, Stephen Isard
Efficient lexical retrieval for English text-to-speech synthesis
Daniel Faulkner, Charles Bryant
The IBM trainable speech synthesis system
Robert E. Donovan, Ellen M. Eide
Prosynth: an integrated prosodic approach to device-independent, natural-sounding speech synthesis
Sarah Hawkins, Jill House, Mark Huckvale, John Local, Richard Ogden
Total quality evaluation of speech synthesis systems
Jialu Zhang, Shiwei Dong, Ge Yu
Comparative evaluation of synthetic prosody with the PURR method
Gerit P. Sonntag, Thomas Portele
SABLE: a standard for TTS markup
Richard Sproat, Andrew Hunt, Mari Ostendorf, Paul Taylor, Alan W. Black, Kevin Lenzo, Mike Eddington
Prosodic vs. segmental contributions to naturalness in a diphone synthesizer
H. Timothy Bunnell, Steve R. Hoskins, Debra Yarrington
A mixed-excitation frequency domain model for time-scale pitch-scale modification of speech
Alex Acero
Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS)
Masami Akamine, Takehiko Kagoshima
Modeling the microprosody of pitch and loudness for speech synthesis with neural networks
Martti Vainio, Toomas Altosaar
Spectral smoothing for concatenative speech synthesis
David T. Chappell, John H. L. Hansen
MIMIC: a voice-adaptive phonetic-tree speech synthesiser
Aimin Chen, Saeed Vaseghi, Charles Ho
Automatic generation of Korean pronunciation variants by multistage applications of phonological rules
Jehun Jeon, Sunhwa Cha, Minhwa Chung, Jun Park, Kyuwoong Hwang
Techniques for accurate automatic annotation of speech waveforms
Stephen Cox, Richard Brady, Peter Jackson
Optimized stopping criteria for tree-based unit selection in concatenative synthesis
Andrew Cronk, Michael W. Macon
Automatic transcription of intonation using an identified prosodic alphabet
Stephanie de Tournemire
Frequency analysis of phonetic units for concatenative synthesis in Catalan
Ignasi Esquerra, Albert Febrer, Climent Nadeu
Investigating the syntactic characteristics of English tone units
Alex Chengyu Fang, Jill House, Mark Huckvale
The UPC text-to-speech system for Spanish and Catalan
Antonio Bonafonte, Ignasi Esquerra, Albert Febrer, José A.R. Fonollosa, Francesc Vallverdu
The new version of the ROMVOX text-to-speech synthesis system based on a hybrid time domain-LPC synthesis technique
Attila Ferencz, Istvan Nagy, Tunde-Csilla Kovacs, Maria Ferencz, Teodora Ratiu
An F0 contour control model for totally speaker driven text to speech system
Takehiko Kagoshima, Masahiro Morita, Shigenobu Seto, Masami Akamine
On the relationship of speech rates with prosodic units in dialogue speech
Keikichi Hirose, Hiromichi Kawanami
On the reduction of concatenation artefacts in diphone synthesis
Esther Klabbers, Raymond Veldhuis
Error analysis and confidence measure of Chinese word segmentation
Chih-Chung Kuo, Kun-Yuan Ma
Energy contour generation for a sentence using a neural network learning method
Jungchul Lee, Donggyu Kang, Sanghoon Kim, Koengmo Sung
A computational algorithm for F0 contour generation in Korean developed with prosodically labeled databases using K-ToBI system
Yong-Ju Lee, Sook-hyang Lee, Jong-Jin Kim, Hyun-Ju Ko, Young-Il Kim, Sang-Hun Kim, Jung-Cheol Lee
Rapid-deployment text-to-speech in the DIPLOMAT system
Kevin Lenzo, Christopher Hogan, Jeffrey Allen
Formant diphone parameter extraction utilising a labelled single-speaker database
Robert H. Mannell
A new synthetic speech/sound control language
Osamu Mizuno, Shin'ya Nakajima
A study on the natural-sounding Japanese phonetic word synthesis by using the VCV-balanced word database that consists of the words uttered forcibly in two types of pitch accent
Ryo Mochizuki, Yasuhiko Arai, Takashi Honda
Letter to sound rules for accented lexicon compression
Vincent Pagel, Kevin Lenzo, Alan W. Black
A name announcement algorithm with memory size and computational power constraints
Ze'ev Roth, Judith Rosenhouse
How a French TTS system can describe loanwords
Frederique Sannier, Rabia Belrhali, Véronique Aubergé
Improvements in Slovene text-to-speech synthesis
Tomaz Sef, Ales Dobnikar, Matjaz Gams
Automatic rule generation for linguistic features analysis using inductive learning technique: linguistic features analysis in TOS drive TTS system
Shigenobu Seto, Masahiro Morita, Takehiko Kagoshima, Masami Akamine
Segmental duration control based on an articulatory model
Yoshinori Shiga, Hiroshi Matsuura, Tsuneo Nitta
Text analysis for the Bell Labs French text-to-speech system
Evelyne Tzoukermann
Modeling vowel duration for Japanese text-to-speech synthesis
Jennifer J. Venditti, Jan P. H. van Santen
Towards a Chinese text-to-speech system with higher naturalness
Ren-Hua Wang, Qinfeng Liu, Yongsheng Teng, Deyu Xia
A phonologically motivated method of selecting non-uniform units
Andrew P. Breen, Peter Jackson
A synthesis method based on concatenation of demisyllables and a residual excited vocal tract model
Steve Pearson, Nick Kibre, Nancy Niedzielski
Exploration of acoustic correlates in speaker selection for concatenative synthesis
Ann K. Syrdal, Alistair Conkie, Yannis Stylianou
A perceptual evaluation of distance measures for concatenative speech synthesis
Johan Wouters, Michael W. Macon
HMM-based smoothing for concatenative speech synthesis
Mike Plumpe, Alex Acero, Hsiao-Wuen Hon, Xuedong Huang
A nonlinear unit selection strategy for concatenative speech synthesis based on syllable level features
Martin Holzapfel, Nick Campbell
How to handle "foreign" sounds in Swedish text-to-speech conversion: approaching the 'xenophone' problem
Robert Eklund, Anders Lindström
Multi-lingual concatenative speech synthesis
Nick Campbell
On the use of F0 features in automatic segmentation for speech synthesis
Takashi Saito
A linguistic and prosodic database for data-driven Japanese TTS synthesis
Atsuhiro Sakurai, Takashi Natsume, Keikichi Hirose
Text-to-speech voice adaptation from sparse training data
Alexander Kain, Michael W. Macon
Describing intonation with a parametric model
Gregor Möhler
An educational dialogue system with a user controllable dialogue manager
Joakim Gustafson, Patrik Elmberg, Rolf Carlson, Arne Jonsson
End-user driven dialogue system design: the reward experience
Klaus Failenschmid, J.H. Simon Thornton
The design of a multi-domain Mandarin Chinese spoken dialogue system
Yi-Chung Lin, Tung-Hui Chiang, Heui-Ming Wang, Chung-Ming Peng, Chao-Huang Chang
An integrated dialogue system for the automation of call centre services
Kallirroi Georgila, Anastasios Tsopanoglou, Nikos Fakotakis, George Kokkinakis
An event driven model for dialogue systems
Kuansan Wang
Automatic classification of dialogue contexts for dialogue predictions
Cosmin Popovici, Paolo Baggia, Pietro Laface, Loreta Moisa
Automatic identification of command boundaries in a conversational natural language user interface
Ganesh N. Ramaswamy, Jan Kleindienst
The predictive power of game structure in dialogue act recognition: experimental results using maximum entropy estimation
Massimo Poesio, Andrei Mikheev
A schema based approach to dialog control
Paul C. Constantinides, Scott Hansma, Chris Tchou, Alexander I. Rudnicky
Expanding a time-sensitive conversational architecture for turn-taking to handle content-driven interruption
Gregory Aist
On different functions of repetitive utterances
Marc Swerts, Hanae Koiso, Atsushi Shimojima, Yasuhiro Katagiri
Prosody-based detection of the context of backchannel responses
Hiroaki Noguchi, Yasuharu Den
Robust interpretation for spoken dialogue systems
Lena Strombäck, Arne Jonsson
System-user interaction and response strategy in spoken dialogue system
Yohei Okato, Keiji Kato, Mikio Yamamoto, Shuichi Itahashi
Organizing self-motivated dialogue with autonomous creatures
Noriko Suzuki, Kazuo Ishii, Michio Okada
Fly with the EAGLES: evaluation of the "ACCeSS" spoken language dialogue system
Gerhard Hanrieder, Paul Heisterkamp, Thomas Brey
SQEL: a multilingual and multifunctional dialogue system
Maria Aretoulaki, Stefan Harbeck, Florian Gallwitz, Elmar Nöth, Heinrich Niemann, Jozef Ivanecky, Ivo Ipsic, Nikola Pavesic, Vaclav Matousek
Semi-automated incremental prototyping of spoken dialog systems
Stefan Kaspar, Achim Hoffmann
Beyond structured dialogues: factoring out grounding
Peter A. Heeman, Michael Johnston, Justin Denney, Edward Kaiser
A robust dialogue model for spoken dialogue processing
Masahiro Araki, Shuji Doshita
The REWARD service creation environment. An overview
Tom Brøndsted, Bo Nygaard Bai, Jesper Østergaard Olsen
An analysis of the timing of turn-taking in a corpus of goal-oriented dialogue
Matthew Bull, Matthew Aylett
The provision of corrective feedback in a spoken dialogue CALL system
Sarah Davies, Massimo Poesio
Evaluation of dialog strategies for a tourist information retrieval system
Laurence Devillers, Helene Bonneau-Maynard
Designing a multimodal dialogue system for information retrieval
Sadaoki Furui, Koh'ichiro Yamaguchi
The research project of man-computer dialogue system in Chinese
Dinghua Guan, Min Chu, Quan Zhang, Jian Liu, Xiangdong Zhang
Interfaces for speech recognition systems: the impact of vocabulary constraints and syntax on performance
Kate S. Hone, David Golightly
Pacing spoken directions to suit the listener
Tatsuya Iwase, Nigel Ward
A spoken dialogue system utilizing spatial information
Annika Flycht-Eriksson, Arne Jonsson
From novice to expert: the effect of tutorials on user expertise with spoken dialogue systems
Candace A. Kamm, Diane J. Litman, Marilyn A. Walker
Emergent computational dialogue management architecture for task-oriented spoken dialogue systems
Takeshi Kawabata
An analysis of dialogues with our dialogue system through a WWW page
Tadahiko Kumamoto, Akira Ito
Modelling spoken dialogues with state transition diagrams: experiences with the CSLU toolkit
Michael F. McTear
Situated dialogue coordination for spoken dialogue systems
Michio Okada, Noriko Suzuki, Jacques Terken
Robust spoken dialogue systems for consumer products: a concrete application
Xavier Pouteau, Luis Arevalo
A German dialogue system for scheduling dates and meetings by naturally spoken continuous speech
Daniel Willett, Arno Romer, Jörg Rottland, Gerhard Rigoll
Spoken dialogue system using corpus-based hidden Markov model
Chung-Hsien Wu, Gwo-Lang Yan, Chien-Liang Lin
A realistic Wizard of Oz simulation of a multimodal spoken language system
Peter Wyard, Gavin Churcher
A syllable-based Chinese spoken dialogue system for telephone directory services primarily trained with a corpus
Yen-Ju Yang, Lin-Shan Lee
How disagreement expressions are used in cooperative tasks
Hiroyuki Yano, Akira Ito
Tones of a tridialectal: acoustic and perceptual data on ten linguistic tonetic contrasts between Lao, Nyo and Standard Thai
Phil Rose
Tone sandhi between complex tones in a seven-tone Southern Thai dialect
Napier Guy Ian Thompson
The acoustic and perceptual features of tone in the Tibeto-Burman language Ao Naga
Alexander Robertson Coupe
The differential status of semivowels in the acoustic phonetic realisation of tone
Phil Rose
De-accentuation: linguistic environments and prosodic realizations
Kai Alter, Karsten Steinhauer, Angela D. Friederici
Towards an automatic classification of emotions in speech
N. Amir, S. Ron
Can we hear smile?
Marc Schröder, Véronique Aubergé, Marie-Agnes Cathiard
The automatic marking of prominence in spontaneous speech using duration and part of speech information
Matthew Aylett, Matthew Bull
On a pitch alteration technique in excited cepstral spectrum for high quality TTS
JongDeuk Kim, SeongJoon Baek, MyungJin Bae
Dovetailing of acoustics and prosody in spontaneous speech recognition
Jan Buckow, Anton Batliner, Richard Huber, Elmar Nöth, Volker Warnke, Heinrich Niemann
A computational memory and processing model for prosody
Janet E. Cahn
Convergence of fundamental frequencies in conversation: if it happens, does it matter?
Belinda Collins
Analysis and interpretation of fundamental frequency contours of British English in terms of a command-response model
Hiroya Fujisaki, Sumio Ohno, Takashi Yagi, Takeshi Ono
Common patterns in word level prosody
Frode Holm, Kazue Hata
Prosodic structure in Japanese spontaneous speech
Yasuo Horiuchi, Akira Ichikawa
An acoustic-phonetic description of word tone in Kagoshima Japanese
Shunichi Ishihara
Representing prosodic words using statistical models of moraic transition of fundamental frequency contours of Japanese
Koji Iwano, Keikichi Hirose
Disambiguation of Korean utterances using automatic intonation recognition
Tae-Yeoub Jang, Minsuck Song, Kiyeong Lee
Multi-level rhythm control for speech synthesis using hybrid data driven and rule-based approaches
Oliver Jokisch, Diane Hirschfeld, Matthias Eichner, Rüdiger Hoffmann
EGG model of ditoneme in Mandarin
Jiangping Kong
Temporal organization of speech for normal and fast rates
Geetha Krishnan, Wayne Ward
A syllable-based generalization of Japanese accentuation
Haruo Kubozono
Non-adjacent segmental effects in tonal realization of accentual phrase in Seoul Korean
Hyuck-Joon Lee
Improvement on connected numbers recognition using prosodic information
Eduardo López, Javier Caminero, Ismael Cortazar, Luis A. Hernández
Phonetic investigation of boundary pitch movements in Japanese
Kazuaki Maeda, Jennifer J. Venditti
Phonetic and phonological characteristics of paralinguistic information in spoken Japanese
Kikuo Maekawa
ToBI accent type recognition
Arman Maghbouleh
The influence of syllable structure on the timing of intonational events in German
Hansjorg Mixdorff, Hiroya Fujisaki
New prosodic control rules for expressive synthetic speech
Osamu Mizuno, Shin'ya Nakajima
The use of F0 reliability function for prosodic command analysis on F0 contour generation model
Mitsuru Nakai, Hiroshi Shimodaira
Analysis of effects of lexical accent, syntax, and global speech rate upon the local speech rate
Sumio Ohno, Hiroya Fujisaki, Hideyuki Taguchi
On the effects of speech rate upon parameters of the command-response model for the fundamental frequency contours of speech
Sumio Ohno, Hiroya Fujisaki, Yoshikazu Hara
The maximum-based description of F0 contours and its application to English
Thomas Portele, Barbara Heuft
Perceived prominence and acoustic parameters in American English
Thomas Portele
Generating emotional speech with a concatenative synthesizer
Erhard Rank, Hannes Pirker
A perceptive measure of pure prosody linguistic functions with reiterant sentences
Albert Rilliard, Véronique Aubergé
Prosodic parameters in emotional speech
Kazuhito Koike, Hirotaka Suzuki, Hiroaki Saito
Automatic detection of prominence (as defined by listeners' judgements) in read aloud Dutch sentences
Barbertje M. Streefkerk, Louis C. W. Pols, Louis F.M. ten Bosch
A schema for illocutionary act identification with prosodic feature
Masafumi Tamoto, Takeshi Kawabata
An algorithm for choosing Japanese acknowledgments using prosodic cues and context
Wataru Tsukahara
A study of tones and tempo in continuous Mandarin digit strings and their application in telephone quality speech recognition
Chao Wang, Stephanie Seneff
Simulated emotions: an acoustic study of voice and perturbation measures
Sandra P. Whiteside
A robust tone recognition method of Chinese based on sub-syllabic F0 contours
Jin-song Zhang, Keikichi Hirose
The microprosodics of tone sandhi in Shanghai disyllabic compounds
Xiaonong Sean Zhu
Jitter and shimmer differences between pathological voices of school children
Natalija Bolfan-Stosic, Tatjana Prizl
What spreads, and how? Tonal rightward spreading on Shanghai disyllabic compounds
Xiaonong Sean Zhu
Tonal complexity as a dialectal feature: 25 different citation tones from four Zhejiang Wu dialects
Sean Zhu, Phil Rose
Emotional speech synthesis: from speech database to TTS
Juan Manuel Montero, Juana M. Gutierrez-Arriola, Sira Palazuelos, Emilia Enriquez, Santiago Aguilera, José Manuel Pardo
Some acoustic characteristics of emotion
Cecile Pereira, Catherine Watson
Intonative structure as a determinant of word order variation in Dutch verbal endgroups
Marc Swerts
Experiments on the meaning of two pitch accent types: the 'pointed hat' versus the accent-lending fall in Dutch
Johanneke Caspers
Phonetic and phonological markers of contrastive focus in Korean
Sun-Ah Jun, Hyuck-Joon Lee
Reconciling two competing views on contrastiveness
Emiel Krahmer, Marc Swerts
The tilt intonation model
Paul Taylor
Analysis of occurrence of pauses and their durations in Japanese text reading
Hiroya Fujisaki, Sumio Ohno, Seiji Yamada
A statistical study of pitch target points in five languages
Estelle Campione, Jean Véronis
Fully automatic prosody generator for text-to-speech
Fabrice Malfrère, Thierry Dutoit, Piet Mertens
Automatic prosodic labeling of 6 languages
Halewijn Vereecken, Jean-Pierre Martens, Cynthia Grover, Justin Fackrell, Bert Van Coile
Automatic utterance type detection using suprasegmental features
Helen Wright
A contrastive study of lexical stress placement in Singapore English and British English
Ee Ling Low, Esther Grabe
Integrated recognition of words and phrase boundaries
Florian Gallwitz, Anton Batliner, Jan Buckow, Richard Huber, Heinrich Niemann, Elmar Nöth
Phrase accents revisited: comparative evidence from Standard and Cypriot Greek
Amalia Arvaniti
Phonetic invariance and phonological stability: Lithuanian pitch accents
Grzegorz Dogil, Gregor Möhler
A HMM-based recognition system for perceptive relevant pitch movements of spontaneous German speech
Christel Brindöpke, Gernot A. Fink, Franz Kummert, Gerhard Sagerer
Towards a reversible symbolic coding of intonation
Jean Véronis, Estelle Campione
Nonreciprocal data sharing in estimating HMM parameters
Xiaoqiang Luo, Frederick Jelinek
Data-driven extensions to HMM statistical dependencies
Jeff A. Bilmes
Use of high-level linguistic constraints for constructing feature-based phonological model in speech recognition
Jiping Sun, Li Deng
Real-time probabilistic segmentation for segment-based speech recognition
Steven C. Lee, James R. Glass
Toward Markov random field modeling of speech
Guillaume Gravier, Marc Sigelle, Gérard Chollet
Hidden Markov models for trajectory modeling
Rukmini Iyer, Herbert Gish, Man-Hung Siu, George Zavaliagkos, Spyros Matsoukas
A statistical phonemic segment model for speech recognition based on automatic phonemic segmentation
Katsura Aizawa, Chieko Furuichi
Improved feature decorrelation for HMM-based speech recognition
Kris Demuynck, Jacques Duchateau, Dirk Van Compernolle, Patrick Wambacq
Efficient high-order hidden Markov modelling
J. A. du Preez, D. M. Weber
A time-synchronous, tree-based search strategy in the acoustic fast match of an asynchronous speech recognition system
Ellen M. Eide, Lalit R. Bahl
Effective structural adaptation of LVCSR systems to unseen domains using hierarchical connectionist acoustic models
Jürgen Fritsch, Michael Finke, Alex Waibel
Support vector machines for speech recognition
Aravind Ganapathiraju, Jonathan Hamaker, Joseph Picone
Natural number recognition using discriminatively trained inter-word context dependent hidden Markov models
Malan B. Gandhi
Information theoretic approaches to model selection
Jonathan Hamaker, Aravind Ganapathiraju, Joseph Picone
Continuous speech recognition using segmental unit input HMMs with a mixture of probability density functions and context dependency
Kengo Hanai, Kazumasa Yamamoto, Nobuaki Minematsu, Seiichi Nakagawa
Gaussian density tree structure in a multi-Gaussian HMM-based speech recognition system
Jacques Simonin, Lionel Delphin-Poulat, Geraldine Damnati
Generalized phone modeling based on piecewise linear segment lattice
Hiroaki Kojima, Kazuyo Tanaka
A flexible method of creating HMM using block-diagonalization of covariance matrices
Ryosuke Koshiba, Mitsuyoshi Tachimori, Hiroshi Kanazawa
HMM topology selection for accurate acoustic and duration modeling
C. Chesta, Pietro Laface, F. Ravera
Context-dependent duration modelling for continuous speech recognition
Tan Lee, Rolf Carlson, Björn Granström
Training of context-dependent subspace distribution clustering hidden Markov model
Brian Mak, Enrico Bocchieri
Unsupervised training of HMMs with variable number of mixture components per state
Cesar Martin del Alamo, Luis Villarrubia, Francisco Javier Gonzalez, Luis A. Hernández
Acoustic observation context modeling in segment based speech recognition
Mate Szarvas, Shoichi Matsunaga
Capturing discriminative information using multiple modeling techniques
Ji Ming, Philip Hanna, Darryl Stewart, Saeed Vaseghi, F. Jack Smith
Suprasegmental duration modelling with elastic constraints in automatic speech recognition
Laurence Molloy, Stephen Isard
An adaptive gradient-search based algorithm for discriminative training of HMM's
Albino Nogueiras-Rodriguez, José B. Mariño, Enric Monte
Task adaptation of sub-lexical unit models using the minimum confusibility criterion on task independent databases
Albino Nogueiras-Rodriguez, José B. Mariño
Stochastic calculus, non-linear filtering, and the internal model principle: implications for articulatory speech recognition
Gordon Ramsay
The use of meta-HMM in multistream HMM training for automatic speech recognition
Christian J. Wellekens, Jussi Kangasharju, Cedric Milesi
Enhanced ASR by acoustic feature filtering
Christian J. Wellekens
Soft state-tying for HMM-based speech recognition
Christoph Neukirchen, Daniel Willett, Gerhard Rigoll
Estimation of models for non-native speech in computer-assisted language learning based on linear model combination
Silke Witt, Steve Young
Duration modeling using cumulative duration probability and speaking rate compensation
Tae-Young Yang, Ji-Sung Kim, Chungyong Lee, Dae Hee Youn, Il-Whan Cha
Probabilistic modeling with Bayesian networks for automatic speech recognition
Geoffrey Zweig, Stuart Russell
Sub-band based speaker verification using dynamic recombination weights
Perasiriyan Sivakumaran, Aladdin M. Ariyaeeinia, Jill A. Hewitt
Measuring the dynamic encoding of speaker identity and dialect in prosodic parameters
Michael Barlow, Michael Wagner
German regional variants - a problem for automatic speech recognition?
Nicole Beringer, Florian Schiel, Peter Regel-Brietzmann
Improving accent identification through knowledge of English syllable structure
Kay Berkling, Marc A. Zissman, Julie Vonwiller, Christopher Cleirigh
Multi-dimensional scaling of listener responses to complex auditory stimuli
Z. S. Bond, Donald Fucci, Verna Stockmal, Douglas McColl
Same talker, different language
Verna Stockmal, Danny R. Moates, Z. S. Bond
The impact of regional variety upon specific word categories in spontaneous German
Susanne Burger, Daniela Oppermann
Speech pre-processing against intentional imposture in speaker recognition
Dominique Genoud, Gérard Chollet
A comparison of two unsupervised approaches to accent identification
Mike Lincoln, Stephen Cox, Simon Ringland
The influence of accents in Australian English vowels and their relation to articulatory tract parameters
Dominik R. Dersch, Christopher Cleirigh, Julie Vonwiller
Automatic language recognition using high-order HMMs
J. A. du Preez, D. M. Weber
Speaker recognition using residual signal of linear and nonlinear prediction models
Marcos Faundez-Zanuy, Daniel Rodriguez-Porcheron
An implementation and evaluation of an on-line speaker verification system for field trials
Yong Gu, Trevor Thomas
Speaker verification on the POLYCOST database using frequency filtered spectral energies
Javier Hernando, Climent Nadeu
A high-performance text-independent speaker identification system based on BCDM
Qin Jin, Luo Si, Qixiu Hu
Representation of voice quality features associated with talker individuality
Hiroshi Kido, Hideki Kasuya
Candidate selection based on significance testing and its use in normalisation and scoring
Ji-Hwan Kim, Gil-Jin Jang, Seong-Jin Yun, Yung Hwan Oh
Japanese forensic phonetics: non-contemporaneous within-speaker variation in natural and read-out speech
Yuko Kinoshita
Statistical modeling of pronunciation and production variations for speech recognition
Filipp Korkmazskiy, Biing-Hwang Juang
Dialect maps and dialect research; useful tools for automatic speech recognition?
Arne Kjell Foldvik, Knut Kvale
Text independent speaker recognition using micro-prosody
Youn-Jeong Kyung, Hwang-Soo Lee
Speaker verification using fundamental frequency
Yoik Cheng, Hong C. Leung
On optimum normalization method used for speaker verification
Weijie Liu, Toshihiro Isobe, Naoki Mukawa
Recurrent substrings and data fusion for language recognition
Harvey Lloyd-Thomas, Eluned S. Parris, Jeremy H. Wright
Text-independent speaker recognition using multiple information sources
Konstantin P. Markov, Seiichi Nakagawa
Discriminative training of GMM using a modified EM algorithm for speaker recognition
Konstantin P. Markov, Seiichi Nakagawa
Language identification incorporating lexical information
Driss Matrouf, Martine Adda-Decker, Lori F. Lamel, Jean-Luc Gauvain
A VQ based speaker recognition system based in histogram distances. Text independent and for noisy environments
Enric Monte, Ramon Arqué, Xavier Miró
Spanish dialects: phonetic transcription
Asunción Moreno, José B. Mariño
Acoustic analysis of Japanese English prosody: comparison between Fukushima dialect speakers and Tokyo dialect speakers in declarative sentences and yes-no questions
Mieko Muramatsu
A context-dependent approach for speaker verification using sequential decision
Hideki Noda, Katsuya Harada, Eiji Kawaguchi, Hidefumi Sawai
Quantitative influence of speech variability factors for automatic speaker verification in forensic tasks
Javier Ortega-García, Santiago Cruz-Llanas, Joaquin Gonzalez-Rodriguez
Creating hidden Markov models for fast speech
Thilo Pfau, Guenther Ruske
Speaker identification using relaxation labeling
Tuan Pham, Michael Wagner
A novel technique for the combination of utterance and speaker verification systems in a text-dependent speaker verification task
Leandro Rodriguez-Linares, Carmen García-Mateo
A forensic phonetic investigation into non-contemporaneous variation in the F-pattern of similar-sounding speakers
Phil Rose
Human vs. machine speaker identification with telephone speech
Astrid Schmidt-Nielsen, Thomas H. Crystal
A comparison of fusion techniques in mel-cepstral based speaker identification
Stefan Slomka, Sridha Sridharan, Vinod Chandran
On the influence of hyperarticulated speech on recognition performance
Hagen Soltau, Alex Waibel
Text-independent speaker identification and verification using the TIMIT database
Nuala C. Ward, Dominik R. Dersch
Incorporating linguistic knowledge into automatic dialect identification of Spanish
Lisa R. Yanguas, Gerald C. O'Leary, Marc A. Zissman
A novel text-independent speaker verification method using the global speaker model
Yiying Zhang, Xiaoyan Zhu
Speaker detection in broadcast speech databases
Aaron E. Rosenberg, Ivan Magrin-Chagnolleau, S. Parthasarathy, Qian Huang
Multilateral techniques for speaker recognition
Eluned S. Parris, Michael J. Carey
Real time speaker indexing based on subspace method - application to TV news articles and debate
Masafumi Nishida, Yasuo Ariki
SHEEP, GOATS, LAMBS and WOLVES: a statistical analysis of speaker performance in the NIST 1998 speaker recognition evaluation
George Doddington, Walter Liggett, Alvin Martin, Mark Przybocki, Douglas A. Reynolds
Progress in speaker recognition at Dragon Systems
Andres Corrada-Emmanuel, Michael Newman, Barbara Peskin, Lawrence Gillick, Robert Roth
A comparative study of speaker verification systems using the POLYCOST database
Tomas Nordström, Haakan Melin, Johan Lindberg
Robust speaker verification insensitive to session-dependent utterance variation and handset-dependent distortion
Tomoko Matsui, Kiyoaki Aikawa
A comparative evaluation of variance flooring techniques in HMM-based speaker verification
Haakan Melin, Johan W. Koolwaaij, Johan Lindberg, Frédéric Bimbot
Text-independent speaker verification using automatically labelled acoustic segments
Dijana Petrovska-Delacretaz, Jan Cernocky, Jean Hennebert, Gérard Chollet
A fast decoding algorithm based on sequential detection of the changes in distribution
Qi Li
Speaker verification with ensemble classifiers based on linear speech transforms
Jesper Østergaard Olsen
Speaker recognition based on discriminative projection models
Jesper Østergaard Olsen
On the convergence of Gaussian mixture models: improvements through vector quantization
James Moody, Stefan Slomka, Jason Pelecanos, Sridha Sridharan
Modeling dynamic prosodic variation for speaker verification
Kemal Sönmez, Elizabeth Shriberg, Larry Heck, Mitchel Weintraub
Blind clustering of speech utterances based on speaker and language characteristics
Douglas A. Reynolds, Elliot Singer, Beth A. Carlson, Gerald C. O'Leary, Jack J. McLaughlin, Marc A. Zissman
Spoken language identification using the SpeechDat corpus
Diamantino Caseiro, Isabel M. Trancoso
Automatic language identification with perceptually guided training and recurrent neural networks
Jerome Braun, Haim Levkowitz
On the importance of components of the modulation spectrum for speaker verification
Sarel van Vuuren, Hynek Hermansky
A fast method of producing talking head mouth shapes from real speech
Andrew P. Breen, O. Gloaguen, P. Stern
The efficiency of multimodal interaction: a case study
Phil R. Cohen, Michael Johnston, David McGee, Sharon L. Oviatt, Joshua Clow, Ira Smith
Audio and audio-visual perception of consonants disturbed by white noise and 'cocktail party'
Laszlo Czap
Overview of the Maya spoken language system
Simon Downey, Andrew P. Breen, Maria Fernández, Edward Kaneen
Automatic recognition of spontaneous speech dialogues
Mauro Cettolo, Daniele Falavigna
Using an animated talking character in a web-based city guide demonstrator
Georg Fries, Stefan Feldes, Alfred Corbet
Influence of facial views on the McGurk effect in auditory noise
Rika Kanzaki, Takashi Kato
The IntelliMedia WorkBench - a generic environment for multimodal systems
Tom Brøndsted, Lars Bo Larsen, Michael Manthey, Paul McKevitt, Thomas B. Moeslund, Kristian G. Olesen
STAMP: a suite of tools for analyzing multimodal system processing
Joshua Clow, Sharon L. Oviatt
Cultural similarities and differences in the recognition of audio-visual speech stimuli
Sumi Shigeno
A multimodal-input multimedia-output guidance system: MMGS
Toshiyuki Takezawa, Tsuyoshi Morimoto
HMM-based visual speech recognition using intensity and location normalization
Oscar Vanegas, Akiji Tanaka, Keiichi Tokuda, Tadashi Kitamura
A hierarchy probability-based visual features extraction method for speechreading
Yanjun Xu, Limin Du, Guoqiang Li, Ziqiang Hou
Integration of talking heads and text-to-speech synthesizers for visual TTS
Jörn Ostermann, Mark C. Beutnagel, Ariel Fischer, Yao Wang
Speech driven 3-D face point trajectory synthesis algorithm
Levent M. Arslan, David Talkin
Speech-to-lip movement synthesis based on the EM algorithm using audio-visual HMMs
Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano
Learning words from natural audio-visual input
Deb Roy, Alex Pentland
Using the multi-stream approach for continuous audio-visual speech recognition: experiments on the M2VTS database
Stéphane Dupont, Juergen Luettin
Referential features and linguistic indirection in multimodal language
Sharon L. Oviatt, Karen Kuhn
Multimodal language processing
Michael Johnston
Implementation of coordinative nodding behavior on spoken dialogue systems
Jun-ichi Hirasawa, Noboru Miyazaki, Mikio Nakano, Takeshi Kawabata
Use of non-verbal information in communication between human and robot
Masao Yokoyama, Kazumi Aoyama, Hideaki Kikuchi, Katsuhiko Shirai
What you see is (almost) what you hear: design principles for user interfaces for accessing speech archives
Steve Whittaker, John Choi, Julia Hirschberg, Christine H. Nakatani
Improving accuracy of telephony-based, speaker-independent speech recognition
Daniel Azzopardi, Shahram Semnani, Ben Milner, Richard Wiseman
Rejection in speech recognition systems with limited training
Aruna Bayya
A four layer sharing HMM system for very large vocabulary isolated word recognition
Ruxin Chen, Miyuki Tanaka, Duanpei Wu, Lex Olorenshaw, Mariscela Amador
A comparative study of hybrid modelling techniques for improved telephone speech recognition
Rathinavelu Chengalvarayan
Smoothing and tying for Korean flexible vocabulary isolated word recognition
Jae-Seung Choi, Jong-Seok Lee, Hee-Youn Lee
Recent work on a preselection module for a flexible large vocabulary speech recognition system in telephone environment
Javier Ferreiros, Javier Macias-Guarasa, Ascensión Gallardo, José Colás, Ricardo Córdoba, José Manuel Pardo, Luis Villarrubia
A study of noise robustness for speaker independent speech recognition method using phoneme similarity vector
Masakatsu Hoshimi, Maki Yamada, Katsuyuki Niyada, Shozo Makino
Classification of Taiwanese tones based on pitch and energy movements
Fran H. L. Jian
Phoneme-based recognition for the Norwegian SpeechDat(II) database
Finn Tore Johansen
Robust feature extraction for alphabet recognition
Montri Karnjanadecha, Stephen A. Zahorian
Recognition of connected digit speech in Japanese collected over the telephone network
Hisashi Kawai, Norio Higuchi
Improving the speaker-dependency of subword-unit-based isolated word recognition
Takuya Koizumi, Shuji Taniguchi, Kazuhiro Kohtoh
Speaker independent speech recognition method using constrained time alignment near phoneme discriminative frame
Tomohiro Konuma, Tetsu Suzuki, Maki Yamada, Yoshio Ohno, Masakatsu Hoshimi, Katsuyuki Niyada
A nonstationary autoregressive HMM with gain adaptation for speech recognition
Ki Yong Lee, Joohun Lee
A large-vocabulary Taiwanese (MIN-NAN) multi-syllabic word recognition system based upon right-context-dependent phones with state clustering by acoustic decision tree
Ren-yuan Lyu, Yuang-jin Chiang, Wen-ping Hsieh
Speech recognition based on the distance calculation between intermediate phonetic code sequences in symbolic domain
Kazuyo Tanaka, Hiroaki Kojima
High accuracy Chinese speech recognition approach with Chinese input technology for telecommunication use
York Chung-Ho Yang, June-Jei Kuo
Robust speech recognition using HMM's with Toeplitz state covariance matrices
William J.J. Roberts, Yariv Ephraim
Modeling of output probability distribution to improve small vocabulary speech recognition in adverse environments
David Thambiratnam, Sridha Sridharan
Robust and compact multilingual word recognizers using features extracted from a phoneme similarity front-end
Philippe Morin, Ted H. Applebaum, Robert Boman, Yi Zhao, Jean-Claude Junqua
An effect of adaptive beamforming on hands-free speech recognition based on 3-D Viterbi search
Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano
Coherence-based subband decomposition for robust speech and speaker recognition in noisy and reverberant rooms
Joaquin Gonzalez-Rodriguez, Santiago Cruz-Llanas, Javier Ortega-García
A minimax search algorithm for CDHMM based robust continuous speech recognition
Hui Jiang, Keikichi Hirose, Qiang Huo
Performance improvements through combining phone- and syllable-scale information in automatic speech recognition
Su-Lin Wu, Brian E. D. Kingsbury, Nelson Morgan, Steven Greenberg
Predictive adaptation and compensation for robust speech recognition
Arun C. Surendran, Chin-Hui Lee
Influence of the speaking style and the noise spectral tilt on the Lombard reflex and automatic speech recognition
Jean-Claude Junqua, Steven Fincke, Ken Field
Data-driven PMC and Bayesian learning integration for fast model adaptation in noisy conditions
Stefano Crafa, Luciano Fissore, Claudio Vair
Improving the noise and spectral robustness of an isolated-word recognizer using an auditory-model front end
Martin Hunke, Meeran Hyun, Steve Love, Thomas Holton
A model for speech reverberation and intelligibility restoring filters
Owen P. Kenny, Douglas J. Nelson
Linear and nonlinear speech feature analysis for stress classification
Guojun Zhou, John H. L. Hansen, James F. Kaiser
Speech feature modeling for robust stressed speech recognition
Sahar E. Bou-Ghazale, John H. L. Hansen
Combining articulatory and acoustic information for speech recognition in noisy and reverberant environments
Katrin Kirchhoff
Improving speaker identification performance in reverberant conditions using lip information
Timothy Wark, Sridha Sridharan
Spectral sequence compensation based on continuity of spectral sequence
Masato Akagi, Mamoru Iwaki, Noriyoshi Sakaguchi
Robust features for speech recognition systems
Aruna Bayya, B. Yegnanarayana
Interfacing of CASA and partial recognition based on a multistream technique
Frédéric Berthommier, Hervé Glotin, Emmanuel Tessier, Hervé Bourlard
An RNN-based compensation method for Mandarin telephone speech recognition
Sen-Chia Chang, Shih-Chieh Chien, Chih-Chung Kuo
Robust speech recognition using discriminative stream weighting and parameter interpolation
Stephen M. Chu, Yunxin Zhao
Acoustic backing-off in the local distance computation for robust automatic speech recognition
Johan de Veth, Bert Cranen, Louis Boves
Noise model selection for robust speech recognition
Laura Docio-Fernández, Carmen García-Mateo
A novel iterative signal enhancement algorithm for noise reduction in speech
Simon Doclo, Ioannis Dologlou, Marc Moonen
Missing data reconstruction for robust automatic speech recognition in the framework of hybrid HMM/ANN systems
Stéphane Dupont
Recognition from GSM digital speech
Ascensión Gallardo-Antolin, Fernando Diaz-de-Maria, Francisco J. Valverde-Albacete
Conversational speech systems for on-board car navigation and assistance
Petra Geutner, Matthias Denecke, Uwe Meier, Martin Westphal, Alex Waibel
A signal processing system for having the sound "pop-out" in noise thanks to the image of the speaker's lips: new advances using multi-layer perceptrons
Laurent Girin, Laurent Varin, Gang Feng, Jean-Luc Schwartz
Robust speech activity detection in the presence of noise
Ruhi Sarikaya, John H. L. Hansen
Robust automatic speech recognition by the application of a temporal-correlation-based recurrent multilayer neural network to the mel-based cepstral coefficients
Michel Héon, Hesham Tolba, Douglas O'Shaughnessy
Speech recognition from GSM codec parameters
Juan M. Huerta, Richard M. Stern
Improved parallel model combination based on better domain transformation for speech recognition under noisy environments
Jeih-Weih Hung, Jia-Lin Shen, Lin-Shan Lee
Robust speech/non-speech detection in adverse conditions based on noise and speech statistics
Lamia Karray, Jean Monne
Speech recognition in car noise environments using multiple models according to noise masking levels
Myung Gyu Song, Hoi In Jung, Kab-Jong Shim, Hyung Soon Kim
Spectral noise subtraction with recursive gain curves
Klaus Linhard, Tim Haulick
A novel robust speech recognition algorithm based on multi-models and integrated decision method
Shengxi Pan, Jia Liu, Jintao Jiang, Zuoying Wang, Dajin Lu
On the interaction between time and frequency filtering of speech parameters for robust speech recognition
Dusan Macho, Climent Nadeu
Inference of missing spectrographic features for robust speech recognition
Bhiksha Raj, Rita Singh, Richard M. Stern
SNR-dependent flooring and noise overestimation for joint application of spectral subtraction and model combination
Volker Schless, Fritz Class
Improved robust speech recognition considering signal correlation approximated by Taylor series
Jia-Lin Shen, Jeih-Weih Hung, Lin-Shan Lee
Speech recognition in noisy environment using weighted projection-based likelihood measure
Won-Ho Shin, Weon-Goo Kim, Chungyong Lee, Il-Whan Cha
Evaluation of model adaptation by HMM decomposition on telephone speech recognition
Tetsuya Takiguchi, Satoshi Nakamura, Kiyohiro Shikano, Masatoshi Morishima, Toshihiro Isobe
Comparative experiments to evaluate a voiced-unvoiced-based pre-processing approach to robust automatic speech recognition in low-SNR environments
Hesham Tolba, Douglas O'Shaughnessy
Signal extraction from noisy signal based on auditory scene analysis
Masashi Unoki, Masato Akagi
Frequency domain binaural model as the front end of speech recognition system
Tsuyoshi Usagawa, Kenji Sakai, Masanao Ebata
A study on the recognition of low bit-rate encoded speech
An-Tzyh Yu, Hsiao-Chuan Wang
Weighted parallel model combination for noisy speech recognition
Tai-Hwei Hwang, Hsiao-Chuan Wang
Favourable and unfavourable short duration segments of speech in noise
Daniel Woo
Auditory modeling techniques for robust pitch extraction and noise reduction
Piero Cosi, Stefano Pasquin, Enrico Zovato
Wavelet transform-based speech enhancement
Eliathamby Ambikairajah, Graham Tattersall, Andrew Davis
A practical perceptual frequency autoregressive HMM enhancement system
Beth Logan, Tony Robinson
An effective quality evaluation protocol for speech enhancement algorithms
John H. L. Hansen, Bryan L. Pellom
An adaptive beamforming microphone array system using a blind deconvolution
Jin-Nam Park, Tsuyoshi Usagawa, Masanao Ebata
Speech enhancement using critical band spectral subtraction
Latchman Singh, Sridha Sridharan
A three-dimensional linear articulatory model based on MRI data
Pierre Badin, Gérard Bailly, Monica Raybaudi, Christoph Segebarth
On loops and articulatory biomechanics
Pascal Perrier, Yohan Payan, Joseph Perkell, Frédéric Jolly, Majid Zandipour, Melanie Matthies
Magnetic resonance measurements of the velum port opening
Didier Demolin, Véronique Lecuit, Thierry Metens, Bruno Nazarian, Alain Soquet
Cantilever-type force-sensor-mounted palatal plate for measuring palatolingual contact stress and pattern during speech phonation
Masafumi Matsumura, Takuya Niikawa, Takao Tanabe, Takashi Tachimura, Takeshi Wada
Determination of the vocal tract spectrum from the articulatory movements based on the search of an articulatory-acoustic database
Tokihiko Kaburagi, Masaaki Honda
An MRI study on the relationship between oral cavity shape and larynx position
Kiyoshi Honda, Mark K. Tiede
Acoustic-articulatory evaluation of the upper vowel-formant region and its presumed speaker-specific potency
Frantz Clermont, Parham Mokhtari
Control of larynx height in vowel production
Philip Hoole, Christian Kroos
Analyzing the effect of secondary excitations of the vocal tract on vocal intensity in different loudness conditions
Paavo Alku, Juha Vintturi, Erkki Vilkman
An analysis of modal coupling effects during the glottal cycle: formant synthesizers from time-domain finite-difference simulations
Gordon Ramsay
Laryngoscopic analysis of pharyngeal articulations and larynx-height voice quality settings
John H. Esling
Effects of shapes of radiational aperture on radiation characteristics
Hiroki Matsuzaki, Kunitoshi Motoki, Nobuhiro Miki
An electropalatographic, kinematic, and acoustic analysis of supralaryngeal correlates of word-level prominence contrasts in English
Jonathan Harrington, Mary E. Beckman, Janet Fletcher, Sallyanne Palethorpe
Consistencies and inconsistencies between EPG and locus equation data on coarticulation
Marija Tabain
Synergy between jaw and lips/tongue movements: consequences in articulatory modelling
Gérard Bailly, Pierre Badin, Anne Vilain
Modelling tongue configuration in German vowel production
Philip Hoole
Optopalatograph: real-time feedback of tongue movement in 3D
Alan A. Wrench, Alan D. McIntosh, Colin Watson, William J. Hardcastle
Effects of contrastive focal accent on linguopalatal articulation and coarticulation in the French [kskl] cluster
Yohann Meynadier, Michel Pitermann, Alain Marchal
Acoustic and affective qualities of IDS in English
Christine Kitamura, Denis Burnham
Acoustic qualities of IDS and ADS in Thai
Chayada Thanavisuth, Sudaporn Luksaneeyanawin
Pragmatic characteristics of infant directed speech
Sudaporn Luksaneeyanawin, Chayada Thanavisuth, Suthasinee Sittigasorn, Onwadee Rukkarangsarit
Are you my little pussy-cat? Acoustic, phonetic and affective qualities of infant- and pet-directed speech
Denis Burnham, Elizabeth Francis, Ute Vollmer-Conna, Christine Kitamura, Vicky Averkiou, Amanda Olley, Mary Nguyen, Cal Paterson
Special speech registers: talking to Australian and Thai infants, and to pets
Denis Burnham
A very low bit rate speech coder using HMM with speaker adaptation
Takashi Masuko, Keiichi Tokuda, Takao Kobayashi
ITU-T G.729 extension at 6.4 kbps
E. Ekudden, R. Hagen, B. Johansson, S. Hayashi, A. Kataoka, S. Kurihara
Adaptive transformation for segmented parametric speech coding
Damith J. Mudugamuwa, Alan B. Bradley
Speech enhancement using STC-based bandwidth extension
Julien Epps, W. Harvey Holmes
Performance and optimization of the SEEVOC algorithm
Weihua Zhang, W. Harvey Holmes
Towards a unified model for low bit-rate speech coding using a recognition-synthesis approach
Wendy J. Holmes
On the significance of temporal masking in speech coding
Jan Skoglund, W. Bastiaan Kleijn
Waveform interpolation coding with pitch-spaced subbands
W. Bastiaan Kleijn, Huimin Yang, Ed F. Deprettere
An improved decomposition method for WI using IIR wavelet filter banks
Nicola R. Chong, Ian S. Burnett, Joe F. Chicharo
A new linear predictive method for compression of speech signals
Paavo Alku, Susanna Varho
Hierarchical temporal decomposition: a novel approach to efficient compression of spectral characteristics of speech
Shahrokh Ghaemmaghami, Mohamed Deriche, Sridha Sridharan
Speech intelligibility testing for new technologies
Susan L. Hura
Efficient quantization of LSF parameters based on temporal decomposition
Sung Joo Kim, Sangho Lee, Woo Jin Han, Yung Hwan Oh
A sinusoidal harmonic vocoder at 1.2 kbps using auditory perceptual characteristics
Minoru Kohata
A 16 kbit/s wideband CELP coder using mel-generalized cepstral analysis and its subjective evaluation
Kazuhito Koishida, Gou Hirabayashi, Keiichi Tokuda, Takao Kobayashi
Comparison of spectral estimation techniques for low bit-rate speech coding
D. J. Molyneux, C. I. Parris, X. Q. Sun, B. M. G. Cheetham
Low bit rate coding for speech and audio using mel linear predictive coding (MLPC) analysis
Yoshihisa Nakatoh, Takeshi Norimatsu, Ah Heng Low, Hiroshi Matsumoto
Comparison study on VQ codevector index assignment
Jeng-Shyang Pan, Chin-Shiuh Shieh, Shu-Chuan Chu
Using linguistic knowledge to improve the design of low-bit rate LSF quantisation
John J. Parry, Ian S. Burnett, Joe F. Chicharo
Transform coding of LSF parameters using wavelets
Davor Petrinovic
Source controlled variable bit-rate speech coder based on waveform interpolation
F. Plante, B. M. G. Cheetham, D. Marston, P. A. Barrett
Improving speaker recognisability in phonetic vocoders
Carlos M. Ribeiro, Isabel M. Trancoso
A comparison of Thai speech recognition systems using hidden Markov model, neural network, and fuzzy-neural network
Visarut Ahkuputra, Somchai Jitapunkul, Nutthacha Jittiwarangkul, Ekkarit Maneenoi, Sawit Kasuriya
Phoneme recognition with statistical modeling of the prediction error of neural networks
Felix Freitag, Enric Monte
Neural network based pronunciation modeling with applications to speech recognition
Toshiaki Fukada, Takayoshi Yoshimura, Yoshinori Sagisaka
A comparative study of OCON and MLP architectures for phoneme recognition
Stephen J. Haskey, Sekharajit Datta
Evaluation and integration of neural-network training techniques for continuous digit recognition
John-Paul Hosom, Ronald A. Cole, Piero Cosi
Hierarchical neural networks (HNN) for Chinese continuous speech recognition
Ying Jia, Limin Du, Ziqiang Hou
Neural network motivation for segmental distribution
Eric Keller
Combining connectionist multi-band and full-band probability streams for speech recognition of natural numbers
Nikki Mirghafori, Nelson Morgan
Initial speech recognition results using the multinet architecture
Ednaldo B. Pizzolato, T. Jeff Reynolds
Selection of the optimal structure of the continuous HMM using the genetic algorithm
Tomio Takara, Yasushi Iha, Itaru Nagayama
A proposed decision rule for speaker recognition based on fuzzy c-means clustering
Dat Tran, Michael Wagner, Tu Van Le
Fuzzy Gaussian mixture models for speaker recognition
Dat Tran, Tu Van Le, Michael Wagner
A new strategy of fuzzy-neural network for Thai numeral speech recognition
Chai Wutiwiwatchai, Somchai Jitapunkul, Visarut Ahkuputra, Ekkarit Maneenoi, Sudaporn Luksaneeyanawin
Thai polysyllabic word recognition using fuzzy-neural network
Chai Wutiwiwatchai, Somchai Jitapunkul, Visarut Ahkuputra, Ekkarit Maneenoi, Sudaporn Luksaneeyanawin
Modular neural networks for low-complex phoneme recognition
Axel Glaeser
Global optimisation of neural network models via sequential sampling-importance resampling
Joao F. G. de Freitas, Sue E. Johnson, Mahesan Niranjan, Andrew H. Gee
Efficient computation of MMI neural networks for large vocabulary speech recognition systems
Jörg Rottland, Andre Ludecke, Gerhard Rigoll
Modular connectionist systems for identifying complex Arabic phonetic features
Sid-Ahmed Selouani, Jean Caelen
Fuzzy-integration based normalization for speaker verification
Tuan Pham, Michael Wagner
Improving the generalization performance of the MCE/GPD learning
Hiroshi Shimodaira, Jun Rokui, Mitsuru Nakai
Acoustic speech recognition model by neural net equation with competition and cooperation
Tetsuro Kitazoe, Tomoyuki Ichiki, Sung-Ill Kim
Improved surname pronunciations using decision trees
Julie Ngan, Aravind Ganapathiraju, Joseph Picone
Word verification using confidence measures in speech recognition
M. Carmen Benitez, Antonio Rubio, Pedro García, Jesus Diaz-Verdejo
Improving posterior based confidence measures in hybrid HMM/ANN speech recognition systems
Giulia Bernardis, Hervé Bourlard
Two-pass utterance verification algorithm for long natural numbers recognition
Javier Caminero, Eduardo López, Luis A. Hernández
A*-admissible key-phrase spotting with sub-syllable level utterance verification
Berlin Chen, Hsin-Min Wang, Lee-Feng Chien, Lin-Shan Lee
Speaker-independent upfront dialect adaptation in a large vocabulary continuous speech recognizer
Volker Fischer, Yuqing Gao, Eric Janke
Word-based acoustic confidence measures for large-vocabulary speech recognition
Asela Gunawardana, Hsiao-Wuen Hon, Li Jiang
Improved utterance rejection using length dependent thresholds
Sunil K. Gupta, Frank K. Soong
Bayesian constrained frequency warping HMMs for speaker normalisation
Ching Hsiang Ho, Saeed Vaseghi, Aimin Chen
An evaluation of keyword spotting performance utilizing false alarm rejection based on prosodic information
Masaki Ida, Ryuji Yamasaki
Predictive speaker adaptation and its prior training
Dieu Tran, Ken-ichi Iso
Powerful syllabic fillers for general-task keyword-spotting and unlimited-vocabulary continuous-speech recognition
Rachida El Méliani, Douglas O'Shaughnessy
Confidence scoring for speech understanding systems
Christine Pao, Philipp Schmid, James R. Glass
Phonological rules for enhancing acoustic enrollment of unknown words
Bhuvana Ramabhadran, Abraham Ittycheriah
Recognition-based word counting for reliable barge-in and early endpoint detection in continuous speech recognition
Anand R. Setlur, Rafid A. Sukkar
Linear discriminant - a new criterion for speaker normalization
Martin Westphal, Tanja Schultz, Alex Waibel
Confidence measures derived from an acceptor HMM
Gethin Williams, Steve Renals
Telephone speech multi-keyword spotting using fuzzy search algorithm and prosodic verification
Chung-Hsien Wu, Yeou-Jiunn Chen, Yu-Chun Hung
Topic recognition for news speech based on keyword spotting
Yoichi Yamashita, Toshikatsu Tsunekawa, Riichiro Mizoguchi
Heads and tails in word perception: evidence for 'early-to-late' processing in listening and reading
Sieb G. Nooteboom, Meinou van Dijk
Evidence for early effects of sentence context on word segmentation
Saskia te Riele, Hugo Quené
Assimilation and anticipation in word perception
Hugo Quené, Maya van Rossum, Mieke van Wijck
Lexical activation by assimilated and reduced tokens
M. Louise Kelly, Ellen Gurman Bard, Catherine Sotillo
Fundamental frequency fluctuation in continuous vowel utterance and its perception
Masato Akagi, Mamoru Iwaki, Tomoya Minakawa
Estimation of mental lexicon size with word familiarity database
Shigeaki Amano, Tadahisa Kondo
Vowel quality in spontaneous speech: what makes a good vowel?
Matthew Aylett, Alice Turk
Cooperation and competition of burst and formant transitions for the perception and identification of French stops
Adrian Neagu, Gérard Bailly
The effect of modifying formant amplitudes on the perception of French vowels generated by copy synthesis
Anne Bonneau, Yves Laprie
Segmental and tonal processing in Cantonese
Hsuan-Chih Chen, Michael C. W. Yip, Sum-Yin Wong
Phonological similarity effects in Cantonese spoken-word processing
Michael C. W. Yip, Po-Yee Leung, Hsuan-Chih Chen
On the learnability of the voicing contrast for initial stops
Bob I. Damper, Steve R. Gunn
Acoustic and perceptual characteristic of Italian stop consonants
Loredana Cerrato, Mauro Falcone
Acoustic cues for the auditory identification of the Spanish fricative /f/
Santiago Fernández, Sergio Feijóo, Ramon Balsa, Nieves Barros
Recognition of vowels in fricative context
Santiago Fernández, Sergio Feijóo, Ramon Balsa, Nieves Barros
Voicing affects perceived manner of articulation
Santiago Fernández, Sergio Feijóo, Plinio Almeida
Enhancement techniques to improve the intelligibility of consonants in noise: speaker and listener effects
Valerie Hazan, Andrew Simpson, Mark Huckvale
Boundaries of perception of long tones in Taiwanese speech
Fran H. L. Jian
Effects of phonetic quality and duration on perceptual acceptability of temporal changes in speech
Hiroaki Kato, Minoru Tsuzaki, Yoshinori Sagisaka
Dynamic vs. static spectral detail in the perception of gated stops
Michael Kiefte, Terrance M. Nearey
Phonological units in speech segmentation and phonological awareness
Takashi Otake, Kiyoko Yoneyama
How far do speakers back up in repairs? A quantitative model
Elizabeth Shriberg, Andreas Stolcke
Don't blame it (all) on the pause: further ERP evidence for a prosody-induced garden-path in running speech
Karsten Steinhauer, Kai Alter, Angela D. Friederici
The role of stress for lexical selection in Dutch
Jean Vroomen, Beatrice de Gelder
The perception of stressed syllables in Finnish
Jyrki Tuomainen, Jean Vroomen, Beatrice de Gelder
The perception of the morae with devocalized vowels in Japanese language
Kimiko Yamakawa, Ryoji Baba
Categorical perception: important phenomenon or lasting myth?
Dominic W. Massaro
Categorical perception of vowels
Ellen Gerrits, Bert Schouten
Suprasegmental cues for the segmentation of identical vowel sequences in Japanese
Kazuhiko Kakehi, Yuki Hirose
Perception of concurrent approximant-vowel syllables
William A. Ainsworth
Perceived Swedish vowel quantity: effects of postvocalic consonant duration
Dawn M. Behne, Peter E. Czigler, Kirk P. H. Sullivan
Orthografik inkoncistensy ephekts in foneme detektion?
Anne Cutler, Rebecca Treiman, Brit van Ooijen
The effect of orthographic knowledge on the segmentation of speech
Bruce L. Derwing, Terrance M. Nearey, Yeo Bom Yoon
Spotting (different types of) words in (different types of) context
James M. McQueen, Anne Cutler
Correlation between consonantal VC transitions and degree of perceptual confusion of place contrast in Hindi
Manjari Ohala, John J. Ohala
Perception of tonal rises and falls for accentuation and phrasing in Swedish
David House, Dik Hermes, Frédéric Beaugendre
Speech intelligibility derived from exceedingly sparse spectral information
Steven Greenberg, Takayuki Arai, Rosaria Silipo
Adults with a severe-to-profound hearing impairment. Investigating the effects of linguistic context on speech perception
Mark C. Flynn, Richard C. Dowell, Graeme M. Clark
Speech perception in dyslexia: measurements from birth onwards
Florien J. Koopmans-van Beinum, Caroline E. Schwippert, Cecile T. L. Kuijpers
An acoustic analysis of vowel production across tasks in a case of non-fluent progressive aphasia
Karen Croot
Speech technology in clinical environments
Jan van Doorn, Sharynne McLeod, Elise Baker, Alison Purcell, William Thorpe
GALAXY-II: a reference architecture for conversational system development
Stephanie Seneff, Ed Hurley, Raymond Lau, Christine Pao, Philipp Schmid, Victor Zue
Improvements in speech understanding accuracy through the integration of hierarchical linguistic, prosodic, and phonological constraints in the JUPITER domain
Grace Chung, Stephanie Seneff
Towards robust methods for spoken document retrieval
Kenney Ng
Automatic ambiguity detection
Richard Sproat, Jan P. H. van Santen
Empowering knowledge based speech understanding through statistics
Julia Fischer, Juergen Haas, Elmar Nöth, Heinrich Niemann, Frank Deinzer
Concept-driven speech understanding incorporated with a statistic language model
Akito Nagai, Yasushi Ishikawa
On the limitations of stochastic conceptual finite-state language models for speech understanding
José Colás, Javier Ferreiros, Juan Manuel Montero, Julio Pastor, Ascensión Gallardo, José Manuel Pardo
Towards speech understanding across multiple languages
Todd Ward, Salim Roukos, Chalapathy Neti, Jerome Gros, Mark Epstein, Satya Dharanipragada
Automatic detection of sentence boundaries and disfluencies based on recognized words
Andreas Stolcke, Elizabeth Shriberg, Rebecca Bates, Mari Ostendorf, Dilek Hakkani, Madelaine Plauche, Gokhan Tur, Yu Lu
Language modeling for content extraction in human-computer dialogues
Wolfgang Reichl, Bob Carpenter, Jennifer Chu-Carroll, Wu Chou
A language model combining trigrams and stochastic context-free grammars
John Gillett, Wayne Ward
Online adaptation of language models in spoken dialogue systems
Bernd Souvignier, Andreas Kellner
Language model adaptation for spoken language systems
Giuseppe Riccardi, Alexandros Potamianos, Shrikanth Narayanan
Detecting topic shifts using a cache memory
Brigitte Bigi, Renato De Mori, Marc El-Beze, Thierry Spriet
A discourse coding scheme for conversational Spanish
Lori Levin, Ann Thyme-Gobbel, Alon Lavie, Klaus Ries, Klaus Zechner
Grammar fragment acquisition using syntactic and semantic clustering
Kazuhiro Arai, Jeremy H. Wright, Giuseppe Riccardi, Allen L. Gorin
Non-expert access to unification based speech understanding
Tom Brøndsted
Natural language call routing: a robust, self-organizing approach
Bob Carpenter, Jennifer Chu-Carroll
Automatic grammar induction from semantic parsing
Debajit Ghosh, David Goddeau
BTH: an efficient parsing algorithm for word-spotting
Yasuyuki Kono, Takehide Yano, Munehiko Sasajima
Syntax coordination: interaction of discourse and extrapositions
Susanne Kronenberg, Franz Kummert
Hierarchical tag-graph search for spontaneous speech understanding in spoken dialog systems
Bor-Shen Lin, Berlin Chen, Hsin-Min Wang, Lin-Shan Lee
Extraction of the dialog act and the topic from utterances in a spoken dialog system
Yasuhisa Niimi, Noboru Takinaga, Takuya Nishimoto
Fast computation of maximum entropy / minimum divergence feature gain
Harry Printz
Stochastic language models for speech recognition and understanding
Giuseppe Riccardi, Allen L. Gorin
Linguistically engineered tools for speech recognition error analysis
Carol Van Ess-Dykema, Klaus Ries
Estimating entropy of a language from optimal word insertion penalty
Kazuya Takeda, Atsunori Ogawa, Fumitada Itakura
A linguistic analysis of repair signals in co-operative spoken dialogues
Shu-Chuan Tseng
A hierarchical language model for CSR
Francisco J. Valverde-Albacete, José Manuel Pardo
Spoken language understanding within dialogs using a graphical model of task structure
Jeremy H. Wright, Allen L. Gorin, Alicia Abella
Keyword extraction of radio news using domain identification based on categories of an encyclopedia
Yoshimi Suzuki, Fumiyo Fukumoto, Yoshihiro Sekiguchi
Maximum a posteriori pitch tracking
James Droppo, Alex Acero
Vowel separation using the reassigned amplitude-modulation spectrum
Dekun Yang, Georg F. Meyer, William A. Ainsworth
Feature decorrelation methods in speech recognition. A comparative study
Eloi Batlle, Climent Nadeu, José A.R. Fonollosa
Multi-resolution for speech analysis
Marie-José Caraty, Claude Montacié
Dynamic features in children's vowels
Steve Cassidy, Catherine Watson
Effectiveness of phase-corrected RASTA for continuous speech recognition
Johan de Veth, Louis Boves
Techniques for capturing temporal variations in speech signals with fixed-rate processing
Satya Dharanipragada, Ramesh A. Gopinath, Bhaskar D. Rao
Automatic detection of landmark for nasal consonants from speech waveform
Limin Du, Kenneth N. Stevens
Plug and play software for designing high-level speech processing systems
Thierry Dutoit, Juergen Schroeter
Creating speaker independent HMM models for restricted database using STRAIGHT-TEMPO morphing
Alexandre Girardi, Kiyohiro Shikano, Satoshi Nakamura
Restoration of hyperbaric speech by correction of the formants and the pitch
Laure Charonnat, Michel Guitton, Joel Crestel, Gerome Allée
Voice conversion based on parameter transformation
Juana M. Gutierrez-Arriola, Yung-Sheng Hsiao, Juan Manuel Montero, José Manuel Pardo, Donald G. Childers
Noise robust two-stream auditory feature extraction method for speech recognition
Jilei Tian, Ramalingam Hariharan, Kari Laurila
Heterogeneous measurements and multiple classifiers for speech recognition
Andrew K. Halberstadt, James R. Glass
Joint recognition and segmentation using phonetically derived features and a hybrid phoneme model
Naomi Harte, Saeed Vaseghi, Ben Milner
TRAPS - classifiers of temporal patterns
Hynek Hermansky, Sangita Sharma
Robust measurement of fundamental frequency and degree of voicing
John N. Holmes
Micropower electro-magnetic sensors for speech characterization, recognition, verification, and other applications
John F. Holzrichter, Gregory C. Burnett, Todd J. Gable, Lawrence C. Ng
Robust entropy-based endpoint detection for speech recognition in noisy environments
Jia-Lin Shen, Jeih-Weih Hung, Lin-Shan Lee
Statistical integration of temporal filter banks for robust speech recognition using linear discriminant analysis (LDA)
Jia-Lin Shen, Wen-Liang Hwang
Feature-based approach to speech recognition
Dorota J. Iskra, William H. Edmondson
Periodicity emphasis of voice wave using nonlinear IIR digital filters and its applications
Hiroyuki Kamata, Akira Kaneko, Yoshihisa Ishida
Speech recognition via phonetically featured syllables
Simon King, Todd Stephenson, Stephen Isard, Paul Taylor, Alex Strachan
Do phonetic features help to improve consonant identification in ASR?
Jacques Koreman, Bistra Andreeva, William J. Barry
Perceptual and acoustic properties of phonemes in continuous speech for different speaking rate
Hisao Kuwabara
On robust sequential estimator based on t-distribution with forgetting factor for speech analysis
Joohun Lee, Ki Yong Lee
Discriminant wavelet basis construction for speech recognition
Christopher John Long, Sekharajit Datta
An efficient mel-LPC analysis method for speech recognition
Hiroshi Matsumoto, Yoshihisa Nakatoh, Yoshinori Furuhata
Discriminative weighting of multi-resolution sub-band cepstral features for speech recognition
Philip McMahon, Paul McCourt, Saeed Vaseghi
Separation of singing and piano sounds
Yoram Meron, Keikichi Hirose
Modeling of variations in cepstral coefficients caused by F0 changes and its application to speech processing
Nobuaki Minematsu, Seiichi Nakagawa
A detection framework for locating phonetic events
Partha Niyogi, Partha Mitra, Man Mohan Sondhi
On frequency averaging for spectral analysis in speech recognition
Climent Nadeu, Felix Galindo, Jaume Padrell
Wavelet transform domain blind equalization and its application to speech analysis
Munehiro Namba, Yoshihisa Ishida
A novel method of formant analysis and glottal inverse filtering
Steve Pearson
Vector quantizer acceleration for an automatic speech recognition application
Antonio J. Araujo, Vitor C. Pera, Marcio N. Souza
Local speech rate as a combination of syllable and phone rate
Hartmut R. Pfitzinger
Recovering gestures from speech signals: a preliminary study for nasal vowels
Solange Rossato, Gang Feng, Rafael Laboissiere
Extended linear discriminant analysis (ELDA) for speech recognition
Guenther Ruske, Robert Faltlhauser, Thilo Pfau
Speech, silence, music and noise classification of TV broadcast material
Ara Samouelian, Jordi Robert-Ribes, Mike Plumpe
The relation between vocal tract shape and formant frequencies can be described by means of a system of coupled differential equations
Jean Schoentgen, Alain Soquet, Véronique Lecuit, Sorin Ciocea
Improving speech recognizer by broader acoustic-phonetic group classification
Youngjoo Suh, Kyuwoong Hwang, Oh-Wook Kwon, Jun Park
Separation of speech source and filter by time-domain deconvolution
C. William Thorpe
On the application of the AM-FM model for the recovery of missing frequency bands of telephone speech
Hesham Tolba, Douglas O'Shaughnessy
Estimation of voice source and vocal tract parameters using combined subspace-based and amplitude spectrum-based algorithm
Chang-Sheng Yang, Hideki Kasuya
The distance measure for line spectrum pairs applied to speech recognition
Fang Zheng, Zhanjiang Song, Ling Li, Wenjian Yu, Fengzhou Zheng, Wenhu Wu
Improving pitch estimation with short duration speech samples
William A. Ainsworth, Charles R. Day, Georg F. Meyer
An instantaneous-frequency-based pitch extraction method for high-quality speech transformation: revised TEMPO in the STRAIGHT-suite
Hideki Kawahara, Alain de Cheveigne, Roy D. Patterson
Speaker-independent speech recognition using micro segment spectrum integration
Kiyoaki Aikawa
On robust speech analysis based on time-varying complex AR model
Keiichi Funaki, Yoshikazu Miyanaga, Koji Tochinai
Spectral basis functions from discriminant analysis
Hynek Hermansky, Narendranath Malayath
Determination of articulatory positions from speech acoustics by applying dynamic articulatory constraints
Shin Suzuki, Takesi Okadome, Masaaki Honda
Recognizing emotions in speech using short-term and long-term features
Yang Li, Yunxin Zhao
Periphear: a nonlinear active model of the auditory periphery
Arnaud Robert, Jan Eriksson
The voicing feature for stop consonants: acoustic phonetic analyses and automatic speech recognition experiments
Padma Ramesh, Partha Niyogi
Wavelet-based energy binning cepstral features for automatic speech recognition
Sankar Basu, Stéphane Maes
Articulatory analysis using a codebook for articulatory based low bit-rate speech coding
Carlos Silva, Samir Chennoukh
The modeling and realization of natural speech generation system
Chen Fang, Yuan Baozong
Ko tok ples ensin bilong Tok Pisin or the TP-CLE: a first report from a pilot speech-to-speech translation project from Swedish to Tok Pisin
Robert Eklund
An iterative, DP-based search algorithm for statistical machine translation
Ismael García-Varea, Francisco Casacuberta, Hermann Ney
Information extraction and text generation of news reports for a Swedish-English bilingual spoken dialogue system
Barbara Gawronska, David House
Utterance generation for transaction dialogues
Joris Hulstijn, Arjan van Hessen
Example-based error recovery method for speech translation: repairing sub-trees according to the semantic distance
Kai Ishikawa, Eiichiro Sumita, Hitoshi Iida
Context sensitive generation of descriptions
Emiel Krahmer, Mariet Theune
An interlingua based on domain actions for machine translation of task-oriented dialogues
Lori Levin, Donna Gates, Alon Lavie, Alex Waibel
Generating pitch accents in a concept-to-speech system using a knowledge base
Sandra Williams
Making the most of multiplicity: a multi-parser multi-strategy architecture for the robust processing of spoken language
Tobias Ruland, C. J. Rupp, Jörg Spilker, Hans Weber, Karsten L. Worm
Natural-sounding speech synthesis using variable-length units
Jon R. W. Yi, James R. Glass
A generic algorithm for generating spoken monologues
Esther Klabbers, Emiel Krahmer, Mariet Theune
On the use of automatically generated discourse-level information in a concept-to-speech synthesis system
Janet Hitzeman, Alan W. Black, Paul Taylor, Chris Mellish, Jon Oberlander
Learning phrase-based head transduction models for translation of spoken utterances
Hiyan Alshawi, Srinivas Bangalore, Shona Douglas
Probabilistic dialogue act extraction for concept based multilingual translation systems
Toshiaki Fukada, Detlef Koll, Alex Waibel, Kouichi Tanigaki
Fast decoding for statistical machine translation
Ye-Yi Wang, Alex Waibel
A Japanese-to-English speech translation system: ATR-MATRIX
Toshiyuki Takezawa, Tsuyoshi Morimoto, Yoshinori Sagisaka, Nick Campbell, Hitoshi Iida, Fumiaki Sugaya, Akio Yokoo, Seiichi Yamamoto
Acoustic indicators of topic segmentation
Julia Hirschberg, Christine H. Nakatani
IViE - a comparative transcription system for intonational variation in English
Esther Grabe, Francis Nolan, Kimberley J. Farrar
Automatic segmental and prosodic labeling of Mandarin speech database
Fu-Chiang Chou, Chiu-Yu Tseng, Lin-Shan Lee
Automatic labelling of German prosody
Stefan Rapp
An efficient labeling tool for the Quicksig speech database
Matti Karjalainen, Toomas Altosaar, Miikka Huttunen
Collection and detailed transcription of a speech database for development of language learning technologies
Harry Bratt, Leonardo Neumeyer, Elizabeth Shriberg, Horacio Franco
Resegmentation of SWITCHBOARD
Neeraj Deshmukh, Aravind Ganapathiraju, Andi Gleeson, Jonathan Hamaker, Joseph Picone
Automatic generation of visual scenarios for spoken corpora acquisition
Demetrio Aiello, Cristina Delogu, Renato De Mori, Andrea Di Carlo, Marina Nisi, Silvia Tummeacciu
Automatic detection of semantic boundaries based on acoustic and lexical knowledge
Mauro Cettolo, Daniele Falavigna
A new fast algorithm for automatic segmentation of continuous speech
Iman Gholampour, Kambiz Nayebi
Acoustic nature and perceptual testing of corpora of emotional speech
Akemi Iida, Nick Campbell, Soichiro Iga, Fumito Higuchi, Michiaki Yasumura
Korean prosodic break index labelling by a new mixed method of LDA and VQ
Pyungsu Kang, Jiyoung Kang, Jinyoung Kim
MOOSE: management of Otago speech environment
Mark Laws, Richard Kilgour
Phonetic alignment: speech synthesis based vs. hybrid HMM/ANN
Fabrice Malfrère, Olivier Deroo, Thierry Dutoit
Customisation and quality assessment of spoken language description
J. Bruce Millar
A silence/noise/music/speech splitting algorithm
Claude Montacié, Marie-José Caraty
Audio-visual segmentation for content-based retrieval
David Pye, Nicholas J. Hollinghurst, Timothy J. Mills, Kenneth R. Wood
Same news is good news: automatically collecting reoccurring radio news stories
Stefan Rapp, Grzegorz Dogil
An annotation system for melodic aspects of German spontaneous speech
Christel Brindöpke, Brigitte Schaffranietz
Additional use of phoneme duration hypotheses in automatic speech segmentation
Karlheinz Stöber, Wolfgang Hess
Towards a minimal standard for dialogue transcripts: a new SGML architecture for the HCRC map task corpus
Amy Isard, David McKelvie, Henry S. Thompson
A recursive algorithm for the forced alignment of very long audio segments
Pedro J. Moreno, Chris Joerg, Jean-Manuel Van Thong, Oren Glickman
The selection of pronunciation variants: comparing the performance of man and machine
Judith M. Kessens, Mirjam Wester, Catia Cucchiarini, Helmer Strik
Acoustic confidence measures for segmenting broadcast news
Jon Barker, Gethin Williams, Steve Renals
A duration-based confidence measure for automatic segmentation of noise corrupted speech
Bryan L. Pellom, John H. L. Hansen
Segmentation and classification of broadcast news audio
Thomas Hain, Philip C. Woodland
Speaker recruitment methods and speaker coverage - experiences from a large multilingual speech database collection
Børge Lindberg, Robrecht Comeyne, Christoph Draxler, Francesco Senia
A multilingual prosodic database
Estelle Campione, Jean Véronis
The CSLU speaker recognition corpus
Ronald A. Cole, Mike Noel, Victoria Noel
How effective is unsupervised data collection for children's speech recognition?
Gregory Aist, Peggy Chan, Xuedong Huang, Li Jiang, Rebecca Kennedy, DeWitt Latimer, Jack Mostow, Calvin Yeung
An algorithm for automatic generation of Mandarin phonetic balanced corpus
Jyh-Shing Shyuu, Wang Jhing-Fa
Towards a formal framework for linguistic annotations
Steven Bird, Mark Liberman
Forming generic models of speech for uniform database access
Toomas Altosaar, Martti Vainio
Real-time recognition of broadcast news
Gary Cook, Tony Robinson, James Christie
Automatic recognition of Korean broadcast news speech
Ha-Jin Yu, Hoon Kim, Jae-Seung Choi, Joon-Mo Hong, Kew-Suh Park, Jong-Seok Lee, Hee-Youn Lee
Telephone-based conversational speech recognition in the JUPITER domain
James R. Glass, Timothy J. Hazen
Japanese large-vocabulary continuous speech recognition system based on Microsoft Whisper
Hsiao-Wuen Hon, Yun-Cheng Ju, Keiko Otani
Partitioning and transcription of broadcast news data
Jean-Luc Gauvain, Lori F. Lamel, Gilles Adda
Grammatical word graph re-generation for spontaneous speech recognition
Hajime Tsukada, Hirofumi Yamamoto, Toshiyuki Takezawa, Yoshinori Sagisaka
Compression algorithm of trigram language models based on maximum likelihood estimation
Norimichi Yodo, Kiyohiro Shikano, Satoshi Nakamura
Morphological modeling of word classes for language models
Ulla Uebler, Heinrich Niemann
A comparative study between polyclass and multiclass language models
Imed Zitouni, Kamel Smaili, Jean-Paul Haton, Sabine Deligne, Frédéric Bimbot
Log-linear interpolation of language models
Dietrich Klakow
The applicability of adaptive language modelling for the broadcast news task
Philip Clarkson, Tony Robinson
The BBN single-phonetic-tree fast-match algorithm
Long Nguyen, Richard Schwartz
An efficient two-pass search algorithm using word trellis index
Akinobu Lee, Tatsuya Kawahara, Shuji Doshita
Nozomi - a fast, memory-efficient stack decoder for LVCSR
Mike Schuster
Reducing the OOV rate in broadcast news speech recognition
Thomas Kemp, Alex Waibel
Using automatically-derived acoustic sub-word units in large vocabulary speech recognition
Michiel Bacchiani, Mari Ostendorf
Fabricating conversational speech data with acoustic models: a program to examine model-data mismatch
Don McAllaster, Lawrence Gillick, Francesco Scattone, Michael Newman
High resolution decision tree based acoustic modeling beyond CART
Wu Chou, Wolfgang Reichl
Unsupervised training of a speech recognizer using TV broadcasts
Thomas Kemp, Alex Waibel
A new method to achieve fast acoustic matching for speech recognition
Clark Z. Lee, Douglas O'Shaughnessy
Improved parameter tying for efficient acoustic model evaluation in large vocabulary continuous speech recognition
Jacques Duchateau, Kris Demuynck, Dirk Van Compernolle, Patrick Wambacq
A new look at HMM parameter tying for large vocabulary speech recognition
Ananth Sankar
Factor analysis invariant to linear transformations of data
Ramesh A. Gopinath, Bhuvana Ramabhadran, Satya Dharanipragada
A thesaurus-based statistical language model for broadcast news transcription
Akio Ando, Akio Kobayashi, Toru Imai
Effect of task complexity on search strategies for the Motorola Lexicus continuous speech recognition system
Sreeram V. Balakrishnan
New features for confidence annotation
Dhananjay Bansal, Mosur K. Ravishankar
Multi-Span statistical language modeling for large vocabulary speech recognition
Jerome R. Bellegarda
Maximum-likelihood updates of HMM duration parameters for discriminative continuous speech recognition
Rathinavelu Chengalvarayan
Towards better integration of semantic predictors in statistical language modeling
Noah Coccaro, Daniel Jurafsky
An asymmetric stochastic language model based on multi-tagged words
Julio Pastor, José Colás, Ruben San-Segundo, José Manuel Pardo
Product-code vector quantization of cepstral parameters for speech recognition over the WWW
Vassilis Digalakis, Leonardo Neumeyer, Manolis Perakakis
Context dependent tree based transforms for phonetic speech recognition
Bernard Doherty, Saeed Vaseghi, Paul McCourt
Interfacing acoustic models with natural language processing systems
Michael T. Johnson, Mary P. Harper, Leah H. Jamieson
Hierarchical cluster language modeling with statistical rule extraction for rescoring n-best hypotheses during speech decoding
Photina Jaeyoun Jang, Alexander G. Hauptmann
Dealing with out-of-vocabulary words and speech disfluencies in an n-gram based speech understanding system
Atsuhiko Kai, Yoshifumi Hirose, Seiichi Nakagawa
Source-extended language model for large vocabulary continuous speech recognition
Tetsunori Kobayashi, Yosuke Wada, Norihiko Kobayashi
Time dependent language model for broadcast news transcription and its post-correction
Akio Kobayashi, Kazuo Onoe, Toru Imai, Akio Ando
Exploiting transitions and focussing on linguistic properties for ASR
Jacques Koreman, William J. Barry, Bistra Andreeva
A unified framework for sublexical and linguistic modelling supporting flexible vocabulary speech understanding
Raymond Lau, Stephanie Seneff
A method for modeling liaison in a speech recognition system for French
Lalit R. Bahl, S. De Gennaro, P. De Souza, E. Epstein, J.M. Le Roux, B. Lewis, C. Waast
On variable sampling frequencies in speech recognition
Fu-Hua Liu, Michael Picheny
Pronunciation modeling for large vocabulary conversational speech recognition
Kristine Ma, George Zavaliagkos, Rukmini Iyer
Time shift invariant speech recognition
Sankar Basu, Abraham Ittycheriah, Stéphane Maes
The demiphone versus the triphone in a decision-tree state-tying framework
José B. Mariño, Pau Paches-Leal, Albino Nogueiras
Word clustering for a word bi-gram model
Shinsuke Mori, Masafumi Nishimura, Nobuyasu Itoh
A large vocabulary continuous speech recognition hybrid system for the Portuguese language
Joao P. Neto, Ciro Martins, Luis B. Almeida
Speech recognition performance on a new voicemail transcription task
Mukund Padmanabhan, Bhuvana Ramabhadran, Sankar Basu
Grammatical and statistical word prediction system for Spanish integrated in an aid for people with disabilities
Sira Palazuelos, Santiago Aguilera, José Rodrigo, Juan Godino
Segmentation using a maximum entropy approach
Kishore Papineni, Satya Dharanipragada
Recognition performance of a large-scale dependency grammar language model
Adam Berger, Harry Printz
A bootstrap technique for building domain-dependent language models
Ganesh N. Ramaswamy, Harry Printz, Ponani S. Gopalakrishnan
Estimation of the probability distributions of stochastic context-free grammars from the k-best derivations
Joan-Andreu Sanchez, José-Miguel Benedi
Robust HMM estimation with Gaussian merging-splitting and tied-transform HMMs
Ananth Sankar
Nonlinear interpolation of topic models for language model adaptation
Kristie Seymore, Stanley Chen, Ronald Rosenfeld
Performance evaluation of word phrase and noun category language models for broadcast news speech recognition
Kazuyuki Takagi, Rei Oguro, Kenji Hashimoto, Kazuhiko Ozeki
Robust automatic continuous-speech recognition based on a voiced-unvoiced decision
Hesham Tolba, Douglas O'Shaughnessy
Double tree beam search using hierarchical subword units
Juan Carlos Torrecilla, Ismael Cortazar, Luis A. Hernández
Text segmentation and topic tracking on broadcast news via a hidden Markov model approach
Paul van Mulbregt, Ira Carp, Lawrence Gillick, Steve Lowe, Jon Yamron
Multi-phone strings as subword units for speech recognition
Philip O'Neill, Saeed Vaseghi, Bernard Doherty, Wooi Haw Tan, Paul McCourt
Phonetic modification of the syllable /tu/ in two spontaneous American English dialogues
Nanette M. Veilleux, Stefanie Shattuck-Hufnagel
Efficient lattice representation and generation
Fuliang Weng, Andreas Stolcke, Ananth Sankar
Modeling pronunciation variation for a Dutch CSR: testing three methods
Mirjam Wester, Judith M. Kessens, Helmer Strik
Comparison of language modelling techniques for Russian and English
Edward W. D. Whittaker, Philip C. Woodland
Optimized POS-based language models for large vocabulary speech recognition
Petra Witschel
Reducing peak search effort using two-tier pruning
Mark Wright, Simon Hovell, Simon Ringland
Using untranscribed training data to improve performance
George Zavaliagkos, Man-Hung Siu, Thomas Colthurst, Jayadev Billa
Telephone band LVCSR for hearing-impaired users
Ea-Ee Jan, Raimo Bakis, Fu-Hua Liu, Michael Picheny
Using x-gram for efficient speech recognition
Antonio Bonafonte, José B. Mariño
Sharable software repository for Japanese large vocabulary continuous speech recognition
Tatsuya Kawahara, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Katsunobu Itou, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano
The design of the newspaper-based Japanese large vocabulary continuous speech recognition corpus
Katunobu Itou, Mikio Yamamoto, Kazuya Takeda, Toshiyuki Takezawa, Tatsuo Matsuoka, Tetsunori Kobayashi, Kiyohiro Shikano, Shuichi Itahashi
Indexing and classification of TV news articles based on speech dictation using word bigram
Jun Ogata, Yasuo Ariki
Parametric trajectory mixtures for LVCSR
Man-Hung Siu, Rukmini Iyer, Herbert Gish, Carl Quillen
Steps toward the integration of speaker recognition in real-world telecom applications
Axel Glaeser, Frédéric Bimbot
A bimodal Korean address entry/retrieval system
Hyun-Yeol Chung, Cheol-Jun Hwang, Shi-Wook Lee
Usability evaluation of IVR systems with DTMF and ASR
Cristina Delogu, Andrea Di Carlo, Paolo Rotundi, Danilo Sartori
SALSA version 1.0: a speech-based web browser for Hong Kong English
Pascale Fung, Chi Shun Cheung, Kwok Leung Lam, Wai Kat Liu, Yuen Yee Lo
A language for creating speech applications
Andrew Pargellis, Qiru Zhou, Antoine Saad, Chin-Hui Lee
The use of automatic speech recognition to reduce the interference between concurrent tasks of driving and phoning
Robert Graham, Chris Carter, Brian Mellor
Interactive listening to structured speech content on the internet
Makoto J. Hirayama, Taro Sugahara, Zhiyong Peng, Junichi Yamazaki
MSF format for the representation of speech synchronized moving image
Cheol-Woo Jo
Effects of using speech in timetable information systems for WWW
Pernilla Qvarfordt, Arne Jonsson
The Interactive Systems Labs View4You video indexing system
Thomas Kemp, Petra Geutner, Michael Schmidt, Borislav Tomaz, Manfred Weber, Martin Westphal, Alex Waibel
SEMOLE: a robust framework for gathering information from the world wide web
Hyung-Jin Kim, Lee Hetherington
Enhancing a WIMP based interface with speech, gaze tracking and agents
Lau Bakman, Mads Blidegn, Martin Wittrup, Lars Bo Larsen, Thomas B. Moeslund
Now you hear it, now you don't: empirical studies of audio browsing behavior
Christine H. Nakatani, Steve Whittaker, Julia Hirschberg
A voice verifier for face/voice based person verification system
Rongyu Qiao, Youngkyu Choi, Johnson I. Agbinya
On the use of automatic speech recognition for TV captioning
Jordi Robert-Ribes
An undergraduate course on speech recognition based on the CSLU toolkit
Ben Serridge
Real time voice alteration based on linear prediction
Ping-Fai Yang, Yannis Stylianou
Evaluation and implementation of a voice-activated dialing system with utterance verification
Beng Tiong Tan, Yong Gu, Trevor Thomas
Towards a Mandarin voice memo system
Hsin-Min Wang, Bor-Shen Lin, Berlin Chen, Bo-Ren Bai
Development of CAI system employing synthesized speech responses
Tsubasa Shinozaki, Masanobu Abe
Using combined decisions and confidence measures for name recognition in automatic directory assistance systems
Andreas Kellner, Bernhard Rueber, Hauke Schramm
VPQ: a spoken language interface to large scale directory information
Bruce Buntschuh, Candace A. Kamm, Giuseppe Di Fabbrizio, Alicia Abella, Mehryar Mohri, Shrikanth Narayanan, I. Zeljkovic, R.D. Sharp, Jeremy H. Wright, S. Marcus, J. Shaffer, R. Duncan, J.G. Wilpon
SCAN - speech content based audio navigator: a system overview
John Choi, Don Hindle, Julia Hirschberg, Ivan Magrin-Chagnolleau, Christine H. Nakatani, Fernando Pereira, Amit Singhal, Steve Whittaker
Controlling a HIFI with a continuous speech understanding system
Javier Ferreiros, José Colás, Javier Macias-Guarasa, Alejandro Ruiz, José Manuel Pardo
User evaluation of the MASK kiosk
Lori F. Lamel, Samir Bennacef, Jean-Luc Gauvain, Hervé Dartigues, Jean-Noel Temem
Is speech the right thing for your application?
Niels Ole Bernsen, Laila Dybkjaer
A PC-based tool for helping in diagnosis of pathologic voice
Juan Ignacio Godino Llorente, Santiago Aguilera Navarro, Sira Palazuelos Cagigas, Alberto Nieto Altuzarra, Pedro Gomez Vilda
Web-based educational tools for speech technology
Kaare Sjölander, Jonas Beskow, Joakim Gustafson, Erland Lewin, Rolf Carlson, Björn Granström
Universal speech tools: the CSLU toolkit
Stephen Sutton, Ronald A. Cole, Jacques de Villiers, Johan Schalkwyk, Pieter Vermeulen, Michael W. Macon, Yonghong Yan, Edward Kaiser, Brian Rundle, Khaldoun Shobaki, John-Paul Hosom, Alex Kain, Johan Wouters, Dominic W. Massaro, Michael Cohen
Creating a Mexican Spanish version of the CSLU toolkit
Ben Serridge, Alejandro Barbosa, Ronald A. Cole, Nora Munive, Alcira Vargas
A voice user interface demonstration system for Mexican Spanish
Carmen García-Mateo, Qiru Zhou, Chin-Hui Lee, Andrew Pargellis
Non-native productions of Japanese single stops that are too long for one mora unit
Yasuyo Minagawa-Kawai, Shigeru Kiritani
The process of generation and development of second language Japanese accentuation
Nobuko Yamada
Perceptual properties of Russians with Japanese fricatives
Seiya Funatsu, Shigeru Kiritani
Assessment of Dutch pronunciation by means of automatic speech recognition technology
Catia Cucchiarini, Febe De Wet, Helmer Strik, Louis Boves
Phonetic-level mispronunciation detection in non-native Swedish speech
Philippe Langlais, Anne-Marie Öster, Björn Granström
Computer-based second language production training by using spectrographic representation and HMM-based speech recognition scores
Reiko Akahane-Yamada, Erik McDermott, Takahiro Adachi, Hideki Kawahara, John S. Pruitt
Spoken word identification by native and nonnative speakers of English: effects of training, modality, context and phonetic environment
Debra M. Hardison
The effect of background knowledge on first and second language comprehension difficulty
Michael D. Tyler
Comparison of cross-language coarticulation: English, Japanese and Japanese-accented English
Kimiko Tsukada
Plasticity of non-native phonetic perception and production: a training study
Satoshi Imaizumi, Hidemi Itoh, Yuji Tamekawa, Toshisada Deguchi, Koichi Mori
The relation between perceptual and production categories in acquisition
Ian Watson
The development of perceptual cue-weighting in children aged 6 to 12
Valerie Hazan, Sarah Barrett
Assimilation of place in Japanese and Dutch
Anne Cutler, Takashi Otake
Prosodic constraint on V-to-V coarticulation in Japanese
Yuko Kondo, Yumiko Arai
Postvocalic /r/-deletion in standard Dutch: how experimental phonology can profit from ASR technology
Catia Cucchiarini, Henk van den Heuvel
More evidence for the perceptual basis of sound change? suprasegmental effects in the development of distinctive nasalization
John Hajek, Ian Watson
Speech production of vowel sequences using a physiological articulatory model
Jianwu Dang, Kiyoshi Honda
Regional variation in the vowels of female adolescents from Sydney
Felicity Cox, Sallyanne Palethorpe
A kinematic analysis of New Zealand and Australian English vowel spaces
Catherine Watson, Jonathan Harrington, Sallyanne Palethorpe
Syllable-onset acoustic properties associated with syllable-coda voicing
Noel Nguyen, Sarah Hawkins
Articulatory, acoustic and perceptual aspects of fricative-stop coarticulation
Noel Nguyen, Alan A. Wrench, Fiona Gibbon, William J. Hardcastle
Efficiency as an organizing principle of natural speech
Rob J. J. H. van Son, Florien J. Koopmans-van Beinum, Louis C. W. Pols
Within-speaker variability due to speaking manners
Inger Karlsson, Tanja Bänziger, Jana Dankovicova, Tom Johnstone, Johan Lindberg, Haakan Melin, Francis Nolan, Klaus R. Scherer
Eigenvoices for speaker adaptation
Roland Kuhn, Patrick Nguyen, Jean-Claude Junqua, Lloyd Goldwasser, Nancy Niedzielski, Steven Fincke, Ken Field, Matteo Contolini
Speaker clustering using direct maximisation of the MLLR-adapted likelihood
Sue E. Johnson, Philip C. Woodland
Incremental on-line speaker adaptation in adverse conditions
Olli Viikki, Kari Laurila
Cluster adaptive training for speech recognition
Mark J. F. Gales
On-line hierarchical transformation of hidden Markov models for speaker adaptation
Jen-Tzung Chien
High-speed speaker adaptation using phoneme dependent tree-structured speaker clustering
Motoyuki Suzuki, Toshiaki Abe, Hiroki Mori, Shozo Makino, Hirotomo Aso
The use of confidence measures in unsupervised adaptation of speech recognizers
Tasos Anastasakos, Sreeram V. Balakrishnan
Speaker normalization with all-pass transforms
John McDonough, William Byrne, Xiaoqiang Luo
Toward on-line learning of Chinese continuous speech recognition system
Rong Zheng, Zuoying Wang
The CHAM model of hyperarticulate adaptation during human-computer error resolution
Sharon L. Oviatt
Bilingual and dialectal adaptation and retraining
Ulla Uebler, Michael Schüssler, Heinrich Niemann
Language independent and language adaptive large vocabulary speech recognition
Tanja Schultz, Alex Waibel
A method for measuring the intelligibility and nonnativeness of phone quality in foreign language pronunciation training
Goh Kawai, Keikichi Hirose
Speech perception and spoken language in children with impaired hearing
Peter Blamey, Julia Sarant, Tanya Serry, Roger Wales, Christopher James, Johanna Barry, Graeme M. Clark, M. Wright, R. Tooher, C. Psarros, G. Godwin, M. Rennie, T. Meskin
Quantitative assessment of second language learners' fluency: an automatic approach
Catia Cucchiarini, Helmer Strik, Louis Boves
Cross-language merged speech units and their descriptive phonetic correlates
Paul Dalsgaard, Ove Andersen, William J. Barry
Crosslinguistic disfluency modelling: a comparative analysis of Swedish and American English human-human and human-machine dialogues
Robert Eklund, Elizabeth Shriberg
Calibration of machine scores for pronunciation grading
Horacio Franco, Leonardo Neumeyer
Phonetic-distance-based hypothesis driven lexical adaptation for transcribing multilingual broadcast news
Petra Geutner, Michael Finke, Alex Waibel
Automatic pronunciation error detection and guidance for foreign language learning
Chul-Ho Jo, Tatsuya Kawahara, Shuji Doshita, Masatake Dantsuji
Lexical access for large-vocabulary speech recognition
Roger Ho-Yin Leung, Hong C. Leung
The effect of fundamental frequency on Mandarin speech recognition
Sharlene Liu, Sean Doyle, Allen Morris, Farzad Ehsani
The perception of nativeness: variable speakers and flexible listeners
Duncan Markham
Voice dictation in the secondary school classroom
Michael F. McTear, Eamonn A. O'Hare
The importance of the first syllable in English spoken word recognition by adult Japanese speakers
Kazuo Nakayama, Kaoru Tomita-Nakayama
Spoken L2 teaching with contrastive visual and auditory feedback
Anne-Marie Öster
The role of phonological, morphological, and orthographic knowledge in the intuitive syllabification of Dutch words: a longitudinal approach
Dominiek Sandra, Steven Gillis
The acquisition of Japanese compound accent rule
Ayako Shirose, Haruo Kubozono, Shigeru Kiritani
The acquisition of Putonghua phonology
Lydia K. H. So, Zhou Jing
Enhancing speech processing of Japanese learners of English utilizing time-scale expansion with constant pitch
Kaoru Tomita-Nakayama, Kazuo Nakayama, Masayuki Misaki
A bootstrap training approach for language model classifiers
Volker Warnke, Elmar Nöth, Jan Buckow, Stefan Harbeck, Heinrich Niemann
Voice onset time patterns in 7-, 9- and 11-year old children
Sandra P. Whiteside, Jeni Marshall
Some developmental patterns in the speech of 6-, 8- and 10-year old children: an acoustic phonetic study
Sandra P. Whiteside, Carolyn Hodgson
Language development after extreme childhood deprivation: a case study
Lisa-Jane Brown, John Locke, Peter Jones, Sandra P. Whiteside
Phonological elements as a basis for language-independent ASR
Geoff Williams, Mark Terry, Jonathan Kaye
A phonetic and acoustic study of babbling in an Italian child
Claudio Zmarich, Roberta Lanni
Rescoring multiple pronunciations generated from spelled words
Roland Kuhn, Jean-Claude Junqua, Philip D. Martzen
SIVHA, visual speech synthesis system
Yolanda Blanco, Maria Cuellar, Arantxa Villanueva, Fernando Lacunza, Rafael Cabeza, Beatriz Marcotegui
Using automatic speech recognition and its possible effects on the voice
C. G. de Bruijn, Sandra P. Whiteside, P. A. Cudd, D. Syder, K. M. Rosen, L. Nord
The importance of F0 or voice pitch for perception of tonal language: simulations with cochlear implant speech processing strategies
Robert Alexander Fearn
Assessing high-level language in individuals with multiple sclerosis: a pilot study
Karin Brunnegaard, Katja Laakso, Lena Hartelius, Elisabeth Ahlsen
Design of cochlear implant device for transmitting voice pitch information in speech sound of Asian languages
Shizuo Hiki, Kazuya Imaizumi, Yumiko Fukuda
Abnormal volume-duration relationship in parkinsonian speech
Aileen K. Ho, John L. Bradshaw, Robert Iansek, Robin J. Alfredson
Analysis of disordered speech signal using wavelet transform
Cheol-Woo Jo, Dae-Hyun Kim
Multi-channel pulsation strategy for electric stimulation of cochlea
Shigeyoshi Kitazawa, Hiroyuki Kirihata, Tatsuya Kitamura
Synthetic faces as a lipreading support
Eva Agelfors, Jonas Beskow, Martin Dahlquist, Björn Granström, Magnus Lundeberg, Karl-Erik Spens, Tobias Öhman
Predicting language scores from the speech perception scores of hearing-impaired children
Lois Martin, John Bench
Content-independent duration model on categories of voice and unvoice segments
Oleg P. Skljarov
Dynamical spectrogram, an aid for the deaf
Ali-Asghar Soltani-Farani, Edward H.S. Chilton, Robin Shirley
Evidence of dual-route phonetic encoding from apraxia of speech: implications for phonetic encoding models
Rosemary A. Varley, Sandra P. Whiteside
Speech communication profiles across the adult lifespan: persons without self-identified hearing impairment
M. F. Cheesman, K. L. Smilsky, T. M. Major, F. Lewis, L. M. Boorman
Time as a factor in the acoustic variation of schwa
William J. Barry
On the structure of vowel space: a genealogy of general phonetic concepts
Hendrik F. V. Boshoff, Elizabeth C. Botha
The relationship between intensity and subglottal pressure with controlled pitch
Véronique Lecuit, Didier Demolin
Segmentation of the airway from the surrounding tissues on magnetic resonance images: a comparative study
Alain Soquet, Véronique Lecuit, Thierry Metens, Bruno Nazarian, Didier Demolin
Recovering vocal tract shapes from MFCC parameters
Sorin Dusan, Li Deng
Quantification of pharyngeal articulations using measurements from laryngoscopic images
John H. Esling, Jocelyn Clayards, Jerold A. Edmondson, Qiu Fuyuan, Jimmy G. Harris
Variance and invariance in speech rate as a reflection of conceptual planning
Janice Fon
Correspondence between the glottal gesture overlap pattern and vowel devoicing in Japanese
Masako Fujimoto, Emi Murano, Seiji Niimi, Shigeru Kiritani
Evaluation of Japanese manners of generating word accent of English based on a stressed syllable detection technique
Yukiko Fujisawa, Nobuaki Minematsu, Seiichi Nakagawa
Independence of consonantal voicing and vocoid F0 perturbation in English and Japanese
Shunichi Ishihara
Reduction of English function words in Switchboard
Daniel Jurafsky, Alan Bell, Eric Fosler-Lussier, Cynthia Girand, William Raymond
Duration compensation in non-adjacent consonant and temporal regularity
Hee-Sun Kim
Relationship between lip shapes and acoustical characteristics during speech
Keisuke Mori, Yorinobu Sonoda
A model to represent propagation and radiation of higher-order modes for 3-D vocal-tract configuration
Kunitoshi Motoki, Hiroki Matsuzaki
FEM analysis of aspirated air flow in three-dimensional vocal tract during fricative consonant phonation
Takuya Niikawa, Masafumi Matsumura, Takashi Tachimura, Takeshi Wada
Trajectory formation of articulatory movements for a given sequence of phonemes
Takesi Okadome, Tokihiko Kaburagi, Masaaki Honda
Contextual effects on voicing profiles of German and Mandarin consonants
Chilin Shih, Bernd Möbius
Reconstructing the tongue surface from six cross-sectional contours: ultrasound data
Andrew J. Lundberg, Maureen Stone
Articulability of two consecutive morae in Japanese speech production: evidence from sound exchange errors in spontaneous speech
Yasushi Terao, Tadao Murata
Coarticulation and degrees of freedom in the elaboration of a new articulatory plant: GENTIANE
Anne Vilain, Christian Abry, Pierre Badin
A pressure sensitive palatography: application of new pressure sensitive sheet for measuring tongue-palatal contact pressure
Masahiko Wakumoto, Shinobu Masaki, Kiyoshi Honda, Toshikazu Ohue
Dual-route phonetic encoding: some acoustic evidence
Sandra P. Whiteside, Rosemary A. Varley
Fast and slow speech rate: a characterisation for French
Brigitte Zellner
Context dependent anti subword modeling for utterance verification
Padma Ramesh, Chin-Hui Lee, Biing-Hwang Juang
Combination of confidence measures in isolated word recognition
J. G. A. Dolfing, Andreas Wendemuth
Confidence measures for HMM-based speech recognition
Daniel Willett, Andreas Worm, Christoph Neukirchen, Gerhard Rigoll
Vocabulary-independent word confidence measure using subword features
Li Jiang, Xuedong Huang
A new confidence measure based on rank-ordering subphone scores
Qiguang Lin, Subrata Das, David Lubensky, Michael Picheny
Speaking-style dependent lexicalized filler model for key-phrase detection and verification
Tatsuya Kawahara, Kentaro Ishizuka, Shuji Doshita, Chin-Hui Lee
A speechreading aid based on phonetic ASR
Paul Duchnowski, Louis Braida, Maroula Bratakos, David Lum, Matthew Sexton, Jean Krause
Training speech through visual feedback patterns
Jan Nouza
Word sequence pair spotting for synchronization of speech and text in production of closed-caption TV programs for the hearing impaired
Ichiro Maruyama, Yoshiharu Abe, Takahiro Wakao, Eiji Sawamura, Terumasa Ehara, Katsuhiko Shirai
Volume regulation in parkinsonian speech
Aileen K. Ho, John L. Bradshaw, Robert Iansek, Robin J. Alfredson
On the amount and domain of focal lengthening in Swedish
Eva Strangert, Mattias Heldner
Differential lengthening of syllabic constituents in French: the effect of accent type and speaking style
Daniel Hirst, Corine Astesano, Albert Di Cristo
Prosodic analysis of fillers and self-repair in Japanese speech
Felix C. M. Quimbo, Tatsuya Kawahara, Shuji Doshita
A synthesis-oriented model of phrasal pitch movements in Standard Chinese
Jinfu Ni, Goh Kawai, Keikichi Hirose
Non-linear probability estimation method used in HMM for modeling frame correlation
Qing Guo, Fang Zheng, Jian Wu, Wenhu Wu
Patterns of linguopalatal contact during Japanese vowel devoicing
Shuri Kumagai
Speech separation based on the GMM PDF estimation
Xiao Yu, Guangrui Hu
Growth transform of a sum of rational functions and its application in estimating HMM parameters
Xiaoqiang Luo
Two automatic approaches for analyzing connected speech processes in Dutch
Mirjam Wester, Judith M. Kessens, Helmer Strik
The use of broad phonetic class models in speaker recognition
Johan W. Koolwaaij, Johan de Veth
Analysis and treatment of esophageal speech for the enhancement of its comprehension
Jorge Miquélez, Rocio Sesma, Yolanda Blanco
High quality text-to-speech system in Spanish for handicapped people
Fernando Lacunza, Yolanda Blanco
Factors affecting speech retrieval
Corinna Ng, Ross Wilkinson, Justin Zobel
Perception of words with vowel reduction
Johan Frid
Automated captioning of television programs: development and analysis of a soundtrack corpus
Ingrid Ahmer, Robin W. King
On the influence of the delta coefficients in a HMM-based speech recognition system
Fabrice Lefèvre, Claude Montacié, Marie-José Caraty
Speech recognition using the probabilistic neural network
Raymond Low, Roberto Togneri
A language modeling based on a hierarchical approach: m_n^v
Imed Zitouni
Temporal variables in lectures in the Japanese language
Michiko Watanabe
Building a statistical model of the vowel space for phoneticians
Matthew Aylett
Computer-mediated input and the acquisition of L2 vowels
Michelle Minnick Fox
Speech analysis by subspace methods of spectral line estimation
Najam Malik, W. Harvey Holmes
Pausing in Swedish spontaneous speech
Petra Hansson
Prosody and voice quality in the expression of emotions
Elisabeth Zetterholm
Acoustic analysis of /l/ in glossectomees
Julie Lunn, Alan A. Wrench, Janet Mackenzie Beck