ISCA Archive Interspeech 2014

Query-by-example spoken term detection on multilingual unconstrained speech

Xavier Anguera, Luis Javier Rodriguez-Fuentes, Igor Szőke, Andi Buzo, Florian Metze, Mikel Penagarikano

As part of the MediaEval 2013 benchmark evaluation campaign, the Spoken Web Search (SWS) task aimed to perform Query-by-Example Spoken Term Detection (QbE-STD) using audio queries in a low-resource setting. After two successful editions and steadily growing interest in the scientific community, a special effort was made in SWS 2013 to prepare a challenging database, including speech in 9 different languages with diverse environment and channel conditions. In this paper, we first describe the database and the performance metrics. We then briefly review the algorithmic approaches followed by participants, and present and discuss the results obtained, which demonstrate the feasibility of the proposed task even under such challenging conditions (multiple languages and unconstrained acoustic conditions). Finally, we analyze the fusion of the top-performing systems, which achieved a 30% relative improvement over the best single system in the evaluation, showing that a variety of approaches can be effectively combined to bring complementary information to the search for queries.

doi: 10.21437/Interspeech.2014-522

Cite as: Anguera, X., Rodriguez-Fuentes, L.J., Szőke, I., Buzo, A., Metze, F., Penagarikano, M. (2014) Query-by-example spoken term detection on multilingual unconstrained speech. Proc. Interspeech 2014, 2459-2463, doi: 10.21437/Interspeech.2014-522

  author={Xavier Anguera and Luis Javier Rodriguez-Fuentes and Igor Szőke and Andi Buzo and Florian Metze and Mikel Penagarikano},
  title={{Query-by-example spoken term detection on multilingual unconstrained speech}},
  booktitle={Proc. Interspeech 2014},