ISCA Archive Interspeech 2014

Segmentation in singer turns with the Bayesian information criterion

Marwa Thlithi, Thomas Pellegrini, Julien Pinquier, Régine André-Obrecht

As part of a project on indexing ethno-musicological audio recordings, automatic segmentation into singer turns appeared to be essential. In this article, we present the problem of segmenting musical recordings into singer turns and our first experiments in this direction, which explore a method based on the Bayesian Information Criterion (BIC), a criterion used in numerous works on audio segmentation, to detect singer turns. The value of the BIC penalty coefficient that achieves the best performance was shown to vary from one recording to another. To avoid having to choose a single value for all documents, we propose to combine several segmentations obtained with different values of this parameter. This method consists of taking a posteriori decisions on which segment boundaries are to be kept. A gain of 7.1% in terms of F-measure was obtained compared to a standard coefficient.
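
The two ingredients named in the abstract, a BIC test with a penalty coefficient and an a posteriori combination of several segmentations, can be illustrated with a minimal Python sketch. It is not the authors' implementation. `delta_bic` assumes the usual full-covariance Gaussian formulation of the BIC change test, with the convention that a positive value favours a boundary. `combine_boundaries` uses a simple voting rule that is purely an assumption, since the abstract does not state how the decision on each boundary is taken. The function names and the parameters `lam`, `tol` and `min_votes` are hypothetical.

```python
import numpy as np


def delta_bic(X, t, lam=1.0):
    """Delta-BIC for splitting the feature window X (N frames x d dims) at frame t.

    Assumed convention: a positive value favours a boundary at t.
    lam is the penalty coefficient mentioned in the abstract.
    """
    N, d = X.shape

    def logdet_cov(Z):
        # Maximum-likelihood covariance, lightly regularised for stability.
        cov = np.cov(Z, rowvar=False, bias=True).reshape(d, d)
        cov = cov + 1e-6 * np.eye(d)
        return np.linalg.slogdet(cov)[1]

    # Likelihood gain of modelling the window with two Gaussians instead of one.
    gain = 0.5 * (N * logdet_cov(X)
                  - t * logdet_cov(X[:t])
                  - (N - t) * logdet_cov(X[t:]))
    # Complexity penalty for the extra mean vector and covariance matrix.
    penalty = 0.5 * (d + 0.5 * d * (d + 1)) * np.log(N)
    return gain - lam * penalty


def combine_boundaries(segmentations, tol=0.5, min_votes=2):
    """Hypothetical voting rule (an assumption, not the paper's decision rule).

    segmentations: one list of boundary times (seconds) per penalty value.
    Boundaries closer than tol to their neighbour are grouped; a group is kept
    if at least min_votes different segmentations contributed to it.
    """
    pooled = sorted((b, run) for run, seg in enumerate(segmentations) for b in seg)
    clusters = []
    for b, run in pooled:
        if clusters and b - clusters[-1][-1][0] <= tol:
            clusters[-1].append((b, run))
        else:
            clusters.append([(b, run)])
    # Keep the mean time of every sufficiently supported group.
    return [sum(b for b, _ in c) / len(c)
            for c in clusters
            if len({run for _, run in c}) >= min_votes]
```

For instance, `combine_boundaries([[1.0, 5.0], [1.2, 9.0], [1.1, 5.3]])` keeps boundaries at roughly 1.1 s and 5.15 s and drops the one at 9.0 s, which only a single segmentation proposed. A lower `lam` makes `delta_bic` accept more boundaries, so running the detector with several values and voting afterwards avoids committing to one coefficient for every recording.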

doi: 10.21437/Interspeech.2014-453

Cite as: Thlithi, M., Pellegrini, T., Pinquier, J., André-Obrecht, R. (2014) Segmentation in singer turns with the Bayesian information criterion. Proc. Interspeech 2014, 1988-1992, doi: 10.21437/Interspeech.2014-453

  author={Marwa Thlithi and Thomas Pellegrini and Julien Pinquier and Régine André-Obrecht},
  title={{Segmentation in singer turns with the Bayesian information criterion}},
  booktitle={Proc. Interspeech 2014},