ISCA Archive SIGUL 2023

Automatic Transcription and (De)Standardisation

Nina Markl, Electra Wallington, Ondrej Klejch, Thomas Reitmaier, Gavin Bailey, Jennifer Pearson, Matt Jones, Simon Robinson, Peter Bell

In this paper, we illustrate the gap between real language use and the language use assumed in ASR development through the example of isiXhosa in Langa, South Africa. Understanding speech and writing practices in context is particularly important when developing speech technologies for minoritised and under-resourced languages and their communities.


doi: 10.21437/SIGUL.2023-20

Cite as: Markl, N., Wallington, E., Klejch, O., Reitmaier, T., Bailey, G., Pearson, J., Jones, M., Robinson, S., Bell, P. (2023) Automatic Transcription and (De)Standardisation. Proc. 2nd Annual Meeting of the ELRA/ISCA SIG on Under-resourced Languages (SIGUL 2023), 93-97, doi: 10.21437/SIGUL.2023-20

@inproceedings{markl23_sigul,
  author={Nina Markl and Electra Wallington and Ondrej Klejch and Thomas Reitmaier and Gavin Bailey and Jennifer Pearson and Matt Jones and Simon Robinson and Peter Bell},
  title={{Automatic Transcription and (De)Standardisation}},
  year=2023,
  booktitle={Proc. 2nd Annual Meeting of the ELRA/ISCA SIG on Under-resourced Languages (SIGUL 2023)},
  pages={93--97},
  doi={10.21437/SIGUL.2023-20}
}