ISCA Archive SIGUL 2023
ISCA Archive SIGUL 2023

A Finite-State Morphological Analyzer for Saraiki

Meesum Alam, Alexandra O’Neil, Daniel Swanson, Francis Tyers

Saraiki (also Sirariki) (skr) is the first language of almost 25 million speakers in Pakistan and nearly one million speakers in India. Our study documents the process of creating an Apertium module for Saraiki and contributes to future efforts to generate computational resources for Saraiki. Apertium is chosen for the development of a Saraiki morphological analyzer since the platform has shown to adequately handle morphological complexity. In discussing the process of creating an analyzer for Saraiki, we detail our implementation by discussing our treatment of Saraiki morphology in regard to gender, number, and case marking for nouns and adjectives, verb categorizations (basic stem forms, direct causatives, and indirect causatives), and cases of ambiguity in nominal gender inflections.


doi: 10.21437/SIGUL.2023-3

Cite as: Alam, M., O’Neil, A., Swanson, D., Tyers, F. (2023) A Finite-State Morphological Analyzer for Saraiki . Proc. 2nd Annual Meeting of the ELRA/ISCA SIG on Under-resourced Languages (SIGUL 2023), 9-13, doi: 10.21437/SIGUL.2023-3

@inproceedings{alam23_sigul,
  author={Meesum Alam and Alexandra O’Neil and Daniel Swanson and Francis Tyers},
  title={{A Finite-State Morphological Analyzer for Saraiki }},
  year=2023,
  booktitle={Proc. 2nd Annual Meeting of the ELRA/ISCA SIG on Under-resourced Languages (SIGUL 2023)},
  pages={9--13},
  doi={10.21437/SIGUL.2023-3}
}