SciELO - Scientific Electronic Library Online

 
vol.111 issue4Leveraging the Technology of Unmanned Aerial Vehicles for Developing Countries author indexsubject indexarticles search
Home Pagealphabetic serial listing  

Services on Demand

Article

Indicators

Related links

  • On index processCited by Google
  • On index processSimilars in Google

Share


SAIEE Africa Research Journal

On-line version ISSN 1991-1696
Print version ISSN 0038-2221

Abstract

KIZITO, Ronald; OKELLO, Wayne S.  and  KAGUMIRE, Sulaiman. Design and Implementation of a Luganda Text Normalization Module for a Speech Synthesis Software Program. SAIEE ARJ [online]. 2020, vol.111, n.4, pp.149-154. ISSN 1991-1696.

This paper describes a Luganda text normalization module, a crucial component needed for a Luganda Text to Speech system. We describe the use of a rule-based approach for detection, classification and verbalization of Luganda text. At the core of this module are the Luganda grammar rules that were hand-built to normalize Non-Standard Words (NSWs) from different semiotic and noun classes. Input text is first analyzed, matched against handcrafted patterns developed using regular expressions to detect any NSWs. Upon detection, NSWs are tokenized and classified into one of the semiotic classes and then if necessary, into one of the Luganda noun classes. These are subsequently verbalized, each according to its semiotic as well as noun class, and a new text file is produced. We tested the module with 7 datasets and achieved average detection and normalization rates of 82% and 77.7% respectively.

Keywords : Automatic Speech Recognition; Detection-conversion; Luganda; Machine Translation; NLP; Number system; Speech Synthesis; Text Normalization; Text-to-speech; TTS.

        · text in English     · English ( pdf )

 

Creative Commons License All the contents of this journal, except where otherwise noted, is licensed under a Creative Commons Attribution License