Automatic Segmentation of Indonesian Speech into Syllables using Fuzzy Smoothed Energy Contour with Local Normalization, Splitting, and Assimilation
AbstractThis paper discusses the usage of short term energy contour of a speech smoothed by a fuzzy-based method to automatically segment the speech into syllabic units. Two additional procedures, local normalization and postprocessing, are proposed to improve the method. Testing to Indonesian speech dataset shows that local normalization significantly improves the accuracy of fuzzy smoothing. In postprocessing step, the procedure of splitting missed short syllables reduces the deletion errors, but unfortunately it increases the insertion ones. On the other hand, an assimilation of a single consonant segment into its previous or next segment reduces the insertion errors, but increases the deletion ones. The sequential combination of splitting and then assimilation gives quite significant improvement of accuracy as well as reduction of deletion errors, but it slightly increases the insertion ones.
Janakiraman, R., Kumar, J.C., and Murthy, H.A., Robust syllable segmentation and its application to syllable-centric continuous speech recognition, in Proceedings of National Conference on Communications (NCC), pp. 1-5, 2010.
Sheikhi, G. & Almasganj, F., Segmentation of speech into syllable units using fuzzy smoothed short term energy contour, in Proc. the 18th Iranian Conference on BioMedical Engineering, 14-16 December 2011, Tehran, Iran, pp. 195-198, 2011.
Suyanto & Adityatama, J., 2012, Yooi: An Indonesian Short Message Dictation, International Journal of Intelligent Information Processing (IJIIP), Republic of Korea, 3(4), pp. 68-74, 2012.
Suyanto & Hartati, S., Design of Indonesian LVCSR Using Combined Phoneme and Syllable Models, in Proceedings of the 7th International Conference on Information & Communication Technology and Systems (ICTS), Bali, Indonesia, pp. 191-196, 2013.
Alwi, H., Dardjowidjojo, S., Lapoliwa, H., and Moeliono, A.M., Tata bahasa baku bahasa Indonesia (The standart Indonesian grammar), Jakarta, Balai Pustaka, 1998.
Petrillo, M. & Cutugno, F., A syllable segmentation algorithm for English and italian, in Proceedings of EUROSPEECH, pp. 2913-2916, 2003.