Authors:

I Gede Angga Purnajiwa Arimbawa, Ngurah Agus Sanjaya ER

Abstract:

“Lemmatization is a process to extracting root word from an affixed word with the aim of reducing variations of the word into the root word. Previous researches on extraction of root word in Balinese Language has been done with rule- based methods to remove affixes from words. The weakness of the rule-based method is that it must comply with the set of rules provided. However, writings in Balinese often contain typographical errors because speakers tend to write words according to how the word is spoken instead of following the correct rules. In this research, we apply the Levenshtein distance method to overcome the aforementioned shortcoming. After all the rules applied to a given word fail, the Leven- shtein distance method is used to list all words that are ”close”. Next, we select the closest word as the root word of the given input. Based on the experiments, our proposed method achieved an accuracy of 96.01 %.”

Keywords

Keyword Not Available

Downloads:

Download data is not yet available.

References

References Not Available

PDF:

https://jurnal.harianregional.com/jlk/full-51892

Published

2020-01-25

How To Cite

PURNAJIWA ARIMBAWA, I Gede Angga; SANJAYA ER, Ngurah Agus. Lemmatization in Balinese Language.JELIKU (Jurnal Elektronik Ilmu Komputer Udayana), [S.l.], v. 8, n. 3, p. 235-242, jan. 2020. ISSN 2654-5101. Available at: https://ojs.unud.ac.id/index.php/JLK/article/view/51892. Date accessed: 28 Aug. 2025. doi:https://doi.org/10.24843/JLK.2020.v08.i03.p04.

Citation Format

ABNT, APA, BibTeX, CBE, EndNote - EndNote format (Macintosh & Windows), MLA, ProCite - RIS format (Macintosh & Windows), RefWorks, Reference Manager - RIS format (Windows only), Turabian

Issue

Vol 8 No 3 (2020): JELIKU Volume 8 No 3, February 2020

Section

Articles

Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 International License