Syllabification of Balinese Words Using the Syllabification Algorithm
on
p-ISSN: 2301-5373
e-ISSN: 2654-5101
Jurnal Elektronik Ilmu Komputer Udayana
Volume 8, No 2. November 2019
Syllabification of Balinese Words Using the Syllabification Algorithm
Gede Bagus Prawira Putraa1, Ngurah Agus Sanjaya ERa2
aInformatics Department, Udayana University Bali, Indonesia 1[email protected] 2agus [email protected]
Abstract
This research discusses the syllabification of Balinese words. The syllabification of Balinese words is necessary in determining padalingsa in pupuh. In order to be able to develop the application of technology in the field of pupuh literary, a system that can automatically find the correct division of a Balinese word into syllabels is required. Several studies have been conducted on the syllabification Spanish and Sinhala language using rule-based approach [1][2]. Both of the studies achieved good accuracy score. Thus, in this research we apply the rule-based syllabification approach on Balinese words. The data used in this study were 257 Balinese words in which the system managed to correctly divide the word as many as 244 and obtained an accuracy score of 94.94%.
Keywords: Syllabification Algorithm, Word, Balinese, Word Syllabification, Rule-based
Language is a system of symbols used to communicate with others, which includes creativity and a system of rules. With this creativity, humans are able to build meaningful sentences by combining words and also pay attention to the rules in making sentences so that, language is something that will never be stopped because it is a medium of communication [3].
There are various languages used in communication. Each region in Indonesia has different regional languages where each language in a region has different rules. As one example in the area of Bali has a regional language that is Balinese.
Language, literature and religion are very closely related and cannot be separated from Balinese culture. Language can be said as a unifying tool for language because language is used as a medium of communication by the community. In general, the Balinese people use Balinese as a medium of communication in daily life and is an important part of culture because in the life of the people, Balinese has its own position and function [4].
Spelling of the Balinese has its own rules, both in the use of letters, use of punctuation, writing and syllabification of words. Syllabification can be used to find syllables in Balinese words. Syllabification in Balinese can also be implemented in a Balinese literary work called pupuh.
The pupuh is bound by the padalingsa law (rules) consisting of the guru wilang and the guru dingdong where the guru wilang is a provision that binds the number of lines and the number of syllables in each row on each type of pupuh, while the guru dingdong is the one who regulates the fall of vowels at the end syllables in each line [5]. To be able to know padalingsa from the pupuh, it is necessary to seize words in each line of the pupuh. In order to be able to develop the application of technology in the field of literary pupuh, a system that can syllabification Balinese words is needed. To be able to make a system of syllabification into syllables can use the syllabification algorithm. There have been many studies that discuss syllabification algorithm, including [1] [2], but there is no related research on syllabification algorithm for Balinese.
In research [1] developed a syllabification algorithm for Spanish language using a rule-based approach. The rules implemented in the algorithm are generated based on the grammar rules of the Spanish language. In that study, an error rate of 1.6%.
In the research [2] developed Syllabification algorithm Sinhalese is the language of Sri Lanka. The algorithm used is also a rule-based approach. The rules used are also based on Sinhala grammar rules. The accuracy obtained in the research is 99.95%.
Because there is no research that develops syllabification algorithm for Balinese and also seen from previous studies using the syllabification algorithm get high accuracy, therefore this research develops the Balinese syllabification algorithm with a rule-based approach. Where the rules for syllabification the Balinese langage words are taken from a book entitled "PEDOMAN UMUM EJAAN BAHASA BALI DENGAN HURUF LATIN" compiled by Ida Ayu Mirah Purwiati, Ni Putu Ekatini Negari, and I Wayan Sudiartha [6].
The data used are Balinese words both basic words and affixed words. The data used were 257 where the data was obtained on the website page id.wiktionary.org. The data taken is a Balinese word that has not been labeled or whose syllabification are unknown, so labeling is done manually. There will be 2 .txt files where one saves the words to be tested and the other saves the decapitation of those words.
Balinese has its own rules for syllabification. The result of syllabification is the syllable of the word. The syllables in the Balinese can consist of 6 categories according to figure 1 which consists of one vowel letter, preceded by vowels then followed by consonants, preceded by consonants and then followed by vowels, preceded by consonants then followed by a vowel then consonant, followed by consonant then followed by consonant then vowel, and followed by consonant then followed by consonant then vowel then consonant.
-
1. V a-dep
-
2. VK en-dep
-
3. KV di-ngeh
-
4. KVK sam-pi
-
5. KKV pra-gat
-
6. KKVK klang-sah
Ket:
V -VokaI K-Konsonan
-
Figure 1. Example of a Balinese Syllable Category
Syllabification of Balinese words can be done with the following rules:
-
a. If in the middle of the word there are two consecutive vowels, syllabification is done
between the two vowels. Example: si-ap ’chicken’
-
b. If there are consonants in between the vowels in the middle of the word, including
consonants, the syllabification is done before the consonants. Example: ja-gut ‘chin’
-
c. If in the middle of a word there are two consonants in a row, syllabification is carried out between the two consonants. Example: sam-pi ’cow’
-
d. If in the middle of a word there are three or more consonants, syllabification is done
between the first consonants. Example: kam-plang ’slap’
-
e. The prefix and suffix affixes, including those that have changed shape, syllabification
can be done at the turn of a line. Example: ka-tulis ’written’
Syllabification algorithm developed in this study uses a rule-based approach. The rules that are implemented in the Balinese syllabification algorithm are taken from a book entitled "PEDOMAN UMUM EJAAN BAHASA BALI DENGAN HURUF LATIN” compiled by Ida Ayu Mirah Purwiati, Ni Putu Ekatini Negari, and I Wayan Sudiartha as written in sub CHAPTER 2.2 [6]. Following is the syllabification algorithm for Balinese:
-
a. Check the characters in the word from the smallest to the largest index.
-
b. Initialize symbols representing diphthong letters ($ = ng, & = ny).
-
c. Check letters that cannot be separated / diphthong (ng and ny).
-
d. Change diphthong letters with one symbol ($, &, or another).
-
e. Create an index of words containing symbols, namely:
-
1. If diphthongs are in front, index after the diphthong letters.
-
2. If diphthongs are in the middle, index before and after diphthong letters.
-
3. If diphthongs are located behind, index before the diphthong letters.
-
f. Check vowels and consonants:
-
1. if the i-th consonant and the i + 1 consonant are indexed after the i-th letter.
-
2. If the i letter is vowel and the i letter is + 1 vowel, give an index after the i-th letter.
-
3. If the i letter is vowel and the i letter is + 1 consonant and the i letter is + 2 vowels, give
an index after the i letter.
-
4. If the consonant i and the i + 1 consonant and the i + 2 consonant, give an index after the i-th letter.
-
g. Make beheading according to the specified index.
-
h. Change the symbol back to diphthong letters.
The implementation of the system uses as much test data as the data set has, which is 257 Balinese words. The system syllabification every word contained in the test data. The results will be compared with the syllabification label from the test data which has been done manually syllabification process manually. If the result of the syllabification of the system is the same as the test syllabification label, the system will be considered correct in syllabification. The accuracy of the system is calculated by comparing the amount of true test data with the total of all test data or if it is formulated in a mathematical model it will be like the formula (1)
Accuracy = 100% (1)
Explanation:
N = Overall total test data
Db = Test data is correct
The test results show that the algorithm can syllabification almost entire words. Table 1 shows the test results of the Balinese Syllabification Algorithm
Table 1. System Testing Results
Amount of test data |
Correct |
Incorrect |
Accuracy |
257 |
244 |
13 |
94,94% |
Examples of Balinese words used in testing can be seen in table 2
Table 2. Examples of Test Data and Syllabification Results
I No □ Word Result I
1 |
Ancruk |
An-cruk |
2 |
Angklung |
Ang-klung |
3 |
Andus |
An-dus |
4 |
Baat |
Ba-at |
5 |
Badak |
Ba-dak |
6 |
Bakat |
Ba-kat |
7 |
banjar |
Ban-jar |
8 |
Baong |
Ba-ong |
9 |
Bapa |
Ba-pa |
10 |
Barong |
Ba-rong |
11 |
Basing |
Ba-sang |
12 |
Basange |
Ba-sa-nge |
13 |
Bates |
Ba-tes |
14 |
Batis |
Ba-tis |
15 |
Bawak |
Ba-wak |
16 |
Be |
Be |
17 |
Bebekel |
Be-be-kel |
18 |
Bedik |
Bed-ik |
19 |
Bek |
Bek |
20 |
belog |
Bel-og |
Syllabification algorithm is an algorithm that is used to syllabification words. Rules for syllabification vary depending on the language used. In general, syllabification algorithm uses a rule (rule-based) approach and the rule used is the rule in the fragment of the word itself. By using the syllabification algorithm and using the rules of syllabification Balinese words, the system proved successful in syllabification words with quite good results. The accuracy obtained by the system is 94.94% with the amount of data that was successfully beheaded with a large amount of 244 words from 257 test data.
References
-
[1] H. Cuayahuitl, “A syllabification algorithm for spanish,” in International Conference on
Intelligent Text Processing and Computational Linguistics, pp. 412–415, Springer, 2004.
-
[2] R. Weerasinghe, A. Wasala, and K. Gamage, “A rule based syllabification algorithm for
sinhala,” in International Conference on Natural Language Processing, pp. 438–449, Springer, 2005.
-
[3] N. Dhieni, L. Fridani, A. Muis, and G. Yarmi, “Metode pengembangan bahasa,” 2014.
-
[4] I. K. Sudarsana, “Menumbuhkan minat belajar bahasa bali pada kalangan remaja,” Prosiding Sembada 2017, 2017.
-
[5] I. G. A. M. Putra, P. Aji, and M. H. Syahbani, “Aplikasi pembelajaran nyanyian pupuh tradisional bali berbasis android di sekolah dasar,” ePro- ceedings of Applied Science, vol. 1, no. 2, 2015.
-
[6] I. W. S. Ida Ayu Mirah Purwiati, Ni Putu Ekatini Negari, PEDOMAN UMUM EJAAN BAHASA BALI DENGAN HURUF LATIN. Balai Bahasa Provinsi Bali, 2013.
128
Discussion and feedback