The Classification of Documents in Malay and Indonesian Using the Naive Bayesian Method Uses Words and Phrases as a Training Set

Wijaya, Marvin Chandra (2020) The Classification of Documents in Malay and Indonesian Using the Naive Bayesian Method Uses Words and Phrases as a Training Set. MENDEL: Soft Computing Journal, 26 (2). pp. 23-28. ISSN 2571-3701

Abstract

Malay and Indonesian are closely related languages that share much of their vocabulary and grammar. Because they are so similar, classifying documents in the two languages automatically is a challenge. The Naive Bayesian method is widely used for classification today, but it needs to be implemented in a particular way to raise classification accuracy. This study uses a new method in which the training set consists of words and phrases rather than words alone. With this method, the classification accuracy for the two languages increases.
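
The idea in the abstract can be illustrated with a minimal sketch. It is not the author's implementation: it assumes a multinomial Naive Bayes classifier with Laplace smoothing, and it assumes that a "phrase" can be modelled as two adjacent words. The class name, function names, and the two toy training sentences are hypothetical and chosen only for illustration. The toy data relies on the fact that Indonesian uses "rumah sakit" for "hospital", while the individual words "rumah" and "sakit" occur in both languages.

```python
import math
from collections import Counter, defaultdict


def features(text, use_phrases=True):
    """Return the words of a text, plus adjacent word pairs if use_phrases is True."""
    words = text.lower().split()
    feats = list(words)
    if use_phrases:
        # Assumption: a "phrase" is approximated by two adjacent words.
        feats += [f"{a} {b}" for a, b in zip(words, words[1:])]
    return feats


class NaiveBayes:
    """Hypothetical multinomial Naive Bayes classifier with Laplace smoothing."""

    def __init__(self, use_phrases=True):
        self.use_phrases = use_phrases
        self.feature_counts = defaultdict(Counter)  # label -> feature -> count
        self.doc_counts = Counter()                 # label -> number of documents
        self.vocab = set()                          # every feature seen in training

    def train(self, labelled_docs):
        for text, label in labelled_docs:
            self.doc_counts[label] += 1
            for feat in features(text, self.use_phrases):
                self.feature_counts[label][feat] += 1
                self.vocab.add(feat)

    def log_scores(self, text):
        """Return the unnormalised log-probability of each label for a text."""
        total_docs = sum(self.doc_counts.values())
        scores = {}
        for label in self.doc_counts:
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + len(self.vocab)  # add-one smoothing
            score = math.log(self.doc_counts[label] / total_docs)
            for feat in features(text, self.use_phrases):
                if feat in self.vocab:  # features never seen in training are skipped
                    score += math.log((counts[feat] + 1) / denom)
            scores[label] = score
        return scores

    def classify(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Toy training set (illustrative only, not taken from the paper).
# Both sentences contain exactly the same words; only the word order differs.
TRAIN = [
    ("dia sakit di rumah", "malay"),
    ("dia di rumah sakit", "indonesian"),
]

words_only = NaiveBayes(use_phrases=False)
words_only.train(TRAIN)

with_phrases = NaiveBayes(use_phrases=True)
with_phrases.train(TRAIN)

query = "rumah sakit"
print(words_only.log_scores(query))   # both labels receive the same score
print(with_phrases.classify(query))   # indonesian
```

With words alone the two labels are indistinguishable on this toy data, because both training sentences contain the same words. Adding adjacent word pairs gives the classifier the feature "rumah sakit", which appears only in the Indonesian sentence and breaks the tie.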

Item Type: Article
Contributors:
Contribution: UNSPECIFIED; Contributor: Wijaya, Marvin Chandra; NIDN/NIDK: UNSPECIFIED; Email: UNSPECIFIED
Uncontrolled Keywords: Malay, Indonesia, Language, Naïve Bayesian, Classification.
Subjects: T Technology > T Technology (General)
Depositing User: Perpustakaan Maranatha
Date Deposited: 15 Jun 2023 07:37
Last Modified: 15 Jun 2023 07:37
URI: http://repository.maranatha.edu/id/eprint/31937
