Automatic Topic Clustering Using Latent Dirichlet Allocation with Skip-gram Model on Final Project Abstracts

Bunyamin, Hendra and Sulistiani, Lisan (2017) Automatic Topic Clustering Using Latent Dirichlet Allocation with Skip-gram Model on Final Project Abstracts. In: International Computer Science and Engineering Conference (ICSEC), 15-18 November 2017, Bangkok, Thailand.

[img] Text
2017-ICSEC-automatic_new.pdf

Download (4Mb)
[img] Text
2017-ICSEC-automatic_topic_clustering-turnitin.pdf

Download (1179Kb)
Official URL: https://ieeexplore.ieee.org/document/8443795/figur...

Abstract

Abstract—Topic model has been an elegant method to discover hidden structures in knowledge collections, such as news archives, blogs, web pages, scientific articles, books, images, voices, videos, and social media. The basic model of topic model is Latent Dirichlet Allocation (LDA) and this paper utilizes LDA to automatically cluster topics from final project abstract collection. We compare two methods, that are LDA as a unigram model and LDA with Skip-gram model. Our results are evaluated by an expert on readily available categories. Overall, words from each topic are indeed keywords describing each topic; moreover, the combination of LDA and skip-gram model are capable to capture key phrases from each topic.

Item Type: Conference or Workshop Item (Paper)
Contributors:
ContributionContributorsNIDN/NIDKEmail
AuthorBunyamin, HendraUNSPECIFIEDUNSPECIFIED
AuthorSulistiani, Lisan UNSPECIFIEDUNSPECIFIED
Uncontrolled Keywords: topic model, latent dirichlet allocation, skipgram model, final project abstracts
Subjects: T Technology > T Technology (General)
Divisions: Faculty of Information Technology > 72 Information Technology Department
Depositing User: Perpustakaan Maranatha
Date Deposited: 28 Mar 2025 09:39
Last Modified: 28 Mar 2025 09:39
URI: http://repository.maranatha.edu/id/eprint/33625

Actions (login required)

View Item View Item