Bunyamin, Hendra and Sulistiani, Lisan (2017) Automatic Topic Clustering Using Latent Dirichlet Allocation with Skip-gram Model on Final Project Abstracts. In: International Computer Science and Engineering Conference (ICSEC), 15-18 November 2017, Bangkok, Thailand.
![]() |
Text
2017-ICSEC-automatic_new.pdf Download (4Mb) |
![]() |
Text
2017-ICSEC-automatic_topic_clustering-turnitin.pdf Download (1179Kb) |
Abstract
Abstract—Topic model has been an elegant method to discover hidden structures in knowledge collections, such as news archives, blogs, web pages, scientific articles, books, images, voices, videos, and social media. The basic model of topic model is Latent Dirichlet Allocation (LDA) and this paper utilizes LDA to automatically cluster topics from final project abstract collection. We compare two methods, that are LDA as a unigram model and LDA with Skip-gram model. Our results are evaluated by an expert on readily available categories. Overall, words from each topic are indeed keywords describing each topic; moreover, the combination of LDA and skip-gram model are capable to capture key phrases from each topic.
Item Type: | Conference or Workshop Item (Paper) | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Contributors: |
|
||||||||||||
Uncontrolled Keywords: | topic model, latent dirichlet allocation, skipgram model, final project abstracts | ||||||||||||
Subjects: | T Technology > T Technology (General) | ||||||||||||
Divisions: | Faculty of Information Technology > 72 Information Technology Department | ||||||||||||
Depositing User: | Perpustakaan Maranatha | ||||||||||||
Date Deposited: | 28 Mar 2025 09:39 | ||||||||||||
Last Modified: | 28 Mar 2025 09:39 | ||||||||||||
URI: | http://repository.maranatha.edu/id/eprint/33625 |
Actions (login required)
![]() |
View Item |