Software Keyphrase Extraction with Domain-specific Features

Karnalim, Oscar (2016) Software Keyphrase Extraction with Domain-specific Features. In: 2016 International Conference on Advanced Computing and Applications (ACOMP) 2016, 23-25 November 2016, Can Tho City.

[img] Text
A14 2016-11 ACOMP ISBN-978-1-5090-6145-7.pdf - Published Version

Download (1246Kb)

Abstract

Despite the fact that keyphrase is widely used as a brief summary to represent documents, most keyphrase extraction is only focused on arbitrary text. However, many document types have specific behavior which require particular pre-processing in order to extract keyphrases. In software domain, keyphrases can only be extracted by utilizing reverseengineering approach and applying several conversion rules. This paper proposes a mechanism to extract software keyphrases with domain-specific features. For our case study, our proposed method is applied to Java Archive, a distributional form of Java binaries. Besides pre-processing and conversion rules, our method also utilizes the combination of supervised and unsupervised keyphrase extraction approach to exploit the benefits of both approaches. Furthermore, in order to extract keyphrase pattern more accurately, software-related features are also incorporated besides standard keyphrase extraction features. These features are software structure, software-related natural language text, and software term association. Based on overall evaluation, our proposed method yields moderate Rprecision. Thus, our approach is quite considerable to be applied for extracting software keyphrase.

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: Keywords—keyphrase extraction; software; domain-specific features; Java Archive
Subjects: T Technology > T Technology (General)
Depositing User: Perpustakaan Maranatha
Date Deposited: 09 Apr 2018 02:34
Last Modified: 09 Apr 2018 02:34
URI: http://repository.maranatha.edu/id/eprint/24327

Actions (login required)

View Item View Item