A Comparative Evaluation of Different Keyword Extraction Techniques

A Comparative Evaluation of Different Keyword Extraction Techniques

Raj Kishor Bisht
Copyright: © 2022 |Volume: 12 |Issue: 1 |Pages: 17
ISSN: 2155-6377|EISSN: 2155-6385|EISBN13: 9781683182085|DOI: 10.4018/IJIRR.289573
Cite Article Cite Article

MLA

Bisht, Raj Kishor. "A Comparative Evaluation of Different Keyword Extraction Techniques." IJIRR vol.12, no.1 2022: pp.1-17. http://doi.org/10.4018/IJIRR.289573

APA

Bisht, R. K. (2022). A Comparative Evaluation of Different Keyword Extraction Techniques. International Journal of Information Retrieval Research (IJIRR), 12(1), 1-17. http://doi.org/10.4018/IJIRR.289573

Chicago

Bisht, Raj Kishor. "A Comparative Evaluation of Different Keyword Extraction Techniques," International Journal of Information Retrieval Research (IJIRR) 12, no.1: 1-17. http://doi.org/10.4018/IJIRR.289573

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

Retrieving keywords in a text is attracting researchers for a long time as it forms a base for many natural language applications like information retrieval, text summarization, document categorization etc. A text is a collection of words that represent the theme of the text naturally and to bring the naturalism under certain rules is itself a challenging task. In the present paper, the authors evaluate different spatial distribution based keyword extraction methods available in the literature on three standard scientific texts. The authors choose the first few high-frequency words for evaluation to reduce the complexity as all the methods are somehow based on frequency. The authors find that the methods are not providing good results particularly in the case of the first few retrieved words. Thus, the authors propose a new measure based on frequency, inverse document frequency, variance, and Tsallis entropy. Evaluation of different methods is done on the basis of precision, recall, and F-measure. Results show that the proposed method provides improved results.