Reference Hub2
Enhanced Bootstrapping Algorithm for Automatic Annotation of Tweets

Enhanced Bootstrapping Algorithm for Automatic Annotation of Tweets

Mudasir Mohd, Rafiya Jan, Nida Hakak
Copyright: © 2020 |Volume: 14 |Issue: 2 |Pages: 26
ISSN: 1557-3958|EISSN: 1557-3966|EISBN13: 9781799805328|DOI: 10.4018/IJCINI.2020040103
Cite Article Cite Article

MLA

Mohd, Mudasir, et al. "Enhanced Bootstrapping Algorithm for Automatic Annotation of Tweets." IJCINI vol.14, no.2 2020: pp.35-60. http://doi.org/10.4018/IJCINI.2020040103

APA

Mohd, M., Jan, R., & Hakak, N. (2020). Enhanced Bootstrapping Algorithm for Automatic Annotation of Tweets. International Journal of Cognitive Informatics and Natural Intelligence (IJCINI), 14(2), 35-60. http://doi.org/10.4018/IJCINI.2020040103

Chicago

Mohd, Mudasir, Rafiya Jan, and Nida Hakak. "Enhanced Bootstrapping Algorithm for Automatic Annotation of Tweets," International Journal of Cognitive Informatics and Natural Intelligence (IJCINI) 14, no.2: 35-60. http://doi.org/10.4018/IJCINI.2020040103

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

Annotations are critical in various text mining tasks such as opinion mining, sentiment analysis, word sense disambiguation. Supervised learning algorithms start with the training of the classifier and require manually annotated datasets. However, manual annotations are often subjective, biased, onerous, and burdensome to develop; therefore, there is a need for automatic annotation. Automatic annotators automatically annotate the data for creating the training set for the supervised classifier, but lack subjectivity and ignore semantics of underlying textual structures. The objective of this research is to develop scalable and semantically rich automatic annotation system while incorporating domain dependent characteristics of the annotation process. The authors devised an enhanced bootstrapping algorithm for the automatic annotation of Tweets and employed distributional semantic models (LSA and Word2Vec) to augment the novel Bootstrapping algorithm and tested the proposed algorithm on the 12,000 crowd-sourced annotated Tweets and achieved a 68.56% accuracy which is higher than the baseline accuracy.