Reference Hub2
Large-Scale System for Social Media Data Warehousing: The Case of Twitter-Related Drug Abuse Events Integration

Large-Scale System for Social Media Data Warehousing: The Case of Twitter-Related Drug Abuse Events Integration

Jenhani Ferdaous, Mohamed Salah Gouider
Copyright: © 2022 |Volume: 18 |Issue: 1 |Pages: 18
ISSN: 1548-3924|EISSN: 1548-3932|EISBN13: 9781799893684|DOI: 10.4018/IJDWM.290890
Cite Article Cite Article

MLA

Ferdaous, Jenhani, and Mohamed Salah Gouider. "Large-Scale System for Social Media Data Warehousing: The Case of Twitter-Related Drug Abuse Events Integration." IJDWM vol.18, no.1 2022: pp.1-18. http://doi.org/10.4018/IJDWM.290890

APA

Ferdaous, J. & Gouider, M. S. (2022). Large-Scale System for Social Media Data Warehousing: The Case of Twitter-Related Drug Abuse Events Integration. International Journal of Data Warehousing and Mining (IJDWM), 18(1), 1-18. http://doi.org/10.4018/IJDWM.290890

Chicago

Ferdaous, Jenhani, and Mohamed Salah Gouider. "Large-Scale System for Social Media Data Warehousing: The Case of Twitter-Related Drug Abuse Events Integration," International Journal of Data Warehousing and Mining (IJDWM) 18, no.1: 1-18. http://doi.org/10.4018/IJDWM.290890

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

Social media data become an integral part in the business data and should be integrated into the decisional process for better decision making based on information which reflects better the true situation of business in any field. However, social media data are unstructured and generated in very high frequency which exceeds the capacity of the data warehouse. In this work, we propose to extend the data warehousing process with a staging area which heart is a large scale system implementing an information extraction process using Storm and Hadoop frameworks to better manage their volume and frequency. Concerning structured information extraction, mainly events, we combine a set of techniques from NLP, linguistic rules and machine learning to succeed the task. Finally, we propose the adequate data warehouse conceptual model for events modeling and integration with enterprise data warehouse using an intermediate table called Bridge table. For application and experiments, we focus on drug abuse events extraction from Twitter data and their modeling into the Event Data Warehouse.