HTAP With Reactive Streaming ETL

HTAP With Reactive Streaming ETL

Carl Camilleri, Joseph G. Vella, Vitezslav Nezval
Copyright: © 2021 |Volume: 23 |Issue: 4 |Pages: 19
ISSN: 1548-7717|EISSN: 1548-7725|EISBN13: 9781799859192|DOI: 10.4018/JCIT.20211001.oa10
Cite Article Cite Article

MLA

Camilleri, Carl, et al. "HTAP With Reactive Streaming ETL." JCIT vol.23, no.4 2021: pp.1-19. http://doi.org/10.4018/JCIT.20211001.oa10

APA

Camilleri, C., Vella, J. G., & Nezval, V. (2021). HTAP With Reactive Streaming ETL. Journal of Cases on Information Technology (JCIT), 23(4), 1-19. http://doi.org/10.4018/JCIT.20211001.oa10

Chicago

Camilleri, Carl, Joseph G. Vella, and Vitezslav Nezval. "HTAP With Reactive Streaming ETL," Journal of Cases on Information Technology (JCIT) 23, no.4: 1-19. http://doi.org/10.4018/JCIT.20211001.oa10

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

In database management systems (DBMSs), query workloads can be classified as online transactional processing (OLTP) or online analytical processing (OLAP). These often run within separate DBMSs. In hybrid transactional and analytical processing (HTAP), both workloads may execute within the same DBMS. This article shows that it is possible to run separate OLTP and OLAP DBMSs, and still support timely business decisions from analytical queries running off fresh transactional data. Several setups to manage OLTP and OLAP workloads are analysed. Then, benchmarks on two industry standard DBMSs empirically show that, under an OLTP workload, a row-store DBMS sustains a 1000 times higher throughput than a columnar DBMS, whilst OLAP queries are more than 4 times faster on a columnar DBMS. Finally, a reactive streaming ETL pipeline is implemented which connects these two DBMSs. Separate benchmarks show that OLTP events can be streamed to an OLAP database within a few seconds.