Reference Hub4
Filtering Infrequent Behavior in Business Process Discovery by Using the Minimum Expectation

Filtering Infrequent Behavior in Business Process Discovery by Using the Minimum Expectation

Ying Huang, Liyun Zhong, Yan Chen
Copyright: © 2020 |Volume: 14 |Issue: 2 |Pages: 15
ISSN: 1557-3958|EISSN: 1557-3966|EISBN13: 9781799805328|DOI: 10.4018/IJCINI.2020040101
Cite Article Cite Article

MLA

Huang, Ying, et al. "Filtering Infrequent Behavior in Business Process Discovery by Using the Minimum Expectation." IJCINI vol.14, no.2 2020: pp.1-15. http://doi.org/10.4018/IJCINI.2020040101

APA

Huang, Y., Zhong, L., & Chen, Y. (2020). Filtering Infrequent Behavior in Business Process Discovery by Using the Minimum Expectation. International Journal of Cognitive Informatics and Natural Intelligence (IJCINI), 14(2), 1-15. http://doi.org/10.4018/IJCINI.2020040101

Chicago

Huang, Ying, Liyun Zhong, and Yan Chen. "Filtering Infrequent Behavior in Business Process Discovery by Using the Minimum Expectation," International Journal of Cognitive Informatics and Natural Intelligence (IJCINI) 14, no.2: 1-15. http://doi.org/10.4018/IJCINI.2020040101

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

The aim of process discovery is to discover process models from the process execution data stored in event logs. In the era of “Big Data,” one of the key challenges is to analyze the large amounts of collected data in meaningful and scalable ways. Most process discovery algorithms assume that all the data in an event log fully comply with the process execution specification, and the process event logs are no exception. However, real event logs contain large amounts of noise and data from irrelevant infrequent behavior. The infrequent behavior or noise has a negative influence on the process discovery procedure. This article presents a technique to remove infrequent behavior from event logs by calculating the minimum expectation of the process event log. The method was evaluated in detail, and the results showed that its application in existing process discovery algorithms significantly improves the quality of the discovered process models and that it scales well to large datasets.