Authors
Hadi Fadlallah, Rima Kilany, Houssein Dhayne, Rami El Haddad, Rafiqul Haque, Yehia Taher, Ali Jaber
Publication date
2023/8/23
Journal
ACM Journal of Data and Information Quality
Volume
15
Issue
3
Pages
1-30
Publisher
ACM
Description
In the big data domain, data quality assessment operations are often complex and must be implementable in a distributed and timely manner. This article tries to generalize the quality assessment operations by providing a new ISO-based declarative data quality assessment framework (BIGQA). BIGQA is a flexible solution that supports data quality assessment in different domains and contexts. It facilitates the planning and execution of big data quality assessment operations for data domain experts and data management specialists at any phase in the data life cycle. This work implements BIGQA to demonstrate its ability to produce customized data quality reports while running efficiently on parallel or distributed computing frameworks. BIGQA generates data quality assessment plans using straightforward operators designed to handle big data and guarantee a high degree of parallelism when executed. Moreover, it …
Total citations
Scholar articles
H Fadlallah, R Kilany, H Dhayne, R El Haddad… - ACM Journal of Data and Information Quality, 2023