Data Quality Management dalam Data Warehouse: Tinjauan Literatur

  • Romiko Afriantoni Universitas Ibnu Sina
  • Naisya Nindy Pangestuti Mahasiswa Universitas Ibnu Sina
Keywords: Data quality management; data warehouse; ETL/ELT; data governance; data lineage; observability; data contracts; PRISMA; systematic literature review.

Abstract

This study presents a systematic literature review of Data Quality Management (DQM) in data warehouse environments, aiming to map key dimensions, processes, and architectural/technological enablers, and to identify research gaps. Searches were conducted across Scopus, ScienceDirect, IEEE Xplore, ACM Digital Library, SpringerLink, and Google Scholar (as a complement) for the period 2009–2025, following PRISMA 2020. Of 200 initial records, 133 were excluded during the first screening, 67 underwent further assessment, and 6 studies met the inclusion criteria for in-depth analysis. Thematic synthesis indicates that effective DQM rests on four integrated pillars: (1) standardized quality dimensions and metrics (accuracy, completeness, consistency, timeliness, and traceability), (2) prevention–detection–correction processes embedded along the ETL/ELT pipeline (including consistent SCD policies and handling of late-arriving data), (3) architectural/technological support (automated data tests within CI/CD, catalogs/metadata, data lineage, observability, and data contracts), and (4) governance that clarifies roles and accountability (data owners/stewards) with incident-response procedures. Practically, organizations should start from critical data elements and high-priority consumption paths, translating SLA/SLI into executable rules. Limitations include the small number of included studies and contextual heterogeneity, motivating further work on cross-domain metric standardization, open DQM benchmarks, cost–benefit evaluations of observability/contract enforcement, and the impact of data quality on analytic/AI performance in near real-time settings.

Downloads

Download data is not yet available.

References

Altendeitering, M., Guggenberger, T. M., & Möller, F. (2024). A design theory for data quality tools in data ecosystems: Findings from three industry cases. Data & Knowledge Engineering, 153, 102333. https://doi.org/10.1016/j.datak.2024.102333
Bernardo, B. M. V., Mamede, H. S., Barroso, J., & Santos, V. (2024). Data governance & quality management—Innovation and breakthroughs across different fields. Journal of Innovation and Knowledge, 9(4), 100598. https://doi.org/10.1016/j.jik.2024.100598
Ehrlinger, L., & Wöß, W. (2022). A survey of data quality measurement and monitoring tools. Frontiers in Big Data, 5, 850611. https://doi.org/10.3389/fdata.2022.850611
Foidl, H., Golendukhina, V., Ramler, R., & Felderer, M. (2024). Data pipeline quality: Influencing factors, root causes of data-related issues, and processing problem areas for developers. Journal of Systems and Software, 207, 111855. https://doi.org/10.1016/j.jss.2023.111855
Miller, R. (2024). A framework for current and new data quality dimensions: An overview. Data, 9(12), 151. https://doi.org/10.3390/data9120151
Taleb, I., Serhani, M. A., Bouhaddioui, C., & Dssouli, R. (2021). Big data quality framework: A holistic approach to continuous quality management. Journal of Big Data, 8, Article 76. https://doi.org/10.1186/s40537-021-00468-0
Liu, Q., Feng, G., Tayi, G. K., & Tian, J. (2020). Minimizing the data quality problem of information systems: A process-based method. Decision Support Systems, 137, 113381. https://doi.org/10.1016/j.dss.2020.113381
Schwabe, D., et al. (2024). The METRIC-framework for assessing data quality of medical training data. npj Digital Medicine, 7, 116. https://doi.org/10.1038/s41746-024-01196-4
Ehrlinger, L., & Wöß, W. (2022). A survey of data quality measurement and monitoring tools. Frontiers in Big Data, 5, 850611. https://doi.org/10.3389/fdata.2022.850611
Foidl, H., Golendukhina, V., Ramler, R., & Felderer, M. (2024). Data pipeline quality: Influencing factors, root causes of data-related issues, and processing problem areas for developers. Journal of Systems and Software, 207, 111855. https://doi.org/10.1016/j.jss.2023.111855
Statistik
Abstract View: 15
Article Download: 18 Turnitin Download: 2
Published
2025-12-28