Data Quality Issues and Current Approaches to Data Cleaning Process In Data Warehousing

Jaya Bajpai, Symbiosis Institute of Computer Studies and Research, Pune; Dr. Pravin Metkewar, Symbiosis Institute of Computer Studies and Research

Data Cleaning, Data Warehousing, Data Quality, ETL, Data Cleaning Tool

This paper discusses the data quality problems that are addressed during the data cleaning phase. Data cleaning is one of the important processes in ETL, and it is especially required when integrating heterogeneous data sources. This problem should be addressed together with schema-related data transformations. Finally, the paper discusses current tool support for data cleaning.
Paper ID: GRDJEV01I100013
Published in: Volume : 1, Issue : 10
Publication Date: 2016-10-01
Page(s): 14 - 18