An Approach Based on Data Profiling at the Preparing a Dataset for Cleaning
Annotatsiya
It is known that today the volume of data is increasing dramatically in every field. This in turn complicates the process of data analysis. In this case, creating a data profile facilitates big data analysis. Therefore, data profiling is becoming an important and integral part of data analysis necessary for making accurate decisions in various fields. The importance of data profiling is that during the data profiling process, data that leads to incorrect decisions is identified and eliminated. After going through this process, the dataset becomes fully ready for data cleaning. The probability of algorithms built on the basis of a properly organized data cleaning data set to produce accurate and effective results increases several times. This research paper reveals the importance of data profiling in data analytics and artificial intelligence algorithms. At the same time, information is provided about the sequence of practical work performed in the data profiling process. The experimental dataset is also subjected to data profiling and the data cleaning processes to be performed on this selected dataset are determined. The article concludes with the main conclusions of this research work.