Major tasks of data pre-processing
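Data cleaning is the process of correcting or removing errors in the data, such as missing values, noise, and inconsistencies, so that the data can be easily integrated.
Data integration is the process of combining data from multiple sources into a single, consistent data set.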
Data reduction is the process of reducing a large data set to a smaller one that still preserves the information needed for analysis, so that the data can be transformed more easily.
Data transformation is the process of converting the data into a form suitable for mining, for example by normalizing or aggregating values.
After the completion of these tasks, the data is ready for mining.
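
The four tasks above can be sketched as a small pipeline. The example below is only an illustration under stated assumptions: the toy records, the field names (`id`, `age`, `total`, `notes`), and the helper functions are all hypothetical, and the specific techniques chosen (mean imputation, duplicate removal, a join on `id`, dropping an unused attribute, min-max scaling) are just one common option for each task, not the only way to perform it.

```python
from statistics import mean

# Hypothetical toy data: two sources describing the same customers.
customers = [
    {"id": 1, "age": 34},
    {"id": 2, "age": None},   # missing value
    {"id": 3, "age": 50},
    {"id": 3, "age": 50},     # duplicate record
]
purchases = [
    {"id": 1, "total": 120.0, "notes": "gift"},
    {"id": 2, "total": 80.0, "notes": ""},
    {"id": 3, "total": 200.0, "notes": "repeat buyer"},
]


def clean(rows):
    """Data cleaning: drop duplicate records, fill missing ages with the mean."""
    unique, seen = [], set()
    for row in rows:
        key = (row["id"], row["age"])
        if key not in seen:
            seen.add(key)
            unique.append(dict(row))
    fill = mean(r["age"] for r in unique if r["age"] is not None)
    for r in unique:
        if r["age"] is None:
            r["age"] = fill
    return unique


def integrate(left, right):
    """Data integration: join the two sources on the shared id."""
    by_id = {r["id"]: r for r in right}
    return [{**l, **by_id[l["id"]]} for l in left if l["id"] in by_id]


def reduce_columns(rows, keep):
    """Data reduction: keep only the attributes needed for mining."""
    return [{k: r[k] for k in keep} for r in rows]


def min_max_scale(rows, column):
    """Data transformation: rescale one column to the range [0, 1]."""
    values = [r[column] for r in rows]
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1  # avoid division by zero when all values are equal
    return [{**r, column: (r[column] - lo) / span} for r in rows]


# Run the tasks in the order given above.
cleaned = clean(customers)
merged = integrate(cleaned, purchases)
reduced = reduce_columns(merged, ["id", "age", "total"])
prepared = min_max_scale(reduced, "total")
```

Each function takes the output of the previous one, which mirrors the ordering in the list: cleaning makes integration easier, and reduction leaves less data to transform. In practice a library such as pandas would usually replace these hand-written helpers.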