The process of cleaning data before it is loaded into a data warehouse by removing errors, incomplete information, and inconsistencies between sources.
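As a rough illustration of the three kinds of problems named in the definition, here is a minimal sketch in Python. The field names (`customer_id`, `email`, `country`, `source`), the `clean` function, and the validity rule (an email must contain `@`) are hypothetical choices made for this example only; real cleansing rules depend on the data and on the warehouse's requirements.

```python
# Hypothetical required fields; an actual pipeline would define its own.
REQUIRED = ("customer_id", "email", "country")


def clean(records):
    """Split records into (accepted rows, rejected (record, reason) pairs)."""
    accepted = {}  # customer_id -> cleaned row
    rejected = []  # (original record, reason)

    for rec in records:
        # Trim stray whitespace from string values before checking anything.
        row = {k: (v.strip() if isinstance(v, str) else v) for k, v in rec.items()}

        # Incomplete information: a required field is absent or empty.
        if any(row.get(field) in (None, "") for field in REQUIRED):
            rejected.append((rec, "incomplete"))
            continue

        # Errors: an assumed, deliberately simple validity rule.
        if "@" not in row["email"]:
            rejected.append((rec, "error"))
            continue

        # Normalize formatting so equal values from different sources compare equal.
        row["email"] = row["email"].lower()
        row["country"] = row["country"].upper()

        key = row["customer_id"]
        previous = accepted.get(key)
        if previous is None:
            accepted[key] = row
        elif any(previous[field] != row[field] for field in REQUIRED):
            # Inconsistency between sources: same key, conflicting values.
            rejected.append((rec, "inconsistent"))
        # Otherwise the record duplicates one already accepted and is skipped.

    return list(accepted.values()), rejected
```

In this sketch the first valid record for a key wins and later conflicting records are set aside for review rather than silently overwritten; other policies (for example, preferring a trusted source) are equally plausible.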