The terms data cleaning and data transformation are sometimes used interchangeably, but they describe fundamentally different processes.
Data cleaning is primarily about correcting any mistakes in a raw dataset. The data cleaning process works to remediate missing datapoints, identify values that seem wildly out of scope and remove any irrelevant data. The data cleaning step is solely focused on correcting any mistakes and making the raw data more accurate.
Data transformation is a more involved process which transforms the data structure and format for a particular use. For example, a data visualization tool, machine learning algorithm or BI workflow will need datasets to match a specific structure and format before it can be successfully processed. This process, which takes place after data has been cleaned and checked for accuracy, is what transforms raw data into a usable format.
It’s similar to a professional kitchen where ingredients are cleaned and sorted before cooking. Once they’ve been cleaned, they’re prepped for a particular dish — a carrot may be diced for soup or shredded for a salad, a transformation which depends on the dish it is meant for.