Previously known as Google Refine, Open Refine is an open-source application for data cleaning and data transformation which will come in useful when performing data analysis at a later stage. It uses your browser as an interface but keeping your data private on your own device unless you would like to share it with others.
Before we can analyze any data, we often need to clean the data beforehand. We use this Excel file to demonstrate the powerful data cleaning function in Open Refine. The information in the spreadsheet under "Region" in column F is in a mess with inconsistent spelling. It would be very time consuming to manually clean the data using Excel.
Now ley's try to clean it by using Open Refine with steps below:
For more data cleaning functions, please refer to the official documentation.