Are you struggling with cleaning and organizing your large data sets? Consider using OpenRefine for data cleaning. OpenRefine is an open source application for data cleaning and transformation, also known as data wrangling. OpenRefine allows you to view and manipulate large quantities of data, making it an excellent tool for wrangling big data sets.
By the end of this 90-minute workshop, participants will gain experience conducting key data cleaning practices, including:
- Removing duplicate records
- Analyzing the occurrence of values throughout a data set
- Clustering and standardizing values
- Separating multiple values contained in the same field
- Joining multiple values contained in separate fields
Workshop materials can be found on our Open Science Framework page, closer to the workshop date.