Data cleaning made easier with OpenRefine


Data cleaning made easier with OpenRefine

In an ideal world, any data you collect or obtain would be clean and formatted perfectly for analysis and visualization. But the reality is that data can be really messy! Cleaning and reformatting your data can be a time-consuming and tedious task, but there are ways to speed things up and automate repetitive tasks. OpenRefine can help!

This 2-hour workshop will provide an introduction to OpenRefine, a powerful open-source tool for exploring, cleaning, and manipulating “messy” data, to prepare it for analysis and visualization. Through a combination of lecture, demonstrations, and activities, participants will learn how to:

  • Understand what kinds of tasks are involved in data cleaning
  • Understand why data cleaning is important
  • Get started using OpenRefine for data cleaning to manipulate both textual and numeric data, transform and reshape datasets, and search and filter data in a variety of ways

This workshop is designed for those new to data cleaning and OpenRefine. There are no prerequisites or assumptions of knowledge of math, statistics, or programming.

Map & Data Library workshops, such as this one, are a welcoming and inclusive environment for learning. To learn more, check out our Code of Conduct.

Alternatively, if you would like to learn more about data cleaning and OpenRefine on your own, you are encouraged to explore the Map & Data Library’s OpenRefine online tutorials or self-enroll in our online, self-paced workshop (same content as this live one): Working with Messy Data in OpenRefine.

Instructor: Kelly Schultz, Data Visualization Librarian, Map & Data Library, University of Toronto Libraries

Date and Time: October 12, 2022 from 2-3 pm

Register Now (Students)