Thu., Oct. 25, 12:30 pm – 2:00 pm, Mullen Library Instruction Room
When working with your dataset, have you wondered how to remove ‘null’ or ‘N/A’ from fields, handle different spellings of words, or determine whether a field name is ambiguous? When interviewed, many data scientists complain that the most tedious, time-consuming part of any project is cleaning and manipulating the data. In this workshop, we will use the open-source tool OpenRefine to clean, manipulate, and refine a dataset before analysis. Because the workshop focuses on saving you time by spotting and avoiding common pitfalls in data preparation, it will include a brief foray into regular expressions. You are welcome to bring your own dataset.
Please RSVP to firstname.lastname@example.org.