Digital Scholarship Fundamentals Workshop: Cleaning and Manipulating Data

Thu., Oct. 25, 12:30 pm – 2:00 pm, Mullen Library Instruction Room

When working with your dataset, have you wondered how to remove ‘null’ or ‘N/A’ from fields, handle different spellings of words, or determine whether a field name is ambiguous? When interviewed, many data scientists complain that the most tedious, time-consuming aspect of any project is cleaning and manipulating data. In this workshop, we will use the open-source software OpenRefine to clean, manipulate, and refine a dataset before analysis. Since the workshop is focused on saving you time by identifying and avoiding common pitfalls in data preparation, it will include a brief foray into regular expressions. You are welcome to bring your own dataset.

Please RSVP to gunn@cua.edu.
