Tuesday, March 26 • 8:00am - 12:00pm
The Art of OpenRefine: Data Manipulation, Reconciliation, and GREL

This workshop requires pre-registration. Fee: $35

The first encounter most library patrons have with art objects, books, and special collections materials is typically through library records rather than the resources themselves. This workshop focuses on making our data as beautiful and rich as the information objects we share. Participants will work firsthand with OpenRefine using data from the University of Utah's "Utah Artists Project," gathered via the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). OpenRefine is an open source software program that has been adopted by the library profession to manipulate and clean data in a variety of formats (XML, CSV, JSON, etc.). While particularly applicable to professionals working with digital collections, this workshop is relevant for anyone who deals with data or wants to learn more about open source tools and resources. In this session, individuals will be guided through the process of creating a project, using the Google Refine Expression Language (GREL), automatically matching names and subjects with existing authorities and controlled vocabularies (reconciliation), and exporting the resulting project. We will begin with the simple data manipulation processes built into OpenRefine and then move on to custom expressions written in GREL. For exercises on reconciliation, the Getty and Library of Congress vocabularies will be highlighted. Additional information on reconciliation with VIAF and GeoNames will also be provided. Finally, the session will end with methods for getting your cleaned data out of the program and into your own systems for public use and enjoyment.
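
For readers unfamiliar with OAI-PMH, the harvesting step mentioned above can be sketched in Python. This is a minimal illustration, not the workshop's own procedure: the endpoint https://example.org/oai, the set name "artists", and the sample record are placeholders invented for the example. Only the ListRecords verb, the oai_dc metadata prefix, and the XML namespaces come from the OAI-PMH and Dublin Core specifications.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespaces defined by the OAI-PMH 2.0 and Dublin Core specifications.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def build_list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build a ListRecords request URL for an OAI-PMH endpoint."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec
    return base_url + "?" + urlencode(params)


def extract_records(xml_text):
    """Pull identifier, titles, and creators out of a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for record in root.iterfind(".//oai:record", NS):
        identifier = record.findtext(
            "oai:header/oai:identifier", default="", namespaces=NS
        )
        titles = [el.text or "" for el in record.iterfind(".//dc:title", NS)]
        creators = [el.text or "" for el in record.iterfind(".//dc:creator", NS)]
        records.append(
            {"identifier": identifier, "titles": titles, "creators": creators}
        )
    return records


# Hypothetical response body, written by hand for this example.
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Untitled landscape</dc:title>
          <dc:creator>Evans, Edwin</dc:creator>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>
"""

if __name__ == "__main__":
    print(build_list_records_url("https://example.org/oai", set_spec="artists"))
    print(extract_records(SAMPLE_RESPONSE))
```

A real harvest would fetch the URL, then keep requesting further pages for as long as the response carries a resumptionToken. The sketch leaves that out so it runs without network access.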

The sample data and detailed instructions on the various processes to be covered in the workshop are available at https://github.com/mlhale7/ARLIS19Workshop. Artists represented in the data include Claudia Coberly, Lee Deffebach, Edwin Evans, and many more. The session will support active engagement with Utah-specific data on site while providing individuals with the tools and knowledge to tackle data cleaning projects at their home institutions.
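
As a rough idea of the cleanup involved, the sketch below mirrors in Python a few transformations commonly written in GREL, such as value.trim() and value.split(";"). The messy input strings are invented for illustration and are not taken from the workshop data. The function names are hypothetical.

```python
import re


def clean_value(value):
    """Collapse runs of whitespace and trim both ends of the string."""
    return re.sub(r"\s+", " ", value).strip()


def split_multivalued(value, separator=";"):
    """Split a multi-valued cell, cleaning each part and dropping empty ones."""
    parts = (clean_value(part) for part in value.split(separator))
    return [part for part in parts if part]


def invert_name(name):
    """Turn 'Last, First' into 'First Last'; leave other forms unchanged."""
    parts = [part.strip() for part in name.split(",")]
    if len(parts) == 2 and all(parts):
        return f"{parts[1]} {parts[0]}"
    return name


if __name__ == "__main__":
    # Invented example of a messy creator field.
    raw = "  Coberly, Claudia ;Deffebach,  Lee; "
    names = split_multivalued(raw)
    print(names)
    print([invert_name(name) for name in names])
```

In OpenRefine itself the same steps would normally be applied per column through a transform dialog rather than a script. The Python version is only meant to show the logic.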

Note: Participants are required to bring their own laptop and install OpenRefine before the session. Please review the GitHub repository before the workshop and contact the instructor (mhale16@utk.edu) if you encounter any issues.

Learning Objective
  • Gain familiarity with OpenRefine and other open access tools / resources (like OAI-PMH)

Meredith Hale

Metadata Librarian, University of Tennessee

Tuesday March 26, 2019 8:00am - 12:00pm MDT
Grand America Hotel: Audubon (3rd Floor) 555 Main St, Salt Lake City, UT 84111