Hesburgh Libraries

CANCELED — Software Carpentry: OpenRefine Session 2

Wednesday, March 18, 2020

5:00 pm – 6:00 pm

247 Hesburgh Library, Navari Family Center for Digital Scholarship

In order to protect the health and wellness of our community, this event has been canceled. We will share more information on rescheduling, as appropriate, at a later date.

Overview

This session further explores the features of OpenRefine. We’ll use OpenRefine to apply a saved set of steps to a dataset using the “Extract” and “Apply” features. We’ll work with data types and regular expressions to write more complex transformations in the General Refine Expression Language (GREL). We’ll also use arrays in data transformations and learn how to export data in different formats.

Objectives

  1. Work with columns and sorting to reorder, rename, remove, and sort columns
  2. Understand common transformations
  3. Use transformations to programmatically edit data
  4. Use GREL, the General Refine Expression Language, to write a valid expression
  5. Save and apply a set of steps to a new set of data using the “Extract” and “Apply” features
  6. Transform Strings, Numbers, Dates and Booleans
  7. Use Arrays in data transformation
  8. Export data in different formats from OpenRefine
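
To give a sense of what objectives 6 and 7 involve, here is a minimal sketch of an array-based cell transformation. It is written in Python rather than GREL, so it only illustrates the idea and is not OpenRefine syntax. The function name `normalize_multivalued`, the `|` separator, and the sample cell value are assumptions made for this example and are not taken from the lesson data.

```python
def normalize_multivalued(cell: str, separator: str = "|") -> str:
    """Split a multi-valued cell into an array, clean it, and join it back.

    Hypothetical helper for illustration only; not part of OpenRefine.
    """
    # Split the cell into an array of values and trim stray whitespace.
    parts = [part.strip() for part in cell.split(separator)]
    # Drop empty entries, remove duplicates, and sort alphabetically.
    unique = sorted({part for part in parts if part})
    # Join the cleaned array back into a single cell value.
    return separator.join(unique)


# Made-up sample value, assumed for demonstration.
print(normalize_multivalued("Crystal Structure| Crystal Structure |Chemistry"))
# prints "Chemistry|Crystal Structure"
```

The same pattern (split a string into an array, operate on the array, join the result) is what the session's array exercises build on.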

In this lesson, we continue using OpenRefine to manipulate and enhance datasets. Before attending this workshop, please review episodes 1-5 of the OpenRefine Carpentries curriculum, which we covered in the previous session.

Please bring your own laptop. You will need to install OpenRefine and download the data file doaj-article-sample.csv to follow the lesson in this session.

OpenRefine does not support Internet Explorer or Edge. For this session, please use Firefox, Chrome, or Safari instead. See Setup for more information. If you need help with any difficulties you encounter during setup, please contact cds@nd.edu.


284 Hesburgh Library, Notre Dame, IN 46556

Circulation Desk Phone (574) 631-6679

Security Monitors Phone (574) 631-6350

asklib@nd.edu
