Introducing the Shell and Data Cleaning with OpenRefine

Stockholm Trio university libraries

December 2, 2024

9 am - 4 pm CET

Instructors: Joakim Philipson, Merlijn de Smit

Helpers: Mattias Vesterlund, Stefan Wiens

General Information

This Carpentry workshop is aimed at helping you get started with the Bash shell and with data cleaning in OpenRefine.

Who: The course is for any researcher or Ph.D. student at KI, KTH or SU. You don't need to have any previous knowledge of the tools that will be presented at the workshop.

Where: Växthuset, Stockholm University Library, Universitetsvägen 14D, 114 18 Stockholm.

When: December 2, 2024.

Requirements: Participants must bring a laptop with a Mac, Linux, or Windows operating system (not a tablet, Chromebook, etc.) on which they have administrative privileges, and with a few specific software packages installed (listed below). If you are using a centrally managed laptop issued by the university, please read the comments above the setup instructions below.

Accessibility: We are committed to making this workshop accessible to everybody. For workshops at a physical location, the workshop organizers have checked that:

Materials will be provided in advance of the workshop, and large-print handouts are available if you notify the organizers in advance. If we can help make learning easier for you (e.g. sign-language interpreters, lactation facilities), please get in touch (using the contact details below) and we will attempt to provide them.

Contact: Please email opendata@su.se for more information.

Roles: To learn more about the roles at the workshop (who will be doing what), refer to our Workshop FAQ.

Who can attend: This workshop is open to doctoral students and researchers from the Stockholm Trio universities, i.e. Karolinska Institutet, KTH Royal Institute of Technology, and Stockholm University.


Sign up: Please note that the number of participants is limited to 30, on a first-come, first-served basis. Sign-up opens on November 11 and remains open until December 2 or until all seats are filled. Please click here to sign up for the workshop.

Code of Conduct

Everyone who participates in Carpentries activities is required to conform to the Code of Conduct. This document also outlines how to report an incident if needed.


Collaborative Notes

We will use this collaborative document for chatting, taking notes, and sharing URLs and bits of code.


Surveys

Please be sure to complete these surveys before and after the workshop.

Pre-workshop Survey

Post-workshop Survey


Schedule

Dec. 2, 2024 "Växthuset", Stockholm University Library

Before: Pre-workshop survey
08:30 Optional: help with setup
09:00 Bash Shell-novice
10:30 Morning break
11:00 Bash Shell-novice
12:00 Lunch break
13:00 Data Cleaning with OpenRefine
14:15 Afternoon break
15:00 Data Cleaning with OpenRefine
16:00 END

Setup

To participate in this Carpentry workshop, you will need access to software as described below. In addition, you will need an up-to-date web browser.

We maintain a list of common installation issues on the Configuration Problems and Solutions wiki page. It is intended as a reference for instructors but may also be useful to participants.

Please see below for setup instructions for this workshop.

The Bash Shell

Bash is a commonly used shell that gives you the power to do simple tasks more quickly. For this lesson we will use the Software Carpentry curriculum and, to some extent, the Library Carpentry curriculum. Please find primary setup instructions here.

OpenRefine

OpenRefine is a tool to clean up and organize messy data. For this lesson we will use the Data Carpentry curriculum and, to some extent, the Library Carpentry curriculum. Please find primary setup instructions here.