OpenRefine for Natural History Collection Data: Setup

Data

Download this data file to your computer: https://figshare.com/s/6fe692e2883347b4c15f

About the data

The data for this lesson are part of the Data Carpentry Natural History Collections workshop. The file is a teaching version of a download of rodent data from the iDigBio portal (http://portal.idigbio.org/), prepared for use with OpenRefine in the Natural History Science Collections (NHC) lessons for Data Carpentry. See more information about the NHC datasets for Data Carpentry on figshare.

Dataset description: records of common rodents in the continental United States from major institutions that publish extended information about their specimens. The idigbio_rodents.csv file contains 10,767 rows, a small subset of the data available at http://portal.idigbio.org. The aggregated data have been intentionally ‘messed up’ for this lesson.

The data for this lesson are available on Figshare at https://figshare.com/s/6fe692e2883347b4c15f under a CC-BY license, which permits reuse with attribution.
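
If you would like to confirm that the download completed, one optional check is to count the rows in the file and compare the result with the row count given in the dataset description. The following is a minimal Python sketch, not part of the lesson itself. It assumes the file was saved as idigbio_rodents.csv in the current working directory and that its first line is a header row; the helper name count_data_rows is hypothetical. The description does not say whether its row count includes the header, so a difference of one is not necessarily a problem.

```python
import csv
from pathlib import Path


def count_data_rows(csv_path):
    """Return the number of records in a CSV file, excluding the first (header) row."""
    # errors="replace" keeps the count going even if some bytes are not valid UTF-8.
    with open(csv_path, newline="", encoding="utf-8", errors="replace") as handle:
        reader = csv.reader(handle)
        next(reader, None)  # skip the header row (assumed to be present)
        return sum(1 for _ in reader)


if __name__ == "__main__":
    # Assumed location: adjust the path to wherever you saved the download.
    path = Path("idigbio_rodents.csv")
    if path.exists():
        print(f"{path.name}: {count_data_rows(path)} data rows")
    else:
        print(f"{path} not found; check the download location")
```

The csv module is used instead of counting newline characters because a quoted field may itself contain line breaks, which would inflate a simple line count.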

Software

For this lesson you will need OpenRefine (formerly Google Refine) and a web browser.

Note: OpenRefine is a Java program that runs on your own machine, not in the cloud. You interact with it through your web browser, but no internet connection is needed.

Windows

Mac

Linux