Thing 6: Long-lived data: curation & preservation

Overview

Teaching: 0 min
Exercises: 0 min
Questions
  • What is data curation?

Objectives
  • Getting started: how would you advise someone what to do to make sure their fragile born digital data is robust and long lived?

  • Learn more: how does archiving, preserving and curating data ‘Stack’ up?

  • Challenge me: what’s in a (PRO)NOM?

Getting started: The vulnerability of digital data

Traditional information sources such as books, photos and sculptures can easily survive for years, decades or even centuries but digital items require special care to keep them usable over time.

Digital Preservation

Watch the below 2.5 minute video from the US Library of Congress which shows the vulnerability of “born digital” objects like research data: they are fragile; they are dependent on software and hardware; and they require active management.

Why Digital Preservation is Important for Everyone (YouTube)

What are some of the challenges with preserving digital assets?

Delve deeper

  • Start with at the ANDS page on data preservation
  • See the Library of Congress Digital Preservation website
  • What key advice would you give someone about preserving their born digital objects, e.g. the family historian, a researcher, yourself?

Consider: What key advice would you give someone about preserving their born digital objects eg the family historian, a researcher, yourself

Learn more: What defines long-lived data?

‘Curation’, ‘preservation’, ‘archiving’ … are all commonly used data management terms. Are they all the same thing?

Stack Model

  1. Watch the below 5.54 min video in which Sayeed Choudhury, Associate Dean for Research Data Management at Johns Hopkins University introduces the Stack Model for data management and discusses the model’s components—storage, archiving, preservation, and curation.

    Data Conservancy Stack Model for Data Management (YouTube)

  2. what do you think about the Stack Model and its relevance for data repositories?

Challenge me: File formats for the future

Data managers often refer to ‘long-lived data formats’, ‘open file formats’ and ‘format migration’. The UK National Archives has made available tools and services that can assist with identifying and managing file formats. For this activity we will look at two tools accessible via the National Archives in the UK: PRONOM and DROID.

Long-lived data formats

  1. Start by searching PRONOM to learn more about a particular file format you commonly work with. Hint: Read the PRONOM Help file to find out about the search options available.
  2. Now take a look at DROID and read how to use DROID to profile file formats. If you have time: download the current version of DROID and try using it to profile a small number of files.

PRONOM and DROID

Are PRONOM and DROID tools you’d like to explore further? {: .discussion}

Key Points

  • First key point.