Thing 22: What's in a name?

Overview

Teaching: min
Exercises: min
Questions
  • Who are the key players in Australia’s research data management ecosystem?

Objectives
  • Learn about the key players in Australia’s research data management ecosystem.

Who’s who in the acronym soup?

Data is at the heart of the Australian national innovation agenda. The key players who help enable the innovation agenda float in an acronym soup! Let’s find out who’s in the soup….

Read this article that explains the Australian Government initiatives to foster innovation through publishing and sharing data.

Put (very!) simply the main players in the Australian research landscape (a.k.a. research alphabet soup…) are:

Let’s focus on NCRIS - it’s amazing. NCRIS is designed to take a national approach to providing the world’s best research infrastructure for Australia. NCRIS facilities provide storage for data (RDS), research computer networking across Australia (AARNET), tools and virtual labs for researchers (NeCTAR), very, very big data crunching (NCI), as well as lots of specialised research facilities. ANDS, who is putting together these 23 (rd) Things, is one of the 27 NCRIS facilities.

Exlpore NCRIS

  1. Browse over some NCRIS case studies to get an idea of what data and activities are produced by NCRIS facilities.
  2. Just for fun:
    • Fancy some Snakes and Ladders? Find out how some of the Australian infrastructure players fit together: Print the game and start playing!
    • Play the Australian Research Data Acronym quiz and get your acronym literacy score! Created by the ORCID group for a bit of fun.

For NZ: What’s happening in NZ?

Big data: the sky is not the limit!

We often hear terms such as “big data” and “data deluge”. And it doesn’t get much bigger than astronomy and satellite data! Let’s look at 2 big data projects that are only possible because of national collaborations - most of whom love acronyms!

Option 1: ASKAP - reaching for the stars

The Australian Square Kilometre Array Pathfinder (ASKAP) telescope is made of 36 identical 12-metre wide dish antennas. These produce some 2.5 gigabytes of data per second, equivalent to 75 petabytes per year!

  1. Learn about managing this big data project by hearing from the researchers who use ASKAP - watch this stunning 3 min video.

Storing, analysing, managing and publishing such big data usually requires a collaborative effort across a number of organisations. In the case of ASKAP data for example:

  • the data is captured at the Murchison Radio-astronomy Observatory (MRO), approximately 315 km northeast of Geraldton in WA.
  • It is then transmitted 730 km to the Pawsey Supercomputing Centre in Perth where it is processed and stored.
  • Processed data is then published via the CSIRO DAP (Data Access Portal) with metadata records harvested to RDA (Research Data Australia) to enhance discovery.

It’s hard to imagine just how many organisations work together to make this apparently simple workflow possible. Consider for example:

  • funding bodies (eg NCRIS, federal and state governments)
  • research organisations (eg CSIRO, Swinburne University, University of WA)
  • infrastructure providers (AARNET, Pawsey Supercomputing Centre, ANDS)

Learn more about the ASKAP telescopes.

Option 2: the Data Cube - satellites mapping the earth

The Australian Geoscience Data Cube is another example where collaboration between a number of organisations has resulted in news ways of organising, analysing and managing the large amounts of data collected from Earth Observation Satellites (EOS).

  1. This wonderful 4.35min video explains the concept behind Geoscience Australia’s Data Cube and the partners who made it possible. By collaborating, Australian organisations including GA (Geoscience Australia), CSIRO and the NCI (National Computational Infrastructure) have created an outcome that any one of these organisations in isolation could never have achieved.
  2. Learn more about the Australian Geoscience Data Cube

Consider: why big data is often publicly available, yet the so called ‘long tail of research data’ (smaller data sets) are often not published.

National infrastructure supporting Australian research

Option 1: Virtual Laboratories

Explore one or more of the NeCTAR Virtual Laboratories - who uses these labs and what do they do?

  • Explore in words
  • Explore in videos (4 mins max)
  • Pick one or two and explore in depth.

Option 2: AURIN (Australian Urban Research Infrastructure Network)

AURIN is a powerful mapping tool with a wide range of inbuilt datasets. Watch this 3.44min “how to use AURIN” video to see how it can be used to build national or small maps, particularly for social science and health.

Consider: how researchers can use NCRIS facilities to help generate, manage, share and publish their data.

Key Points

  • First key point.