Learning Objectives

This week, students will be able to:

  • list the elements of the data life cycle
  • articulate the relevance of good data management for scientific research
  • identify the differences between good and bad data entry and management
  • recognize bad data organization and why it is problematic for research
  • implement quality assurance and control measures for data entry in spreadsheets using excel
  • list current measures used by the scientific community in ecology and evolution to preserve data long term

Day 1

Welcome!

Logistics survey:

Syllabus overview

Schedule overview

The schedule shows a list of topics that will be covered each week of the course.

The schedule reflects a flipped course structure, aka minimal lecture and maximal student practice. Each row corresponds to a different topic, and organizes links for pre-class activities, in-class activities and homework.

The importance of data for research: Exercises

  1. Data is used in all areas of human activity. Tim Stobierski writes about business analytics and the importance of knowing the data life cycle.
    • What is the importance of the data life cycle?
    • How is it related to research?
  2. DataONE is an organization dedicated to making earth data universally accessible and FAIR (Findable, Accessible, Interoperable, and Reusable).
    • Take 10 min to read The DataONE Life Cycle. If the link is not available, a screen shot of the text is available here.
    • Answer the following questions interactively on this mentimeter:
    • How many steps are there in the DataOne life cycle?
    • Identify steps that are different between the previous and this data life cycle.
  3. Look at Figure 1 from research paper Best practice data life cycle approaches for the life sciences.
    • Take 5 min to identify steps that are similar across the three data life cycle approaches you have reviewed so far. Write the steps that you consider similar on this mentimeter.

In class group activity

Homework

A minute feedback



Day 2

Reading discussion

Group exercise

Tidy Data Principles

Steps for data entry

Best practices for data collection

A minute feedback

Homework