TidyTuesday
    • About TidyTuesday
    • Datasets
      • 2025
      • 2024
      • 2023
      • 2022
      • 2021
      • 2020
      • 2019
      • 2018
    • Useful links

    On this page

    • The Global Human Day
      • The Data
      • How to Participate
        • Data Dictionary
    • all_countries.csv
    • country_regions.csv
    • global_human_day.csv
    • global_economic_activity.csv
      • Cleaning Script

    The Global Human Day

    The data this week comes from the The Human Chronome Project an initiative based at McGill University in Montreal, from their paper The global human day in PNAS and the associated dataset on Zenodo.

    The daily activities of ≈8 billion people occupy exactly 24 h per day, placing a strict physical limit on what changes can be achieved in the world. These activities form the basis of human behavior, and because of the global integration of societies and economies, many of these activities interact across national borders. This project estimates how all humans spend their time using a generalized, physical outcome–based categorization that facilitates the integration of data from hundreds of diverse datasets.

    See their supplementary materials for details about their methods and additional visualizations.

    The Zenodo dataset includes the input data and scripts used to create the datasets used in the paper. The datasets are from the outputData file “all_countries.csv”, “global_human_day.csv”, “global_economic_activity.csv” and inputData “country_regions.csv”. The outputData files are aggregated output data from data collected, created from the scripts in the ‘scripts’ directory.

    h/t Data is Plural 2023-07-13 newsletter for the dataset.

    The Data

    # Option 1: tidytuesdayR package 
    ## install.packages("tidytuesdayR")
    
    tuesdata <- tidytuesdayR::tt_load('2023-09-12')
    ## OR
    tuesdata <- tidytuesdayR::tt_load(2023, week = 37)
    
    all_countries <- tuesdata$all_countries
    country_regions <- tuesdata$country_regions
    global_human_day <- tuesdata$global_human_day
    global_economic_activity <- tuesdata$global_economic_activity
    
    # Option 2: Read directly from GitHub
    
    all_countries <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2023/2023-09-12/all_countries.csv')
    country_regions <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2023/2023-09-12/country_regions.csv')
    global_human_day <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2023/2023-09-12/global_human_day.csv')
    global_economic_activity <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2023/2023-09-12/global_economic_activity.csv')

    How to Participate

    • Explore the data, watching out for interesting relationships. We would like to emphasize that you should not draw conclusions about causation in the data. There are various moderating variables that affect all data, many of which might not have been captured in these datasets. As such, our suggestion is to use the data provided to practice your data tidying and plotting techniques, and to consider for yourself what nuances might underlie these relationships.
    • Create a visualization, a model, a shiny app, or some other piece of data-science-related output, using R or another programming language.
    • Share your output and the code used to generate it on social media with the #TidyTuesday hashtag.

    Data Dictionary

    all_countries.csv

    variable class description
    Category character M24 categories
    Subcategory character M24 subcategories
    country_iso3 character Country code in iso3
    region_code character Region code
    population double Population
    hoursPerDayCombined double Hours per day combined for the country
    uncertaintyCombined double Uncertainty combined. Uncertainty is in units variance.

    country_regions.csv

    variable class description
    region_code character Region code
    region_name character Region name
    country_name character Country name
    M49_code double M49 code
    country_iso2 character Country code in iso2
    country_iso3 character Country code in iso3
    alt_country_name character Alternative country name
    alt_country_name1 character Alternative country name 1
    alt_country_name2 character Alternative country name 2
    alt_country_name3 character Alternative country name 3
    alt_country_name4 character Alternative country name 4
    alt_country_name5 character Alternative country name 5
    alt_country_name6 character Alternative country name 6
    other_code1 character Other country code 1
    other_code2 character Other country code 2

    global_human_day.csv

    variable class description
    Subcategory character M24 subcategory
    hoursPerDay double Hours per day for all countries
    uncertainty double Uncertainty in units variance.

    global_economic_activity.csv

    variable class description
    Subcategory character M24 subcategory
    hoursPerDay double Hours per day for all countries.
    uncertainty double Uncertainty in units variance.

    Cleaning Script

    library(tidyverse)
    
    # Read in the data file all_countries.csv from https://zenodo.org/record/8040631
    
    all_countries <- read_csv("all_countries.csv")
    
    # Change variable name to be consistent between files
    
    colnames(all_countries)[3] <- "country_iso3"
    
    # Remove columns on data status
    
    all_countries = subset(all_countries, select = -c(dataStatus,dataStatusEconomic))
    
    # write out data
    readr::write_csv(
      all_countries, "all_countries.csv")