TidyTuesday
    • About TidyTuesday
    • Datasets
      • 2025
      • 2024
      • 2023
      • 2022
      • 2021
      • 2020
      • 2019
      • 2018
    • Useful links

    On this page

    • Lisa’s Vegetable Garden Data
      • The Data
      • How to Participate
        • Data Dictionary
    • planting_2020.csv
    • planting_2021.csv
    • harvest_2020.csv
    • harvest_2021.csv
    • spending_2020.csv
    • spending_2021.csv
      • Cleaning Script

    Lisa’s Vegetable Garden Data

    We’re exploring Lisa Lendway’s vegetable garden from summer 2020 and summer 2021, from her {gardenR} package. Thank you to Lisa for suggesting this dataset back in 2021!

    The gardenR package contains data collected by Lisa Lendway from her vegetable garden, starting in the summer of 2020. Data from the summer of 2021 was added 2022-01-29 (finally!). The data were used in her Introduction to Data Science course at Macalester College to introduce many concepts. For examples, see the tutorials for the course.

    Lisa also has a YouTube video with a visual tour of the garden.

    What changed between 2020 and 2021?

    The Data

    # Option 1: tidytuesdayR package 
    ## install.packages("tidytuesdayR")
    
    tuesdata <- tidytuesdayR::tt_load('2024-05-28')
    ## OR
    tuesdata <- tidytuesdayR::tt_load(2024, week = 22)
    
    harvest_2020 <- tuesdata$harvest_2020
    harvest_2021 <- tuesdata$harvest_2021
    planting_2020 <- tuesdata$planting_2020
    planting_2021 <- tuesdata$planting_2021
    spending_2020 <- tuesdata$spending_2020
    spending_2021 <- tuesdata$spending_2021
    
    
    # Option 2: Read directly from GitHub
    
    harvest_2020 <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2024/2024-05-28/harvest_2020.csv')
    harvest_2021 <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2024/2024-05-28/harvest_2021.csv')
    planting_2020 <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2024/2024-05-28/planting_2020.csv')
    planting_2021 <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2024/2024-05-28/planting_2021.csv')
    spending_2020 <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2024/2024-05-28/spending_2020.csv')
    spending_2021 <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2024/2024-05-28/spending_2021.csv')

    How to Participate

    • Explore the data, watching out for interesting relationships. We would like to emphasize that you should not draw conclusions about causation in the data. There are various moderating variables that affect all data, many of which might not have been captured in these datasets. As such, our suggestion is to use the data provided to practice your data tidying and plotting techniques, and to consider for yourself what nuances might underlie these relationships.
    • Create a visualization, a model, a shiny app, or some other piece of data-science-related output, using R or another programming language.
    • Share your output and the code used to generate it on social media with the #TidyTuesday hashtag.

    Data Dictionary

    planting_2020.csv

    variable class description
    plot character label of plot where vegetables were planted - see garden_coords dataset (in the package) for more info
    vegetable character type of vegetable planted
    variety character variety of vegetable planted, usually according to the information on the seed package, but sometimes inferred by Lisa and sometimes unknown or reseeded from last year’s plants
    number_seeds_planted numeric the exact number or a guess at how many seeds/plants were planted
    date Date date the vegetable was planted
    number_seeds_exact logical if number_seeds_planted is exact, this is TRUE; if it was a guess, then this is FALSE
    notes character mostly missing but intially created just in case

    planting_2021.csv

    variable class description
    date Date date the vegetable was planted
    vegetable character type of vegetable planted
    variety character variety of vegetable planted, usually according to the information on the seed package, but sometimes inferred by Lisa and sometimes unknown or reseeded from last year’s plants
    plot character label of plot where vegetables were planted - see garden_coords dataset (in the package) for more info
    number_seeds_planted numeric the exact number or a guess at how many seeds/plants were planted
    number_seeds_exact logical if number_seeds_planted is exact, this is TRUE; if it was a guess, then this is FALSE
    notes character mostly missing but intially created just in case

    harvest_2020.csv

    variable class description
    vegetable character type of vegetable planted
    variety character variety of vegetable planted, usually according to the information on the seed package, but sometimes inferred by Lisa and sometimes unknown or reseeded from last year’s plants
    date Date date the vegetable was harvested
    weight numeric weight harvested in grams
    units character all “grams” - this variable was originally created in case larger vegetables were weighed in other units but there was no need

    harvest_2021.csv

    variable class description
    vegetable character type of vegetable planted
    variety character variety of vegetable planted, usually according to the information on the seed package, but sometimes inferred by Lisa and sometimes unknown or reseeded from last year’s plants
    date Date date the vegetable was harvested
    weight numeric weight harvested in grams
    units character all “grams” - this variable was originally created in case larger vegetables were weighed in other units but there was no need

    spending_2020.csv

    variable class description
    vegetable character type of vegetable planted
    variety character variety of vegetable planted, usually according to the information on the seed package, but sometimes inferred by Lisa and sometimes unknown or reseeded from last year’s plants
    brand character brand of seed or plant
    eggplant_item_number numeric item number of item at Eggplant, the store where most of the seeds were purchased
    price numeric raw price - no taxes
    price_with_tax numeric price with taxes

    spending_2021.csv

    variable class description
    vegetable character type of vegetable planted
    variety character variety of vegetable planted, usually according to the information on the seed package, but sometimes inferred by Lisa and sometimes unknown or reseeded from last year’s plants
    brand character brand of seed or plant
    price numeric raw price - no taxes
    price_with_tax numeric price with taxes

    Cleaning Script

    No data cleaning. Datasets are from the {gardenR} package.