TidyTuesday
    • About TidyTuesday
    • Datasets
      • 2025
      • 2024
      • 2023
      • 2022
      • 2021
      • 2020
      • 2019
      • 2018
    • Useful links

    On this page

    • Kenya Census
      • Get the data here
      • Data Dictionary
    • gender.csv
    • crops.csv
    • households.csv
    • Data Catalogue
      • Cleaning Script

    Kenya Census

    The data this week comes from rKenyaCensus courtesy of Shelmith Kariuki. Shelmith wrote about these datasets on her blog.

    rKenyaCensus is an R package that contains the 2019 Kenya Population and Housing Census results. The results were released by the Kenya National Bureau of Statistics in February 2020, and published in four different pdf files (Volume 1 - Volume 4).

    The 2019 Kenya Population and Housing Census was the eighth to be conducted in Kenya since 1948 and was conducted from the night of 24th/25th to 31st August 2019. Kenya leveraged on technology to capture data during cartographic mapping, enumeration and data transmission, making the 2019 Census the first paperless census to be conducted in Kenya

    Additional details about Kenya can be found on Wikipedia.

    Kenya, officially the Republic of Kenya (Swahili: Jamhuri ya Kenya), is a country in Eastern Africa. At 580,367 square kilometres (224,081 sq mi), Kenya is the world’s 48th largest country by total area. With a population of more than 47.6 million people in the 2019 census, Kenya is the 29th most populous country. Kenya’s capital and largest city is Nairobi, while its oldest city and first capital is the coastal city of Mombasa.

    To access ALL the data, install the package from Github:

    (if needed, install.packages("remotes"))

    remotes::install_github("Shelmith-Kariuki/rKenyaCensus")

    She’s also put together a Shiny app to download various datasets!

    I’ve taken 3 indicators below just as a teaser, but go ahead and try the package for ALL the data!

    Get the data here

    # Get the Data
    
    # Read in with tidytuesdayR package 
    # Install from CRAN via: install.packages("tidytuesdayR")
    # This loads the readme and all the datasets for the week of interest
    
    # Either ISO-8601 date or year/week works!
    
    tuesdata <- tidytuesdayR::tt_load('2021-01-19')
    tuesdata <- tidytuesdayR::tt_load(2021, week = 4)
    
    gender <- tuesdata$gender
    
    # Or read in the data manually
    
    gender <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2021/2021-01-19/gender.csv')
    crops <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2021/2021-01-19/crops.csv')
    households <- readr::read_csv('https://raw.githubusercontent.com/rfordatascience/tidytuesday/main/data/2021/2021-01-19/households.csv')

    Data Dictionary

    gender.csv

    variable class description
    County character County Name
    Male double Population of Male identifying
    Female double Population of Female identifying
    Intersex double Population of Interesex identifying
    Total double Total population

    crops.csv

    Distribution of Households Growing Permanent Crops by Type and County.

    variable class description
    SubCounty character SubCounty name
    Farming double Population growing farming crops
    Tea double Population growing farming tea
    Coffee double Population growing farming coffee
    Avocado double Population growing farming avocado
    Citrus double Population growing farming citrus
    Mango double Population growing farming Mango
    Coconut double Population growing farming Coconut
    Macadamia double Population growing farming Macadamia
    Cashew Nut double Population growing farming cashew nut
    Khat (Miraa) double Population growing farming Khat (Miraa)

    households.csv

    variable class description
    County character County Name
    Population double Population
    NumberOfHouseholds double Number of households
    AverageHouseholdSize double Average houshold size

    Data Catalogue

    Full catalogue from the package about all the available datasets!

    Dataset Volume Table.No.in.PDFs Dataset.Description
    DataCatalogue Shows the table number for each of the datasets
    CountyGPS Shows the County GPS centroids
    KenyaCounties_SHP Shapefiles of Kenya County boundaries
    V1_T2.1 V1 Table 2. 1 Census Indicators at a Glance, 2019
    V1_T2.2 V1 Table 2. 2 Distribution of Population by Sex and County
    V1_T2.3 V1 Table 2. 3 Distribution of Population, Number of Households and Average
    V1_T2.4 V1 Table 2. 4 Distribution of Population, Land Area and Population Density by County
    V1_T2.5 V1 Table 2. 5 Distribution of Population by Sex and Sub-County
    V1_T2.6 V1 Table 2. 6 Distribution of Population, Number of Households and Average Household Size by Sub- County
    V1_T2.7 V1 Table 2. 7 Distribution of Population by Land Area and Population Density by Sub-County
    V2_T1.1 V2 Table 1.1 Summary of Census Counts in Kenya
    V2_T1.2 V2 Table 1.2 List of Counties and Sub-Counties
    V2_T2.1 V2 Table 2.1 Sub-locations with no People on the Census Night by Status/Reason
    V2_T2.2 V2 Table 2.2 Distribution of Population by Sex, Number of Households, Land Area, Population Density and County
    V2_T2.2a V2 Table 2.2a Distribution of Rural Population by Sex, Number of Households, Land Area, Population Density and County
    V2_T2.2b V2 Table 2.2b Distribution of Urban Population by Sex, Number of Households, Land Area, Population Density and County
    V2_T2.3 V2 Table 2.3 Distribution of Population by Sex, Number of Households, Land Area, Population Density and Sub County
    V2_T2.5 V2 Table 2.5 Distribution of Population by Urban Centres, Sex and County
    V3_T1.1 V3 Table 1.1 Summary of Census Counts in Kenya
    V3_T1.2 V3 Table 1.2 List of Counties and Sub-Counties
    V3_T2.1 V3 Table 2.1 Sub-locations with no People on the Census Night by Status/Reason
    V3_T2.2 V3 Table 2.2 Distribution of Population by Age and Sex, Kenya
    V3_T2.2a V3 Table 2.2a Distribution of Rural Population by Age and Sex, Kenya
    V3_T2.2b V3 Table 2.2b Distribution of Urban Population by Age and Sex, Kenya
    V3_T2.3 V3 Table 2.3 Distribution of Population by Age, Sex, County and Sub- County
    V3_T2.4a V3 Table 2.4a Distribution of Rural Population by Age, Sex and County
    V3_T2.4b V3 Table 2.4b Distribution of Urban Population by Age, Sex and County
    V4_T1.1 V4 Table 1. 1 Summary of Census Counts in Kenya.
    V4_T1.9 V4 Table 1.9 List of Counties and Sub-Counties
    V4_T2.2 V4 Table2.2 Distribution of Population Aged 3 Years and Above by School Attendance Status, Area of Residence, Sex, County and SubCounty.
    V4_T2.3 V4 Table 2.3 Distribution of Population Aged 3 Years and Above Currently Attending School/ Learning Institution by Education Level, Area of Residence, Sex, County and Sub-County
    V4_T2.4 V4 Table 2.4 Distribution of Population Aged 3 Years and Above by Highest Level of Education Reached, Area of Residence, Sex, County and Sub-County..
    V4_T2.5 V4 Table 2.5 Distribution of Population Aged 3 Years and Above by Highest Level of Education Completed, Area of Residence, Sex, County and Sub-County..
    V4_T2.6a V4 Table 2.6a Distribution of Population Aged 3 Years and Above by School Attendance Status, Sex and Special Age Groups
    V4_T2.6b V4 Table 2.6b Distribution of Population Aged 3 Years and Above by School Attendance Status, Sex, Special Age Groups and County..
    V4_T2.7 V4 Table 2.7 Distribution of Population Aged 15 years and Above by Sex and Main Training Acquired and Qualified for
    V4_T2.8a V4 Table 2.8a Distribution of Population Aged 5 Years and above by Activity Status, Sex, County and Sub-County
    V4_T2.8b V4 Table 2.8b Distribution of Urban Population Aged 5 Years and above by Activity Status, Sex, County and Sub-County…..
    V4_T2.8c V4 Table 2.8c Distribution of Rural Population Aged 5 Years and above by Activity Status, Sex, County and Sub-County..
    V4_T2.9a V4 Table 2.9a Distribution of Population Aged 5 years and above by Activity Status, Broad Age Groups and County..
    V4_T2.9b V4 Table 2.9b Distribution of Rural Population Aged 5 years and above by Activity Status, Broad Age Groups and County…….
    V4_T2.9c V4 Table 2.9c Distribution of Urban Population Aged 5 years and above by Activity Status, Broad Age Groups and County…….
    V4_T2.10 V4 Table 2.10 Distribution of Households and Tenure Status of Main Dwelling Unit by Area of Residence, County and Sub-County
    V4_T2.11a V4 Table 2.11a Distribution of Households Owning the Main Dwelling Unit by Mode of Acquisition, Area of Residence, County and Sub-County
    V4_T2.11b V4 Table 2.11b Distribution of Households Renting/Provided with their Main Tenure Status of Main Dwelling Unit by Provider, Area of Residence, County and Sub- County..
    V4_T2.12 V4 Table 2.12 Percentage Distribution of Conventional Households by Dominant Roofing Material of Main Dwelling Unit, Area of Residence, County and Sub-County…..
    V4_T2.13 V4 Table 2.13 Percentage Distribution of Conventional Households by Dominant Wall Material of Main Dwelling Unit, Area of Residence, County and Sub-County…….
    V4_T2.14 V4 Table 2.14 Percentage Distribution of Conventional Households by Dominant Floor Material of the Main Dwelling Unit, Area of Residence, County and Sub County……
    V4_T2.15 V4 Table 2.15 Percentage Distribution of Conventional Households by Main Source of Drinking Water, Area of Residence, County and Sub-County..
    V4_T2.16 V4 Table 2.16 Percentage Distribution of Conventional Households by Main Mode of Human Waste Disposal, Area of Residence, County and Sub-County..
    V4_T2.17 V4 Table 2.17 Percentage Distribution of Conventional Households by Main Mode of Solid Waste Disposal, Area of Residence, County and Sub-County..
    V4_T2.18 V4 Table 2.18 Percentage Distribution of Conventional Households by Main Type of Cooking Fuel, Area of Residence, County and Sub-County….
    V4_T2.19 V4 Table 2.19 Percentage Distribution of Conventional Households by Main Type of Lighting Fuel, Area of Residence, County and Sub-County….
    V4_T2.20 V4 Table 2.20 Distribution of households practicing Agriculture, Fishing and Irrigation by County and Sub County.
    V4_T2.21 V4 Table 2.21 Distribution of Households Growing Permanent Crops by Type and County.
    V4_T2.22 V4 Table 2.22 Distribution of Households Growing Other Crops by Type, County and Sub County
    V4_T2.23 V4 Table 2.23 Distribution of Households Rearing Livestock and Fish by County and Sub County.
    V4_T2.24 V4 Table 2.24 Distribution of Livestock population by type, Fish Ponds and Fish Cages by County and Sub County..
    V4_T2.25 V4 Table 2.25 Distribution of area (hectares) of Agricultural land and Farming Households by purpose of production, County and Sub-County..
    V4_T2.26 V4 Table 2.26 Distribution of Population aged 5 years and above by Disability Status, Sex1, Area of Residence, County and Sub-County….
    V4_T2.27 V4 Table 2.27 Distribution of Persons with Disability by Type of Disability, Sex1, Area of Residence, County and Sub County…
    V4_T2.28 V4 Table 2.28 Distribution of Persons with Albinism by Sex1, Area of Residence, County and Sub County..
    V4_T2.29 V4 Table 2.29 Population of Street Persons/Outdoor Sleepers by Sex1, Area of Residence and County.
    V4_T2.30 V4 Table 2.30 Distribution of Population by Religious Affiliation and County
    V4_T2.31 V4 Table 2.31 Distribution of Population by Ethnicity/Nationality
    V4_T2.32 V4 Table 2.32 Distribution of Population Age 3 years and Above Owning a Mobile Phone by Area of Residence, Sex, County and Sub County
    V4_T2.33 V4 Table 2.33 Distribution of Population Age 3 Years and Above Using Internet and Computer/Laptop/Tablet by Area of Residence, Sex, County and Sub-County…
    V4_T2.34 V4 Table 2.34 Distribution of Population age 15 years and above who Searched and Bought Goods and Services Online by Area of Residence, Sex, County and Sub-County.
    V4_T2.35 V4 Table 2.35 Distribution of Population Age 3 years and Above who owned and used Selected ICT Equipment and Service by Age, Area of Residence and County..
    V4_T2.36 V4 Table 2.36 Percentage Distribution of Conventional Households by Ownership of Selected Household Assets by Area of Residence, County and Sub County…
    V4_T2.37 V4 Table 2.37 Births in the Last 12 months* by place of Occurrence and County…
    V4_T2.38 V4 Table 2.38 Births in the Last 5 Years* by place of Occurrence and County
    V4_T2.39 V4 Table 2.39 Notified Births in the Last 12 months by County
    V4_T2.40 V4 Table 2.40 Notified Births in the Last 5 Years by County

    Cleaning Script

    No cleaning script today, enjoy the data!

    This is how I got the three datasets for today.

    library(rKenyaCensus)
    library(tidyverse)
    
    # create a markdown table for the readme
    rKenyaCensus::DataCatalogue %>% 
      knitr::kable()
    
    # grab 3 tables of interest
    crops <- rKenyaCensus::V4_T2.21
    gender <- rKenyaCensus::V1_T2.2
    households <- rKenyaCensus::V1_T2.3
    
    # write them out
    households %>% 
      write_csv("2021/2021-01-19/households.csv")
    
    gender %>% 
      write_csv("2021/2021-01-19/gender.csv")
    
    crops %>% 
      write_csv("2021/2021-01-19/crops.csv")