A Journey from Wild to Textbook Data to Reproducibly Refresh the Wages Data from the National Longitudinal Survey of Youth Database

Citation:
Amaliah D, Cook D, Tanaka E, Hyde K, Tierney N. A Journey from Wild to Textbook Data to Reproducibly Refresh the Wages Data from the National Longitudinal Survey of Youth Database. J Stat Data Sci Educ. 2022;30(3):289-303

Keywords:
Data cleaning; Data tidying; Initial data analysis; Longitudinal data; NLSY79; Reproducible workflow

Abstract:
Textbook data is essential for teaching statistics and data science methods because it is clean, allowing the instructor to focus on methodology. Ideally textbook datasets are refreshed regularly, especially when they are subsets taken from an ongoing data collection.