Data cleaning is an incredibly important part of the data science process. This course introduces a customizable data cleaning pipeline, focusing on biomedical data from electronic health records and health survey data. In this course, you will learn how to summarize the data collection process and data dictionaries, identify and address missing values using R, handle data quality issues (such as invalid and inconsistent values) using R, reshape your data into a “tidy†format using R, and create a custom, reproducible data cleaning function in R.
This noncredit class meets with a regular University of Utah credit course. Students can purchase any assigned hardcopy textbooks through our Campus Bookstore.
Questions? Call Academic Programs at 801-585-9963 or use our online form.
| Date(s) | Day | Time | Location |
|---|---|---|---|
| 10/19/26 - 12/10/26 | Online |
Instructor: TINGYING HE
Questions? Call Academic Programs at 801-585-9963 or use our online form.