Biotech and Data Science
Professional Education

These courses provide skills that are necessary for data science roles within Biotechnology: computing, handling data, and statistics. The primary courses are taught in the R programming language, through the modern tidyverse lens, or python which provides the easiest onramp to powerful data science techniques, and is widely used in the industry. The courses assume no prior programming experience, and quickly provide training in hands-on and tangible skills. Further courses unlock more advanced data science skills and methodologies essential for fully-rounded fluency in data science.

Biotech and data science classes are not eligible for employee tuition reduction.

You may sign up for our email list to be notified when applications open.

Sign up for email list


Introduction to R for Data Analysis

This course was created for individuals with data processing experience who want to perform more advanced work, and get into programming with R and the tidyverse. No prior experience required! We’ll guide you step by step in this affordable introduction to R for Data Analysis. Go at your own pace, anytime, anywhere. No rigid schedules!You’re never alone—our instructors are here to help.

Coming soon


Advanced R for Data Analysis

This course is designed to take your R programming expertise to the next level, building on the foundations laid in the Introduction to R for Data Analysis course. Dive into advanced techniques and powerful tools that will transform your data workflows and analytical capabilities. In this course, you will learn how to write your own custom R functions, reduce redundancy in your code by learning iteration techniques with the purrr package, refactor column variables, reshape your data frames, and join multiple data frames together.

Coming soon


Data Cleaning with R

Data cleaning is an incredibly important part of the data science process. This course introduces a customizable data cleaning pipeline, focusing on biomedical data from electronic health records and health survey data. In this course, you will learn how to summarize the data collection process and data dictionaries, identify and address missing values using R, handle data quality issues (such as invalid and inconsistent values) using R, reshape your data into a “tidy” format using R, and create a custom, reproducible data cleaning function in R

Coming soon


Statistics for Data Analysis

In this course, you’ll learn to apply statistical methods to real data problems, transforming how you analyze and interpret data. Enhance your ability to make data-driven decisions by implementing classical and modern statistical techniques, including hypothesis testing, confidence intervals, and linear regression. Bridge the gap between theoretical and practical applications of statistics in R, gaining hands-on experience. Refine your statistical intuition.

Coming soon


Introduction to Python for Data Analysis

No prior coding experience required! We’ll guide you step by step in this affordable introduction to Python. Learn how to analyze real-world data using the powerful pandas library. Go at your own pace, anytime, anywhere. You’re never alone—our instructors are here to help.

Coming soon


Data Science Ethics

Data science drives innovation, but ethics are crucial to ensure fairness, transparency, and accountability. This course explores the ethical challenges around data collection, sharing, and analysis, helping you navigate real-world problems with integrity. Learn to understand ethical data collection and share personal data responsibly, including obtaining informed consent and safeguarding privacy. Evaluate automated decision-making systems' fairness and impact on diverse groups. Learn best practices for protecting sensitive data from unauthorized access while balancing the need for insights.Develop practical tools by creating checklists and frameworks to guide ethical decision-making in data collection, sharing, and algorithm development.

Coming soon


Machine Learning with Python

Making accurate predictions is a powerful skill when working with real-world data. This course teaches you how to frame research questions as prediction problems and apply foundational machine learning techniques. We'll explore a range of algorithms, including linear and logistic regression, decision trees, random forests, XGBoost, and simple neural networks. By the end of the course, you'll be equipped to build, evaluate, and interpret predictive models, empowering you to uncover patterns in data and make informed, data-driven decisions. Learn skills such as Introduction to Machine Learning, Least Squares for Continuous Response Prediction, Logistic Regression for Binary Responses, Decision Tree Algorithms, and Neural Networks.

Coming soon


Unsupervised Learning with Python

Unsupervised learning and data mining is about identifying the native structure in data through clustering, dimensionality reduction, and nearest neighbor search. Unlike supervised machine learning, unsupervised learning methods cannot rely on labels to aid in predictions. Nevertheless these methods are an essential part of a data scientist's took box. Learners will discover various touchstone methods within each category of technique, including both linear and non-linear methods for dimensionality reduction, and center-based (e.g., k-means) and density-based clusters. They will be able to identify pros and cons for each option to decide which is best for their scenario. Python will be used the programming language, and learners will gain familiarity with standard libraries that make applying these methods simple. It is recommended that students have some background in Python, such as completing "Introduction to Python for Data Analysis."

Coming soon


Spring 2026 Badges

What is a badge?
A badge is a digital symbol that verifies a learning achievement. It can be shared online—such as on LinkedIn or on résumés—to highlight specific competencies.

How can I earn a badge?
Learners can earn a badge by successfully completing a set of pre-determined courses. Upon completion of the whole set, the student is eligible for a badge to demonstrate their achievement.

  • Introduction to R for Data Analysis
  • Advanced R for Data Analysis

  • Data Cleaning with R
  • Statistics for Data Analysis

  • Introduction to Python for Data Analysis
  • Machine Learning with Python

  • Data Science Ethics