Skip to main content

Professional Certificate Series

Professional Certificate in Data Analysis for Life Sciences

This HarvardX professional certificate program gives learners the necessary skills and knowledge to analyze data in the life sciences.

    • 4 Courses
    • 4 Months
    • Earn Your Certificate
    What You'll Learn

    Technological advances have transformed fields that rely on data by providing a wealth of information ready to be analyzed.

    From working with single genes to comparing entire genomes, biomedical research groups around the world are producing more data than they can handle and the ability to interpret this information is a key skill for any practitioner. The skills necessary to work with these massive datasets are in high demand, and this series will help you learn those skills. Using the open-source R programming language, you’ll gain a nuanced understanding of the tools required to work with complex life sciences and genomics data. You’ll learn the mathematical concepts — and the data analytics techniques — that you need to drive data-driven research. From a strong foundation in statistics to specialized R programming skills, this series will lead you through the data analytics landscape step-by-step. Taught by Rafael Irizarry from the Harvard T.H. Chan School of Public Health, these courses will enable new discoveries and will help you improve individual and population health. If you’re working in the life sciences and want to learn how to analyze data, enroll now to take your research to the next level.

    Learn More

    4 Courses

    Beyond our premium learning paths you can still earn certificates

    Introduction to Linear Models and Matrix Algebra

    2-4 hours per week • Start today

    Learn more

    Statistics and R

    2-4 hours per week • Start today

    Learn more

    Statistical Inference and Modeling for High-throughput Experiments

    2-4 hours per week • Start today

    Learn more

    High-Dimensional Data Analysis

    2-4 hours per week • Start today

    Learn more

    Learn from the best in the industry

    Meet your instructors

    Michael Love

    Michael Love

    Assistant Professor, UNC Gillings School of Global Public Health

    Rafael Irizarry

    Rafael Irizarry

    Professor of Biostatistics, Harvard T.H. Chan School of Public Health

    After completing the series, learners will understand:

    Basic statistical concepts and R programming skills for analyzing data in the life sciences.

    The underlying math of linear models useful for data analysis in the life sciences.

    The techniques used to perform statistical inference on high-throughput and high-dimensional data.

    Several techniques widely used in the analysis of high-dimensional data.

    Industry Insights

    R is listed as a required skill in 64% of data science job postings and was Glassdoor’s Best Job in America in 2016 and 2017. (source: Glassdoor)

    Companies are leveraging the power of data analysis to drive innovation. Google data analysts use R to track trends in ad pricing and illuminate patterns in search data. Pfizer created customized packages for R so scientists can manipulate their own data.

    32% of full-time data scientists started learning machine learning or data science through a MOOC, while 27% were self-taught. (source: Kaggle, 2017)

    Data Scientists are few in number and high in demand. (source: TechRepublic)