Data Science: R Basics

Build a foundation in R and learn how to wrangle, analyze, and visualize data.

In this online course taught by Harvard Professor Rafael Irizarry, learn how to Build a foundation in R and learn how to wrangle, analyze, and visualize data.

Featuring faculty from:
Self-Paced
Length
8 weeks
1-2 hours a week
Certificate Price
$219
Program Dates
Self-Paced
Length
8 weeks
1-2 hours a week
Certificate Price
$219
Program Dates
Start Data Science: R Basics Today

What You'll Learn

The first in our Professional Certificate Program in Data Science, this course will introduce you to the basics of R programming. You can better retain R when you learn it to solve a specific problem, so you'll use a real-world dataset about crime in the United States. You will learn the R skills needed to answer essential questions about differences in crime across the different states.

We'll cover R's functions and data types, then tackle how to operate on vectors and when to use advanced functions like sorting. You'll learn how to apply general programming features like "if-else," and "for loop" commands, and how to wrangle, analyze and visualize data.

Rather than covering every R skill you might need, you'll build a strong foundation to prepare you for the more in-depth courses later in the series, where we cover concepts like probability, inference, regression, and machine learning. We help you develop a skill set that includes R programming, data wrangling with dplyr, data visualization with ggplot2, file organization with UNIX/Linux, version control with git and GitHub, and reproducible document preparation with RStudio.

The demand for skilled data science practitioners is rapidly growing, and this series prepares you to tackle real-world data analysis challenges.

The course will be delivered via edX and connect learners around the world. By the end of the course, participants will understand the following concepts:

  • Basic R syntax
  • Foundational R programming concepts such as data types, vectors arithmetic, and indexing
  • How to perform operations in R including sorting, data wrangling using dplyr, and making plots

Your Instructors

Image
Rafael Irizarry

Rafael Irizarry

Professor of Biostatistics at Harvard University
Read full bio.

Ways to take this course

When you enroll in this course, you will have the option of pursuing a Verified Certificate or Auditing the Course.

A Verified Certificate costs $219 and provides unlimited access to full course materials, activities, tests, and forums. At the end of the course, learners who earn a passing grade can receive a certificate. 

Alternatively, learners can Audit the course for free and have access to select course material, activities, tests, and forums. Please note that this track does not offer a certificate for learners who earn a passing grade.

Read More

Introduction to Linear Models and Matrix Algebra

Perform matrix operations

Learn to use R programming to apply linear models to analyze data in life sciences.

Read More

Data Science: Inference and Modeling

Key concepts through a motivating case study

Learn inference and modeling: two of the most widely used statistical tools in data analysis.

Read More

Ciencia de Datos: Fundamentos de R

Sentar las bases de conocimiento en R y aprender a discutir, analizar y visualizar datos.

Sentar las bases de conocimiento en R y aprender a discutir, analizar y visualizar datos.