Assignments

  • Project

    Final Project

    For the final project, you will be assigned into groups and conduct an exploratory data analysis of the U.S. Department of Education’s College Scorecard dataset.

  • Reading

    Week 15

    R for Data Science

    Chapter 23: Model basics

  • Reading

    Week 14

    Introductory Statistics with Randomization and Simulation

    Click here to download the textbook.

    Chapter 5: Introduction to linear regression

    • From the beginning through to the end of section 5.1.4

    • Section 5.4.1

    R for Data Science

    Chapter 22: Introduction to Model section

    Chapter 23: Model basics

  • Homework

    Homework 3

    For your third homework assignment, you will use statistical inference to answer a question about the National Survey of Family Growth, Cycle 6 dataset published by the National Center for Health Statistics.

  • Reading

    Week 13

    Introductory Statistics with Randomization and Simulation

    Click here to download the textbook.

    Chapter 2: Foundation for inference

    • From section 2.4 through to the end of section 2.5

    Chapter 4: Inference for numerical data

    • Section 4.5, skipping subsection 4.5.3
  • Reading

    Week 12

    Introductory Statistics with Randomization and Simulation

    Click here to download the textbook.

    Chapter 2: Foundation for inference

    • From the beginning up to the end of section 2.3
  • Reading

    Week 11

    Introduction to computational and data sciences supplemental book

    Chapter 4: Representing distributions

    R for Data Science

    Chapter 7: Exploratory data analysis

  • Homework

    Homework 2 (Group)

    Collaborate with your assigned group members to turn your individual submissions for Homework 2 into a data exploration report.

  • Module Exercise

    DataCamp exercises (Week 10)

    Instructions

    On Datacamp, watch the tutorial videos and complete the interactive coding challenges from the following lessons, which will let you practice and supplement this week’s content.

    Important!

    Your progress through these lessons is being tracked and completing them counts towards the Module exercises category of your grade.

    From Working with data in the Tidyverse (Due: Nov-05)

    • Tidy your data

    • Transform your data

  • Module Exercise

    DataCamp exercises (Week 9)

    Instructions

    On Datacamp, watch the tutorial videos and complete the interactive coding challenges from the following lessons, which will let you practice and supplement this week’s content.

    Important!

    Your progress through these lessons is being tracked and completing them counts towards the Module exercises category of your grade.

    From Working with data in the Tidyverse (Due: Oct-29)

    You will be completing the full course, Working with data in the Tidyverse, over the next two weeks. Chapter 3 of the course, “Tidy your data”, provides you with practice using the tidyr package. However, the lessons in chapter 3 assume you’re familiar with the content in chapters 1 and 2, so it’s best to complete those first.

    • Explore your data

    • Tame your data

    From Working with data in the Tidyverse (Due: Nov-05)

    These are the module exercises for week 10. If you complete the first two chapters and want to get started on chapters 3 and 4 now, you may do so.

    • Tidy your data

    • Transform your data

  • Reading

    Week 9

    R for Data Science

    From chapter 12: from the beginning through to the end of section 12.5

  • Homework

    Homework 2

    For your second homework assignment, you will explore a dataset about the passengers on the Titanic, the British passenger liner that crashed into an iceberg during its maiden voyage and sank early in the morning on April 16, 1912.

  • Module Exercise

    DataCamp exercises (Week 8)

    Instructions

    On Datacamp, watch the tutorial videos and complete the interactive coding challenges from the following lessons, which will let you practice and supplement this week’s content.

    Important!

    Your progress through these lessons is being tracked and completing them counts towards the Module exercises category of your grade.

    From Introduction to the Tidyverse

    • Grouping and summarizing
  • Reading

    Week 8

    R for Data Science

    From chapter 5: sections 5.5 through 5.7

  • Homework

    Homework 1 (Group)

    Collaborate with your assigned group members to turn your individual submissions for Homework 1 into a data exploration report.

  • Module Exercise

    DataCamp exercises (Week 6)

    Instructions

    On Datacamp, watch the tutorial videos and complete the interactive coding challenges from the following lessons, which will let you practice and supplement this week’s content.

    Important!

    Your progress through these lessons is being tracked and completing them counts towards the Module exercises category of your grade.

    From Introduction to the Tidyverse

    • Data wrangling
  • Reading

    Week 6

    R for Data Science

    All of chapter 4 (short)

    From chapter 5: sections 5.1 through 5.4

  • Homework

    Homework 1

    Your first major assignment is a set of exercises based around a single dataset called rail_trail, which will provide you with practice in creating visualizations using R and ggplot2.

  • Module Exercise

    DataCamp exercises (Week 5)

    Instructions

    On Datacamp, watch the tutorial videos and complete the interactive coding challenges from the following lessons, which will let you practice and supplement this week’s content.

    Important!

    Your progress through these lessons is being tracked and completing them counts towards the Module exercises category of your grade.

    From Data Visualization with ggplot2 (Part 1)

    • Aesthetics

    • Geometries

  • Reading

    Week 5

    R for Data Science

    From chapter 3: sections 3.6 through 3.10

  • Module Exercise

    DataCamp exercises (Week 4)

    Instructions

    On Datacamp, watch the tutorial videos and complete the interactive coding challenges from the following lessons, which will let you practice and supplement this week’s content.

    Important!

    Your progress through these lessons is being tracked and completing them counts towards the Module exercises category of your grade.

    From Data Visualization with ggplot2 (Part 1)

    • Introduction
  • Mini-Assignment

    R Markdown mini-assignment

    Mini-assignment on editing R Markdown files and saving to GitHub.

  • Mini-Assignment

    Visualization mini-assignment

    Mini-assignment to practice using RStudio to run code blocks in RMarkdown files and to create visualizations using ggplot2.

  • Reading

    Week 4

  • Module Exercise

    DataCamp exercises (Week 3)

    Instructions

    You should have received an invitation in your Mason email address to register and join DataCamp. If you have not done so already, complete the registration so that you can join the Introduction to Computational and Data Sciences class there. You are to watch the tutorial videos and complete the interactive coding challenges from the following lessons, which will let you practice and supplement this week’s content.

    Important!

    Your progress through these lessons is being tracked and completing them counts towards the Module exercises category of your grade.

    From Introduction to R

    • Intro to basics

    • Vectors

    From Reporting with R Markdown

    • Authoring R Markdown reports

    • Embedding Code

    From Working with the RStudio IDE Part 1

    • Orientation

    From Working with the RStudio IDE Part 2

    • Version Control
  • Reading

    Week 3

  • Reading

    Week 2

  • Module Exercise

    Can Twitter predict election results?

    A module exercise about a data science study that used Twitter data to predict election outcomes.

  • Reading

    Week 1

    R for Data Science

    All of chapter 1

    Introductory Statistics with Randomization and Simulation

    All of Chapter 1, except skip sections 1.3.5, all of 1.4, all of 1.5, skip 1.6.8.