Spring 2020

Agenda

  • Introduction
    • Syllabus
    • Assignments
      • Homework
      • Labs
      • Data Project
      • Final exam
      • Meetup Presentation
    • The DATA606 R Package
    • Using R Markdown

Introduction

A little about me:

  • Principal Investigator for a Department of Education Grant (part of their FIPSE First in the World program) to develop a Diagnostic Assessment and Achievement of College Skills (www.DAACS.net) at Excelsior College
  • Research Data Associate at Cornell University
  • Authored over a dozen R packages including:
  • Specialize in propensity score methods. Three new methods/R packages developed include:

Also a Father…

Runner…

And photographer.

Syllabus

Syllabus and course materials are here: http://data606.net

The site is built using the Blogdown R package and hosted on Github. Each page of the site has a “Improve this page” link at the bottom right, use that to start a pull request on Github.

We will use Blackboard primary only for submitting assignments. Please submit:

  • A PDF or link to the built HTML (e.g. Rpubs, Github)

PDFs are preferred for the homework as there is some LaTeX formatting in the R markdown files. The tineytex R package helps with install LaTeX, but you can also install LaTeX using (MiKTeX (for Windows) and BasicTeX (for Mac) See this page for more information: https://spring2020.data606.net/course-overview/software/

Start End Topic
Monday, January 27, 2020 Sunday, February 02, 2020 Chapter 1 - Intro to Data
Monday, February 03, 2020 Sunday, February 09, 2020 Chapter 2 - Summarizing Data
Monday, February 10, 2020 Sunday, February 16, 2020 Chapter 3 - Probability
Monday, February 17, 2020 Sunday, March 01, 2020 Chapter 4 - Distributions
Monday, March 02, 2020 Sunday, March 08, 2020 Chapter 5 - Foundation for Inference
Monday, March 09, 2020 Sunday, March 15, 2020 Chapter 6 - Inference for Categorical Data
Monday, March 16, 2020 Sunday, March 22, 2020 Chapter 7 - Inference for Numerical Data
Monday, March 23, 2020 Sunday, April 19, 2020 Chapter 8 - Linear Regression
Wednesday, April 08, 2020 Thursday, April 16, 2020 Spring Recess
Monday, April 20, 2020 Sunday, May 03, 2020 Chapter 9 - Multiple and Logistic Regression
Monday, May 04, 2020 Wednesday, May 13, 2020 Intro to Bayesian Analysis
Thursday, May 14, 2020 Sunday, May 17, 2020 Final Exam

Assignments

  • Getting Acquainted (1%)
  • Homework (18%)
  • Labs (40%)
    • Labs are designed to introduce to you doing statistics with R.
    • Answer the questions in the main text as well as the “On Your Own” section.
  • Data Project (18)
    • This allows you to analyze a dataset of your choosing. Projects will be shared with the class. This provides an opportunity for everyone to see different approaches to analyzing different datasets.
    • Proposal is due March 29th (5%); Final project is due May 10th (15%).
  • Final exam (18%)
  • Meetup Presentation (5%)
    • Present one practice problem during our weekly meetups. Signup using the Google Spreadsheet.
    • Please select odd number questions only!

Communication

The DATA606 R Package

The package can be installed from Github using the devtools package.

devtools::install_github('jbryer/DATA606')

Download the Setup.R script here: https://github.com/jbryer/DATA606Spring2020/blob/master/R/Setup.R

Important Functions

  • library('DATA606') - Load the package
  • vignette(package='DATA606') - Lists vignettes in the DATA606 package
  • vignette('os4') - Loads a PDF of the OpenIntro Statistics book
  • data(package='DATA606') - Lists data available in the package
  • getLabs() - Returns a list of the available labs
  • viewLab('Lab1') - Opens Lab1 in the default web browser
  • startLab('Lab1') - Starts Lab1 (copies to getwd()), opens the Rmd file
  • shiny_demo() - Lists available Shiny apps

Using R Markdown

R Markdown files are provided for all the labs and homework.

  • You can download R markdown template files for the homework by right clicking and selecting “Save file as…” from the Homework page.
  • You can start a lab using the DATA606::startLab function.

However, creating new R Markdown files in RStudio can be done by clicking File > New File > R Markdown.

For more information about R Markdown, check out the RStudio page at https://rmarkdown.rstudio.com/