Introduction to Data Science

In this course students will gain exposure to the entire data science pipeline: forming a statistical question, collecting and cleaning data sets, performing exploratory data analyses, identifying appropriate statistical techniques, and communicating the results, all the while leaning heavily on open source computational tools, in particular the R statistical software language. We will focus on analyzing real, messy, and large data sets, requiring the use of advanced data manipulation/wrangling and data visualization packages. Students will be required to bring their own laptops as many lectures will involve in-class computational activities. (MATH 0116; or ECON 0210 or PSYC 0201 and experience with R) 3 hrs lect./disc.

Schedule
11:15am-12:05pm on Monday, Wednesday, Friday (Feb 12, 2018 to May 14, 2018)
Location
Warner Hall 506
Instructors