STAT 1230A
DataScience Across Disciplines
Data Science Across Disciplines
In this course, we will gain exposure to the entire data science pipeline—obtaining and cleaning large and messy data sets, exploring these data and creating engaging visualizations, and communicating insights from the data in a meaningful manner. During morning sessions, we will learn the tools and techniques required to explore new and exciting data sets. During afternoon sessions, students will work in small groups with one of several faculty members on domain-specific research projects in Geography, Political Science, Restorative Justice, Healthcare, or. This course will use the R programming language. No prior experience with R is necessary.
MATH/STAT 1230: Students will explore pediatric healthcare data to better understand the risks correlated with various childhood illnesses through an emphasis on the intuition behind statistical and machine learning techniques. We will practice making informed decisions from noisy data and the steps to go from messy data to a final report. Students will become proficient in R and gain an understanding of various statistical techniques.
GEOG 1230: In this section, students will use data science tools to explore the ways migration systems in the United States changed during the COVID-19 pandemic. We will draw on data collected from mobile phones recording each phone’s monthly place of residence at the census tract level. The dataset includes monthly observations from January 2019 through December 2021 allowing the analysis to compare migration systems pre-pandemic with those during the pandemic.
INTD 1230A: Data is a powerful tool for improving health outcomes by making programmatic choices to support justice. In this afternoon section of Data Across the Disciplines, students will be working with Addison County Restorative Justice (ACRJ) on understanding patterns in the occurrence of driving under the influence. ACRJ has over 1,000 cases and would like to better understand their data and come up with ways to access information. We will explore how identity, geography, and support impact outcomes from DUI cases. Using statistical analysis and data visualizations, along with learning about ethical data practices, we will report our findings.
INTD 1230B: Let’s dive into the minutes and reports of local towns to develop an accessible news and history resource. Could this be a tool for small newspapers to track local news more easily? Can we map this fresh data for a new look across geographies? Do you want to help volunteer town officials make decisions and better wrangle with their town’s history and data?
In this course we will develop a focused database of documents produced by several municipal boards and commissions. We will engage in conversation with local officials, researchers, and journalists. This course aims to introduce students to making data from real world documents and the people that make them to generate useful information that is often open but frequently difficult to sift through.
PSCI 1230: How do candidates for U.S. national office raise money? From whom do they raise it? In this section we will explore these questions using Federal Election Commission data on individual campaign contributions to federal candidates. Our analysis using R will help us identify geographic patterns in the data, as well as variations in funds raised across types of candidates. We will discuss what implications these patterns may have for the health and functioning of democracy in the U.S.
- Schedule
- 10:00am-11:30am on Monday, Tuesday, Wednesday, Thursday at MBH 104 (Jan 4, 2024 to Feb 1, 2024)
2:00pm-3:40pm on Monday, Tuesday, Wednesday at WNS 011 (Jan 4, 2024 to Feb 1, 2024) - Location
- McCardell Bicentennial Hall 104
- Instructors
-
-
Czekanski, Michael
mczekanski@middlebury.edu -
Lyford, Alex
alyford@middlebury.edu
-