Meaghan Winder
Visiting Asst. Prof., Mathematics & Statistics

- Office
- Warner 215
- mwinder@middlebury.edu
- Office Hours
- Monday 11:00 AM-12:00 PM and 2:00-3:00 PM, Tuesday and Thursday 3:30-4:30 PM, Wednesday 3:00-4:00 PM, or by appointment
Courses Taught
STAT 0118
Introduction to Data Science
Course Description
Introduction to Data Science (formerly MATH 0118)
In this course students will gain exposure to the entire data science pipeline: forming a statistical question, collecting and cleaning data sets, performing exploratory data analyses, identifying appropriate statistical techniques, and communicating the results, all the while leaning heavily on open source computational tools, in particular the R statistical software language. We will focus on analyzing real, messy, and large data sets, requiring the use of advanced data manipulation/wrangling and data visualization packages. Students will be required to bring alaptop (owned or college-loaned) to class as many lectures will involve in-class computational activities. (formerly MATH 0216) 3 hrs lect./disc. (Not open to students who have taken BIOL 1230, ECON 1230, ENVS 1230, FMMC 1230, HARC 1230, JAPN 1230, LNGT 1230, NSCI 1230, MATH 1230, SOCI 1230, LNGT 1230, PSCI 1230, WRPR 1230, or GEOG 1230.)
Terms Taught
Requirements
STAT 0201
Current
Upcoming
Intro to Stat and Data Sci
Course Description
Introduction to Statistical and Data Sciences
An introduction to statistical methods and the examination of data sets for students with a background in calculus. Topics include descriptive statistics, elementary distributions for data, hypothesis tests, confidence intervals, and regression. Students develop skills in data cleaning, wrangling, visualization, and model fitting using the Statistical Software R. Emphasis will be placed on reproducibility. (MATH 0121 Or APAB[4] Or APBC[3] Or IBAN[6] Or M1DP[40] Or (M1DP[30] And M2DP[30]))
(Not open to students who have taken MATH 0116, MATH 0118, ECON 0111 (formerly ECON 0210), PSYC 0201, STAT 0116, STAT 0118, BIOL 1230, ECON 1230, ENVS 1230, FMMC 1230, HARC 1230, JAPN 1230, LNGT 1230, NSCI 1230, MATH 1230, SOCI 1230, LNGT 1230, PSCI 1230, WRPR 1230, or GEOG 1230.)
Terms Taught
Requirements
STAT 0211
Upcoming
Regression
Course Description
Regression Theory and Applications (formerly MATH 0211)
Regression is a popular statistical technique for making predictions and for modeling relationships between variables. In this course we will discuss the theory and practical applications of linear, log-linear, and logistic regression models. Topics include least squares estimation, coding for categorical predictors, analysis of variance, and model diagnostics. We will apply these concepts to real datasets using R, a statistical programming language. (MATH 0200 Concurr And (MATH 0116 Or STAT 0116) Or (MATH 0201 Or STAT 0201) Or PSYC 0201 Or ECON 0111) (Not open to students who have taken ECON 0211) 3 hrs lect./disc.
Terms Taught
Requirements
STAT 0311
Current
Statistical Inference
Course Description
Statistical Inference
An introduction to the mathematical methods and applications of statistical inference using both classical methods and modern resampling techniques. Topics will include: permutation tests, parametric and nonparametric problems, estimation, efficiency and the Neyman-Pearsons lemma. Classical tests within the normal theory such as F-test, t-test, and chi-square test will also be considered. Methods of linear least squares are used for the study of analysis of variance and regression. There will be some emphasis on applications to other disciplines. This course is taught using R. (MATH 0200 Concurr And MATH 0310) 3 hrs. lect./disc.
Terms Taught
Requirements
STAT 1230
Upcoming
DataScience Across Disciplines
Course Description
Data Science Across Disciplines
In this course, we will gain exposure to the entire data science pipeline—obtaining and cleaning large and messy data sets, exploring these data and creating engaging visualizations, and communicating insights from the data in a meaningful manner. During morning sessions, we will learn the tools and techniques required to explore new and exciting data sets. During afternoon sessions, students will work in small groups with one of several faculty members on domain-specific research projects in Anthropology, Biology, Classics, Political Science, and Statistics. This course will use the R programming language. No prior experience with programming is necessary.
ANTH/LNGT 1230: In this section we will explore indigenous political voice in unexpected places. The data we will analyze will consist of bilingual Apache-English and Maidu-English stories, songs and speeches originally recorded by anthropological linguists in the early twentieth century as examples of traditional culture. Students will use R to search this corpora for how indigenous contributors were also making claims on the future in their address to the researcher and to wider anticipated audiences.
BIOL/ECSC 1230: In this section we will work with data collected by elephant seals equipped with oceanographic instruments in the Southern Ocean. Depending on your interests, you can approach the project from different angles: students focusing on biology will explore where the seals travel and what drives their movements, while those interested in earth science will investigate the temperature and salinity profiles gathered during their dives. Working in teams, you’ll combine these perspectives to build a fuller picture of both seal ecology and the oceanographic processes that shape their environment. Along the way, you’ll practice manipulating and visualizing different types of data including maps of seal tracks, temperature and salinity profiles, and cross-sections of ocean properties. We will also bring in satellite and autonomous float data to place seal activity and the data they collect in a broader context. By the end, you’ll have a sense of how these different data sources fit together and what unique insights we gain from using seals as oceanographers.
CLAS 1230: In this section students will gain hands-on experience with a variety of natural language processing and text mining techniques by exploring the writings of the ancient historian Plutarch, who lived during the first and second centuries AD. We will focus on the biographies of "great men" in Plutarch’s Lives, which chronicles the history, morals, and virtues of major figures who played parallel roles in ancient Greek and Roman society. The public domain English translation from Project Gutenberg will serve as our main corpus; however, students with a background or interest in Ancient Greek can work with the text in the original language. Using the R programming language, students will transform unstructured text into quantitative data for statistical analysis and morphosyntactic parsing. We will also apply machine learning models to our data to reveal underlying patterns in large amounts of text. No prior experience with R is necessary.
PSCI 1230: Who votes in elections? Who attends protests? Why? In this section we will use the tools of data science to explore these and other questions about political participation in the Americas. We will examine engagement in different forms of participation and the demographic, economic, social, and other factors that shape participation. The class will introduce students to the basics of survey research and the study of political participation. Students will complete a final group project showcasing the concepts and tools learned in class.
STAT 1230: In this section students will dive into the world of data science by focusing on invasive species monitoring data. Early detection is crucial to controlling many invasive species; however, there is a knowledge gap regarding the sampling effort needed to detect the invader early. In this course, we will work with decades of invasive species monitoring data collected across the United States to better understand how environmental variables play a role in the sampling effort required to detect invasive species. Students will gain experience in the entire data science pipeline, but the primary focus will be on data scraping, data visualization, and communication of data-based results to scientists and policymakers.
Terms Taught
Requirements