America's Next Top Model: Demystifying Two Methods for Election Prediction

Bella Karduck ([email protected]), Haley Johnson ([email protected]), Rohit Maramaju ([email protected]) Philip Menchaca ([email protected])

If knowledge is power then when it comes to election predictions, the public is in the dark. Media reports are filled with opinion polling data and pundits expound on which candidate will win, but how are these predictions made?

We test two methods of election prediction — a classical statistical approach and a machine learning method — and make them understandable to a general audience. A public website walks readers through the details of each method and allows them to compare the models’ predictions with actual outcomes.

See pollrbear.com for more details.

This project fulfills the capstone requirement for the Master's of Science in Information, Data Science, at the University of Michigan.

How To Run Our Code

To run the MRP model, run the cells in models/mrp_model.Rmd in order.

To run the machine learning model, run the cells in src/clean_ML.ipynb in order.

Datasets

All datasets we used are publicly available. All rights belong to their respective owners.

Polls

Monmouth Univeristy
Harvard University Poll, October 2020
COMETrends Pre-election survey from UT Dallas, October 2020
Reuter's Poll, January 2024

Census Data

2020 5-year estimates from the American Community's Survey (retrieved with IPUMS)

Repository Structure

Our repistory has the following structure. Note that only key files are included for brevity.

├── Project Poster                          <- Poster for UMSI project expo
├── data                                    <- Data soruces used
| 
├── documentation                           <- Documents data cleaning
| 
├── models                                  <- Code for MRP model and propensity scores
│   └── mrp_model.Rmd                       <- Model 
│   └── mrp_model.html                      <- HTML rendering of R notebook 
│
├── src                                     <- Python scripts & notebooks
│   └── clean_ML.ipynb                      <- Machine learning model   
│   └── census_getter.py                    <- Script to pull data with census API
|   └── helper.py                           <- Helper functions to process Reuter's poll
│   └── process_census_data.ipynb           <- Clean census data 
|   └── process_comet_poll.py               <- Clean COMET poll
│   └── process_harvard_poll_data.ipynb     <- Clean Harvard poll
│   └── process_poll_data.ipynb             <- Clean Monmouth poll
│   └── process_reuters_poll.py             <- Clean Reuter's poll
|
├── website_699                             <- Source code for website
├── LICENSE
├── README.md
├── report.pdf                              <- Detailed overview of our work
└── requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

America's Next Top Model: Demystifying Two Methods for Election Prediction

How To Run Our Code

Datasets

Repository Structure

About

Releases 2

Packages

Contributors 4

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 199 Commits
Project Poster		Project Poster
data		data
documentation		documentation
models		models
src		src
website_699		website_699
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
report.pdf		report.pdf
requirements.txt		requirements.txt

License

pmench/mrp-team

Folders and files

Latest commit

History

Repository files navigation

America's Next Top Model: Demystifying Two Methods for Election Prediction

How To Run Our Code

Datasets

Repository Structure

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 4

Languages

Packages