Skip to content

Predicting NCAA Basketball Tournament Success with Machine Learning Models

Notifications You must be signed in to change notification settings

COMP562-Machine-Learning-March-Madness/Final-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Predicting NCAA Basketball Tournament Success with Machine Learning Models

Abstract

In this paper, we attempt to predict the success of teams participating in the 2015-2019 NCAA Men's Basketball Tournaments using Machine Learning algorithms and find which statistical features give the highest likelihood for games won. By training our data on multiple regression models, we find the best model to be Random Forest Regression. We also find that Power Rating (BARTHAG) has the largest impact on how far a team makes it in the tournament. Using our test results, we predict Dayton as the winner of the 2020 NCAA Tournament that had been canceled due to COVID-19.

The Data

We used the College Basketball Dataset by Andrew Sundberg for our predictions of the 2015-2020 seasons.

Results

We found Random Forest Regression to be the most accurate regression model for predicting the champion. The most important feature that contributed to how far a team goes in the tournament was BARTHAG (Power Rating).

2020 Final Four Prediction

We then used our model to predict the winner of the 2020 tournament if it were to happen given this season's stats.

Authors

About

Predicting NCAA Basketball Tournament Success with Machine Learning Models

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •