fantasy

Predict player points per game (half-PPR scoring) using a stacked ensemble of machine learning models, trained on Sports Reference player statistics from the previous 20 years.

Reproducibility

To reproduce this work, clone the repository, navigate to its root directory, and create and activate the conda environment:

git clone https://github.com/bscod27/fantasy.git
cd fantasy
conda env create -f environment.yml
conda activate fantasy

Results can be reproduced by executing the following scripts in order: scripts/scraper.py, scripts/wrangle.R, scripts/train_baselearner.py, scripts/train_metalearner.py, and scripts/generate_predictions.py. Each script does the following:

scripts/scraper.py - scrapes Sports Reference player statistics from the last 20 years and writes them to data/data_[t-20]_[t].csv; requires a command-line argument specifying the current year, t
scripts/wrangle.R - engineers lag-1 features by player and partitions the data into data/traindat.csv (training data with labels) and data/newdat.csv (the most recent year's statistics, used as inputs for predictions)
scripts/train_baselearner.py - trains several lower-level base learners using 5-fold groupwise (by year) cross-validation, holding out the most recent 4 years as a test set; requires a command-line argument specifying a model type, one of [enet, knn, svr, rf, xgb]; writes the following files:
- ensemble/train_base_[model].csv - transformed train set (stacked out-of-fold predictions)
- ensemble/test_base_[model].csv - transformed test set (predictions averaged across the models trained in each fold)
- ensemble/models_base_[model].pkl - pickled models trained in each fold
scripts/train_metalearner.py - trains an Elastic Net model on the transformed feature space and prints the model performance; saves the pickled meta learner as ensemble/meta_learner.pkl
scripts/generate_predictions.py - uses the saved models to generate predictions on data/newdat.csv; writes these predictions to preds/predictions.csv
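
The stacking step described for scripts/train_baselearner.py can be illustrated with a minimal, dependency-free sketch. This is not the repository's code. The function names (fit_line, predict, stack_base_learner), the (year, x, y) row layout, the round-robin assignment of years to folds, and the single-feature least-squares line standing in for a real base learner are all assumptions made for illustration. It shows the three outputs listed above: stacked out-of-fold predictions for the train set, test-set predictions averaged across the fold models, and the fold models themselves.

```python
def fit_line(xs, ys):
    """Least-squares line; a hypothetical stand-in for a real base learner."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx if sxx else 0.0
    return slope, my - slope * mx


def predict(model, xs):
    slope, intercept = model
    return [slope * x + intercept for x in xs]


def stack_base_learner(rows, n_folds=5, n_test_years=4):
    """rows: list of (year, x, y). Returns (oof_preds, test_preds, models)."""
    years = sorted({year for year, _, _ in rows})
    test_years = set(years[-n_test_years:])  # most recent years are held out
    train_years = years[:-n_test_years]
    # Groupwise folds: every row from a given year lands in the same fold
    # (round-robin assignment is an assumption of this sketch).
    fold_of = {year: i % n_folds for i, year in enumerate(train_years)}

    train = [r for r in rows if r[0] not in test_years]
    test = [r for r in rows if r[0] in test_years]

    oof = [None] * len(train)
    test_sum = [0.0] * len(test)
    models = []
    for fold in range(n_folds):
        # Fit on every training year outside the current fold.
        fit_rows = [r for r in train if fold_of[r[0]] != fold]
        model = fit_line([r[1] for r in fit_rows], [r[2] for r in fit_rows])
        models.append(model)
        # Out-of-fold predictions: each train row is predicted by the one
        # model that never saw its year.
        for i, r in enumerate(train):
            if fold_of[r[0]] == fold:
                oof[i] = predict(model, [r[1]])[0]
        # Every fold model also predicts the held-out test set.
        for i, p in enumerate(predict(model, [r[1] for r in test])):
            test_sum[i] += p
    test_preds = [s / n_folds for s in test_sum]  # average across fold models
    return oof, test_preds, models


# Synthetic data: 20 years, 3 rows per year, with y = 2x + 1.
rows = [(year, float(x), 2.0 * x + 1.0)
        for year in range(2004, 2024) for x in range(3)]
oof, test_preds, models = stack_base_learner(rows)
print(len(oof), len(test_preds), len(models))
```

Grouping folds by year keeps all rows from one season on the same side of each split, so an out-of-fold prediction never comes from a model that was fit on data from that season. The out-of-fold column and the averaged test column then serve as one input feature each for the meta learner.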
