Personalized Coaching - The ML Pipeline

The ML pipeline establishes individual models for reaching the daily steps threshold on an individual level, based on FitbitFlex data.

The ML Pipeline consists of 4 jupyter notebooks:

1 HanzeFitbit Data preparation
2 HanzeFitbit Determine the score
3 HanzeFitbit all treament ids, classification algorithms results database pickle
4 HanzeFitbit normal dataframe all treament ids, all models, results database, Predict

These notebooks use an Oracle xe 11g2 database to store and retrieve data. In the folder Metrics and Database the schema needed is added: DatabaseWithoutData.ddl

After installation of the database and the schema, Fitbit csv files can be loaded into the database with notebook 1. Notebook 2 determines the F1-score and Acccuracy of several classification algorithms on grouplevel. Notebook 3 determines the F1-score and Accuracy of several classification algorithms on treatment_id level and pickles the models Notebook 4 uses the pickled models to predict the probability on the individual meeting his threshold.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Personalized Coaching - The ML Pipeline

Files

README.md

Latest commit

History

README.md

File metadata and controls

Personalized Coaching - The ML Pipeline