GitHub - tmelman/BiasedRandomForest: An implementation of Bader-El-Den 2019 (IEEE) paper

Code for implementation of BRAF/Biased Random Forest, from Bader-El-Den 2019 (IEEE).

To run the pipeline on the PIMA dataset, run python run_pipeline.py from the command line.

Credit Tamar Melman, 2020

This code contains 3 files:

ml_utils.py: utility functions to calculate metrics of interest for ML algorithm evaluation
randomforest.py: script defining DecisionTree, RandomForest, and BiasedRandomForest implementations
run_pipeline.py, which runs the entire analysis pipeline to train the model and output metrics.

BiasedRandomForest is for demonstration purposes and is not recommended for ML applications; for imbalanced data, I would recommend one of the following approaches:

Use a weighted Random Forest
modifying the algorithm to pick a balanced subset of the majority class
using SMOTE upsampling with a standard RandomForest

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
__pycache__		__pycache__
README.md		README.md
diabetes_braf_dev.ipynb		diabetes_braf_dev.ipynb
ml_utils.py		ml_utils.py
randomforest.py		randomforest.py
run_pipeline.py		run_pipeline.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

tmelman/BiasedRandomForest

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages