-
Notifications
You must be signed in to change notification settings - Fork 172
Sklearn and XGBoost algorithms and groupby features
- Put correct paths to data: neptune_random_search.yaml:L21.
- Put your neptune.ml user-name (you specify it upon registration π) in the first line: neptune_random_search.yaml:L1.
- Consider your own ranges of hyper-parameters and number of runs. We put some reasonable values, hence setup as-is should give you decent results.
- Run experiment π
neptune run --config neptune_random_search.yaml main.py train_evaluate_predict --pipeline_name XGBoost
Once you put your paths to data and neptune.ml user-name, you have five end-to-end pipelines available:
the model | run command |
---|---|
lightGBM | neptune run --config neptune_random_search.yaml main.py train_evaluate_predict --pipeline_name lightGBM |
XGBoost | neptune run --config neptune_random_search.yaml main.py train_evaluate_predict --pipeline_name XGBoost |
random_forest | neptune run --config neptune_random_search.yaml main.py train_evaluate_predict --pipeline_name random_forest |
log_reg | neptune run --config neptune_random_search.yaml main.py train_evaluate_predict --pipeline_name log_reg |
svc | neptune run --config neptune_random_search.yaml main.py train_evaluate_predict --pipeline_name svc |
'log_reg' is logistic regression and 'svc' is Support Vector Classifier
All of them are parameterizable in the neptune_random_search.yaml, so you can easily run multiple experiments π
- Make sure that you put correct paths to data: neptune.yaml:L21 and neptune.ml user-name (you specify it upon registration) in the first line: neptune.yaml:L1.
- run experiment π
neptune run --config neptune.yaml main.py train_evaluate_predict --pipeline_name lightGBM
We have added substantial number of group-by based featurs. The recipe for extracting them is showed here: aggregation_recipes Transformed that calculated features is implemented here: GroupbyAggregations
XGBoost model is defined in the models.py:L12 and its parameters are all listed in the configuration file:
- random search: neptune_random_search.yaml:L71
- single run: neptune.yaml:L71
Besides this, your have three sklearn-based algorithm available, namely: random forest, logistic regression and support Vector Classifier.
check our GitHub organization https://github.com/neptune-ml for more cool stuff π
Kamil & Kuba, core contributors
- chestnut π°: LightGBM and basic features
- seedling π±: Sklearn and XGBoost algorithms and groupby features
- blossom πΌ: LightGBM on selected features
- tulip π·: LightGBM with smarter features
- sunflower π»: LightGBM clean dynamic features
- four leaf clover π: Stacking by feature diversity and model diversity