The aim of this project is to reduce the death due to drug over dose. Here we are determining who is prescribing opiate very frequently in USA. We classify the prescriber frequent if he is prescribing more the 10 opiate associated drug within a year.
-
I have used data from kaggle site This Link
-
Set up conda env:
- create an env
conda create -n fop
- install all required packages
conda env update -f env.yml
- create an env
-
Notebooks:
- When I was implementing this project, I used notebooks. I have them in two files.
- In EDA.ipynb I have all my Data analysis, and in ml_model.ipynb I have written all the training codes.
- The config.py has all hardcoded variable values.
- The pipeline.py has the main pipeline
- The preprocessors.py has all the custom preprocessors.
-
package_fop folder:
- This folder has all the python scripts for deployment easy style.
- I have created the package of this package, so that later I can easily import it in Flask app.
-
setup.py file:
- To build the package use
python setup.py sdist bdist_wheel
- To locally install type below command from the directory where setup.py is present.
pip install -e .
- To build the package use
-
MANIFEST.in
- This file contains the name of extensions which are not by default added to sdist/bdist. Because of the names the files get included in package.