A ML project
Tool used: Jupiter Notebook, python Libraries: Sklearn, numpy, pandas
Approach
- Data with Nan values were removed
- "Product_id","Customer_name" columns are removed
- "Loyalty_customer", "Product_Category" are mapped to one hot encoding
- Extract year and month and drop instock_date column
- Used RandomForestRegression model to tarin with test data od 1/3rd.
- Predict
Further Improvement:
- Add new features like average price, time of selling etc
- Hyper parameter tuning of model
- Train model using entire dataset