This project aims to predict diseases based on symptoms using machine learning models. Three classifiers (Support Vector Machine, Naive Bayes, and Random Forest) are trained on a dataset, and predictions are made by taking the mode of the predictions from all three models.
This project predicts diseases based on a given set of symptoms. The main goal is to develop an ensemble model that combines the predictions of three different classifiers to improve the accuracy of disease prediction.
The dataset used for this project consists of training and testing data with various symptoms and their corresponding diseases. The data is assumed to be in CSV format with symptom columns and a target disease column.
Three machine learning models are used in this project:
- Support Vector Machine (SVC)
- Naive Bayes (GaussianNB)
- Random Forest Classifier
The final prediction is made by taking the mode of the predictions from these three models.
The ensemble model achieves a certain level of accuracy on the test dataset. The confusion matrix provides a detailed view of the model's performance across different classes.
- Pyhton 3.x
- NumPy
- pandas
- scikit-learn
- seaborn
- matplotlib