This repository contains code for analyzing customer data from a mall. The data includes information about customers' age, gender, annual income, and spending score.
Make sure you have Python installed on your system. You'll also need to install the following Python libraries:
- numpy
- pandas
- matplotlib
- seaborn
- scipy
- scikit-learn
You can install these libraries using pip:
-
Clone the repository to your local machine:
-
Navigate to the project directory:
data_analysis.py
: Contains functions for cleaning, processing, and analyzing the customer data.plotting.py
: Includes functions for plotting various visualizations of the data.main.py
: Main script to execute data analysis and plotting functions.
The data_analysis.py
script includes functions to clean the customer data. This involves removing null values, renaming columns, removing duplicates, and filtering outliers.
The plotting.py
script contains functions to plot various visualizations of the customer data, including scatter plots, histograms, pair plots, and 3D plots.
After running the analysis scripts, you'll obtain insights into the customer demographics, spending behavior, and correlations between different variables.
This project is licensed under the MIT License - see the LICENSE file for details.