Skip to content

A repository for clustering and fitting analysis on a dataset of mall customers

Notifications You must be signed in to change notification settings

AgVicCodes/clustering_analysis

Repository files navigation

Customer Data Analysis

This repository contains code for analyzing customer data from a mall. The data includes information about customers' age, gender, annual income, and spending score.

Prerequisites

Make sure you have Python installed on your system. You'll also need to install the following Python libraries:

  • numpy
  • pandas
  • matplotlib
  • seaborn
  • scipy
  • scikit-learn

You can install these libraries using pip:

  1. Clone the repository to your local machine:

  2. Navigate to the project directory:

Scripts Overview

  • data_analysis.py: Contains functions for cleaning, processing, and analyzing the customer data.
  • plotting.py: Includes functions for plotting various visualizations of the data.
  • main.py: Main script to execute data analysis and plotting functions.

Data Cleaning

The data_analysis.py script includes functions to clean the customer data. This involves removing null values, renaming columns, removing duplicates, and filtering outliers.

Data Visualization

The plotting.py script contains functions to plot various visualizations of the customer data, including scatter plots, histograms, pair plots, and 3D plots.

Results

After running the analysis scripts, you'll obtain insights into the customer demographics, spending behavior, and correlations between different variables.

License

This project is licensed under the MIT License - see the LICENSE file for details.

About

A repository for clustering and fitting analysis on a dataset of mall customers

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published