This project focuses on demonstrating SQL database normalization techniques, up to the third normal form (4NF), using Python. The goal is to organize a database efficiently to reduce redundancy and improve data integrity. It includes automated data generation for a sales transaction dataset, utilizing the Faker Python library. Faker creates realistic sales transactions, simulating customer, product, and order data.
An automation pipeline is also set up in the .github/workflows
directory using GitHub Actions. This pipeline automates the process of generating data and updating the database with each new commit, ensuring fresh data is consistently available for normalization scripts.
- Clone the repository:
git clone https://github.com/AgVicCodes/2_sql_database_normalization_with_python.git
- Install the dependencies:
pip install -r requirements.txt
- Generate sample data using data_generator.py.
- Load data into the database with load_to_sql.py.
- Run the SQL normalization scripts like normalisation.sql.
- Data generation and loading scripts.
- SQL scripts for normalization up to 4NF.
- Sample database schema and data included.
- Python
- SQL
This project is maintained by AgVicCodes.
This project is licensed under the MIT License.