RE-DACT: Secure Redaction and Anonymization Tool

RE-DACT is a tool for secure redaction, masking, and anonymization of sensitive data. The project combines machine learning, Python, and cybersecurity principles to protect data security and integrity across a range of input formats.

🖥 Project Overview

RE-DACT redacts sensitive data while preserving the structural and logical integrity of the input. It offers customizable redaction levels and advanced features such as synthetic data generation for learning and sharing purposes. The tool comes with an easy-to-use GUI.

Features:

- Redacts and anonymizes sensitive content in text, images, DOCX, CSV, PPT, PDF, and video files.
- Supports graded redaction at multiple levels, as well as user-customizable anonymization.
- Ensures data security without storing or exposing input data.
- Generates realistic synthetic data for non-sensitive sharing.
- Provides a flexible and user-friendly GUI.
- Video redaction focuses specifically on faces and on-screen text.
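
To make the idea of graded redaction levels concrete, here is a minimal sketch using only the Python standard library. It is not RE-DACT's actual implementation: the two regular expressions, the three level numbers, and the `redact` function name are all assumptions chosen for illustration, and real entity detection would cover far more than emails and phone numbers.

```python
import re

# Hypothetical patterns for illustration only; not the project's detectors.
PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "PHONE": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
}


def _mask(value: str, label: str, level: int) -> str:
    """Return the replacement for one matched value at the given level."""
    if level == 1:
        # Level 1 (assumed): keep the first and last character, mask the rest.
        if len(value) <= 2:
            return "*" * len(value)
        return value[0] + "*" * (len(value) - 2) + value[-1]
    if level == 2:
        # Level 2 (assumed): mask every character but keep the length.
        return "*" * len(value)
    # Level 3 (assumed): replace the value with a category placeholder.
    return f"[{label}]"


def redact(text: str, level: int = 2) -> str:
    """Redact every pattern match in `text` at the chosen level (1 to 3)."""
    if level not in (1, 2, 3):
        raise ValueError("level must be 1, 2, or 3")
    for label, pattern in PATTERNS.items():
        def replace(match, label=label):
            return _mask(match.group(), label, level)
        text = pattern.sub(replace, text)
    return text
```

For example, `redact("Contact jane@example.com or 555-123-4567.", level=3)` returns `"Contact [EMAIL] or [PHONE]."`, while lower levels keep the length of each value so the surrounding layout is preserved.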

Use Cases:

- Privacy compliance in data sharing and processing.
- Data preparation for machine learning and analytics.
- Anonymizing sensitive information for secure sharing.
- Legal document and video redaction.

🚀 How to Run Locally

Follow these steps to clone and run the RE-DACT project on your local machine:

Prerequisites:

- Python 3.8 or higher installed.
- Node.js installed (for the frontend).
- pip and npm package managers.
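
If you want to confirm the prerequisites before continuing, a small check like the following may help. This is a sketch, not part of the project: the `check_prerequisites` function is hypothetical, and it assumes the tools are on your `PATH` under the names `node`, `npm`, and `pip` (on some systems pip is only available as `pip3` or `python -m pip`).

```python
import shutil
import sys

MIN_PYTHON = (3, 8)  # minimum version stated in the prerequisites


def check_prerequisites(min_python=MIN_PYTHON, tools=("node", "npm", "pip")):
    """Return a dict mapping each prerequisite name to True/False."""
    # Compare only (major, minor) of the running interpreter.
    results = {"python": sys.version_info[:2] >= tuple(min_python)}
    for tool in tools:
        # shutil.which returns None when the executable is not on PATH.
        results[tool] = shutil.which(tool) is not None
    return results


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {'found' if ok else 'MISSING'}")
```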

Steps:

Backend Setup:

Clone the repository and navigate to the backend directory:

git clone https://github.com/yourusername/re-dact.git
cd re-dact/backend

Install the required dependencies:

pip install flask flask-cors werkzeug pandas requests cryptography spacy PyPDF2 opencv-python imutils ultralytics cvzone paddleocr python-docx openpyxl matplotlib faker fpdf paddlepaddle PyMuPDF python-pptx omegaconf

Key dependencies used in the project:

- Flask: For backend API development.
- Flask-CORS: For handling cross-origin requests.
- Werkzeug: For file handling.
- pandas: For data manipulation.
- requests: For HTTP requests.
- cryptography: For data encryption and hashing.
- spacy: For entity recognition and NLP tasks.
- PyPDF2: For PDF processing.
- opencv-python: For image and video processing.
- imutils: For image utilities.
- ultralytics: For YOLO-based object detection.
- paddleocr: For OCR tasks.
- python-docx: For Word document processing.
- openpyxl: For Excel file processing.
- matplotlib: For data visualization.
- faker: For generating synthetic data.
- fpdf: For generating PDF files.
- cvzone: For computer vision tasks.
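
Several of these packages are imported under a different name than the one used with pip, which can make a failed install hard to spot. The sketch below reports which packages from the install command cannot be found in the current environment. The `missing_packages` helper is hypothetical, and the import names in the table are the usual ones for these packages as far as I know; adjust them if your versions differ.

```python
import importlib.util

# pip package name -> top-level import name (assumed usual names).
MODULES = {
    "flask": "flask",
    "flask-cors": "flask_cors",
    "werkzeug": "werkzeug",
    "pandas": "pandas",
    "requests": "requests",
    "cryptography": "cryptography",
    "spacy": "spacy",
    "PyPDF2": "PyPDF2",
    "opencv-python": "cv2",
    "imutils": "imutils",
    "ultralytics": "ultralytics",
    "cvzone": "cvzone",
    "paddleocr": "paddleocr",
    "paddlepaddle": "paddle",
    "python-docx": "docx",
    "openpyxl": "openpyxl",
    "matplotlib": "matplotlib",
    "faker": "faker",
    "fpdf": "fpdf",
    "PyMuPDF": "fitz",
    "python-pptx": "pptx",
    "omegaconf": "omegaconf",
}


def missing_packages(modules=MODULES):
    """Return the pip names whose import module cannot be located."""
    missing = []
    for pip_name, import_name in modules.items():
        try:
            # find_spec locates a top-level module without importing it.
            found = importlib.util.find_spec(import_name) is not None
        except (ImportError, ValueError):
            found = False
        if not found:
            missing.append(pip_name)
    return missing


if __name__ == "__main__":
    print("Missing:", ", ".join(missing_packages()) or "none")
```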

Start the backend by running the following commands in separate terminals, all within the backend directory:
- Terminal 1:
```
python doc.py
```
- Terminal 2:
```
python final_model_full_code.py
```
- Terminal 3:
```
python freetext_code.py
```
- Terminal 4:
```
python app.py
```
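
If opening four terminals is inconvenient, the same four scripts could be started from one Python launcher. This is only a sketch of that idea: `build_commands` and `launch_all` are hypothetical helpers, not files in the repository, and the sketch assumes each script runs independently with no required startup order.

```python
import subprocess
import sys

# Script names taken from the steps above.
BACKEND_SCRIPTS = [
    "doc.py",
    "final_model_full_code.py",
    "freetext_code.py",
    "app.py",
]


def build_commands(scripts=BACKEND_SCRIPTS, python=sys.executable):
    """Return one argv list per backend script."""
    return [[python, script] for script in scripts]


def launch_all(backend_dir="."):
    """Start every backend script as a child process in backend_dir."""
    return [subprocess.Popen(cmd, cwd=backend_dir) for cmd in build_commands()]


def stop_all(processes):
    """Ask every still-running child process to terminate."""
    for proc in processes:
        if proc.poll() is None:  # None means the process has not exited
            proc.terminate()
```

Run from the backend directory, `procs = launch_all()` starts all four services and `stop_all(procs)` shuts them down again. Output from the four processes is interleaved in one terminal, so separate terminals remain easier for debugging.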

Frontend Setup:

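Navigate to the frontend directory (the repository lists it as REDACT_Frontend, so adjust the path below if your checkout uses that name):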

cd ../frontend

Install the frontend dependencies:

npm install

Start the frontend server:

npm run dev

Access the application on your browser at:

http://localhost:5173
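
If the page does not load, a quick reachability check can tell you whether the dev server is actually listening. The following is a minimal sketch with a hypothetical `is_reachable` helper; it bypasses any configured HTTP proxy because the target is local.

```python
import http.client
import urllib.error
import urllib.request


def is_reachable(url: str, timeout: float = 2.0) -> bool:
    """Return True if `url` answers with a 2xx or 3xx HTTP status."""
    # An empty ProxyHandler disables proxies for this opener.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout) as response:
            return 200 <= response.status < 400
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        # Connection refused, timeout, or an HTTP error status.
        return False


if __name__ == "__main__":
    url = "http://localhost:5173"
    print(url, "is up" if is_reachable(url) else "is not responding")
```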

🙋🏻‍♂️ ENJOY!

Swarnim1812/REDACT-TOOL
