Dataset Forge

Overview

This project creates a diverse, labeled image dataset using generative AI inpainting, built on the Stable Diffusion Inpainting Pipeline. It automates the creation of an object detection image dataset (YOLO format) from a collection of base images and text prompts describing the objects to insert.

Advantages

Enhanced Efficiency and Scalability

Speed: Automating dataset curation and labeling with Dataset Forge is much faster than doing the same work manually. It significantly reduces the time required to build a large-scale, labeled dataset.

Scalability: The ability to generate and label thousands of images with minimal human intervention makes Dataset Forge ideal for projects requiring large datasets.

Cost-Effectiveness

Reduced Labor Costs: Manual dataset curation and labeling are labor-intensive and costly. Dataset Forge minimizes the need for human annotators, thereby reducing overall project costs.

Enhanced Accuracy and Consistency

Consistent Labeling: Human labeling is prone to inconsistencies and errors. Dataset Forge offers a standardized, rule-based approach, ensuring uniformity and accuracy in labels across the dataset.

Customization and Flexibility

Adaptability: The tool's configuration can be easily modified to suit various project needs, allowing for the generation of diverse datasets tailored to specific requirements.

Diverse Data Generation: Dataset Forge can create a wide range of images using different inpainting prompts, thereby ensuring a diverse and comprehensive dataset.

Innovative Capabilities

Inpainting Precision: The use of Generative AI inpainting techniques allows for precise object placement and integration within images, which is difficult to achieve manually.

Integration with Advanced AI Models: The tool's compatibility with various diffusion models from Hugging Face allows for continuous improvement and integration of cutting-edge AI technologies.

Dataset Evolution

Dynamic Dataset Enhancement: The tool can continuously update and expand datasets with new images and labels, increasing the diversity of the dataset to improve model training.

Features

Inpainting of objects into images based on textual prompts.
Generation of masks for selective inpainting.
Automated creation of a labeled image dataset in YOLO bounding box format.
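
As an illustration of the last two features, the sketch below builds a rectangular inpainting mask and converts its extent into a YOLO label line: a class index followed by the box center, width, and height, each normalized by the image size. The function names and the list-of-rows mask representation are assumptions made for this example; they are not taken from the project's source.

```python
def make_rect_mask(width, height, box):
    """Return a height x width grid with 1 inside box and 0 elsewhere.

    box is (x0, y0, x1, y1) in pixels, with exclusive upper bounds.
    """
    x0, y0, x1, y1 = box
    return [
        [1 if x0 <= x < x1 and y0 <= y < y1 else 0 for x in range(width)]
        for y in range(height)
    ]


def mask_to_yolo_label(mask, class_id):
    """Convert the masked region's bounding box into a YOLO label line."""
    height, width = len(mask), len(mask[0])
    rows = [y for y, row in enumerate(mask) if any(row)]
    cols = [x for x in range(width) if any(row[x] for row in mask)]
    if not rows:
        return None  # empty mask, nothing to label
    x0, x1 = cols[0], cols[-1] + 1
    y0, y1 = rows[0], rows[-1] + 1
    # YOLO stores the box center and size as fractions of the image size.
    x_center = (x0 + x1) / 2 / width
    y_center = (y0 + y1) / 2 / height
    box_w = (x1 - x0) / width
    box_h = (y1 - y0) / height
    return f"{class_id} {x_center:.6f} {y_center:.6f} {box_w:.6f} {box_h:.6f}"


mask = make_rect_mask(200, 100, (40, 10, 120, 60))
label = mask_to_yolo_label(mask, class_id=3)
print(label)  # 3 0.400000 0.350000 0.400000 0.500000
```

Because the label is derived from the mask rather than from the generated object, the box covers the whole masked region even when the object fills only part of it.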

Requirements

To run this project, you will need Python 3.10.x along with the dependencies listed in requirements.txt. Using a virtual environment is recommended. First install PyTorch by following the official installation instructions (https://pytorch.org/get-started/locally/) for the version listed in the requirements file and for your system configuration. Then install the remaining requirements using:

pip install -r requirements.txt

Structure

dataset-forge/
│
├── src/                                 # Source code
│   ├── inpainting_dataset_generator.py  # Main dataset generation script
│   ├── inpainting_config.yaml           # Default prompts/parameters 
│   └── inpainting.py                    # Inpainting functions
│
├── data/                                # Data directory (input/output)
│   ├── base_images/                     # Base images to inpaint objects into
│   └── output/                          # Output images and bounding box files
│
├── tools/                               # Tools to make datasets            
│   └── generate_inpaint.py              # Script to generate dataset
│
├── notebooks/                           # Jupyter notebooks for demos
│
├── requirements.txt                     # Project dependencies
├── LICENSE                              # License information
└── README.md                            # Project overview (this file)

Usage

To use this project, follow these steps:

Prepare your dataset: Place your input images in data/base_images/. These images should represent the environments you want to inpaint objects into.
Set your prompts: Edit the config file at src/inpainting_config.yaml with your desired prompts and the corresponding object classes for the dataset.
Run the script: Run the following from the project root directory. Warning: on the first run, this downloads the model specified in the config file from Hugging Face, so ensure you have enough disk space.

python tools/generate_inpaint.py

Check the results: The output images and YOLO formatted label text files will be saved in data/output/.
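
When checking the results, it can help to convert a label line back into pixel coordinates so the box can be compared against the image. The following is a minimal sketch that assumes the standard YOLO line layout (class index, then normalized center x, center y, width, height); the function name is hypothetical and not part of this project.

```python
def yolo_to_pixel_box(line, image_width, image_height):
    """Parse one YOLO label line into (class_id, (x0, y0, x1, y1)) in pixels."""
    class_id, x_center, y_center, box_w, box_h = line.split()
    x_center, y_center, box_w, box_h = (
        float(v) for v in (x_center, y_center, box_w, box_h)
    )
    # Undo the normalization, then move from center/size to corner coordinates.
    x0 = round((x_center - box_w / 2) * image_width)
    y0 = round((y_center - box_h / 2) * image_height)
    x1 = round((x_center + box_w / 2) * image_width)
    y1 = round((y_center + box_h / 2) * image_height)
    return int(class_id), (x0, y0, x1, y1)


# Example label line for a 200 x 100 pixel image.
parsed = yolo_to_pixel_box("3 0.400000 0.350000 0.400000 0.500000", 200, 100)
print(parsed)  # (3, (40, 10, 120, 60))
```

The image size has to be supplied separately because YOLO label files store only normalized values.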

Notebooks

The notebooks/ directory contains Jupyter notebooks for demonstration and testing. These notebooks provide examples of how to use the functions in this project.

Feature Roadmap

Add functionality to generate the base image dataset in addition to the inpainted one.
Improve bounding box tightness for better labels.
Add a user interface.
Add more label formats (COCO, Pascal VOC, etc.).

Known Issues

Sometimes the inpainting fails to generate any noticeable changes.
Bounding boxes are slightly loose (not tightly hugging the generated objects).
Not all diffusion models on Hugging Face work with this tool.
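
One possible way to detect the first two issues after generation is to compare each base image with its inpainted result. The sketch below is an assumption about how this could be done, not something the project currently implements: it treats images as grayscale grids (lists of rows of 0-255 integers), returns None when no pixel changed by more than a threshold, and otherwise returns the tight box around the changed pixels. The function name and the default threshold are hypothetical.

```python
def changed_pixel_box(before, after, threshold=25):
    """Return the tight (x0, y0, x1, y1) box around pixels that changed.

    before and after are equally sized grayscale images given as lists of
    rows of ints in 0-255. Upper bounds are exclusive. Returns None when
    no pixel differs by more than threshold.
    """
    xs, ys = [], []
    for y, (row_before, row_after) in enumerate(zip(before, after)):
        for x, (b, a) in enumerate(zip(row_before, row_after)):
            if abs(a - b) > threshold:
                xs.append(x)
                ys.append(y)
    if not xs:
        return None  # inpainting produced no noticeable change
    return (min(xs), min(ys), max(xs) + 1, max(ys) + 1)


# Toy example: an 8 x 6 black image with a bright 3 x 3 patch painted in.
before = [[0] * 8 for _ in range(6)]
after = [row[:] for row in before]
for y in range(1, 4):
    for x in range(2, 5):
        after[y][x] = 200

print(changed_pixel_box(before, after))   # (2, 1, 5, 4)
print(changed_pixel_box(before, before))  # None
```

On real outputs the inpainted region may also differ slightly from the base image outside the object itself, so the threshold would need tuning before the resulting box could replace the mask-based label.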

License

The diffusion model weights are licensed under creativeml-openrail-m. The code is licensed under CC-BY-NC.

Contact

For any queries or suggestions, please contact me here on GitHub.

Citation

    @InProceedings{Rombach_2022_CVPR,
        author    = {Rombach, Robin and Blattmann, Andreas and Lorenz, Dominik and Esser, Patrick and Ommer, Bj\"orn},
        title     = {High-Resolution Image Synthesis With Latent Diffusion Models},
        booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
        month     = {June},
        year      = {2022},
        pages     = {10684-10695}
    }
