From 12df471bf16c96b9cdae2df22ec9d163146a6137 Mon Sep 17 00:00:00 2001
From: Martin Kozlovsky
Date: Thu, 31 Oct 2024 03:30:23 +0100
Subject: [PATCH 1/9] luxonis_ml.data docs

---
 README.md                 |   2 +-
 luxonis_ml/data/README.md | 301 ++++++++++++++++++++++++++++++++++++++
 2 files changed, 302 insertions(+), 1 deletion(-)
 create mode 100644 luxonis_ml/data/README.md

diff --git a/README.md b/README.md
index 376f5745..25cb4b95 100644
--- a/README.md
+++ b/README.md
@@ -14,7 +14,7 @@ This library includes a collection of helper functions and utilities for the
 Luxonis MLOps stack. This includes the following submodules:
 
-- **Dataset Management**: Creating computer vision datasets focused around Luxonis hardware and to be used with our [LuxonisTrain](https://pypi.org/project/luxonis-train/) framework.
+- **Dataset Management**: Creating computer vision datasets focused around Luxonis hardware and to be used with our [LuxonisTrain](https://pypi.org/project/luxonis-train/) framework. Additional documentation can be found [here](https://github.com/luxonis/luxonis-ml/blob/main/luxonis_ml/data/README.md).
 - **Embeddings**: Methods to compute image embeddings.
 - **Tracking**: Our implementation of a logger for use with PyTorch Lightning or in [LuxonisTrain](https://pypi.org/project/luxonis-train/)
 - **Utils**: Miscellaneous utils for developers.

diff --git a/luxonis_ml/data/README.md b/luxonis_ml/data/README.md
new file mode 100644
index 00000000..938ab08c
--- /dev/null
+++ b/luxonis_ml/data/README.md
@@ -0,0 +1,301 @@
+# LuxonisML Data
+
+## Introduction
+
+LuxonisML Data is a library for creating and interacting with datasets in the Luxonis Data Format (LDF).
+
+The lifecycle of an LDF dataset is as follows:
+
+1. Creating a new dataset
+1. Adding data
+1. Defining splits
+1. (Optional) Adding new data or redefining splits
+1. Loading the dataset with `LuxonisLoader` for use in a training pipeline
+1. (Optional) Deleting the dataset if it is no longer needed
+
+Each of these steps will be explained in more detail in the following examples.
+
+## Prerequisites
+
+We will be using our toy dataset `parking_lot` in all examples. The dataset consists of images of cars and motorcycles in a parking lot. Each image has a corresponding annotation in the form of a bounding box, keypoints and several segmentation masks.
+
+**Dataset Annotations:**
+
+| Task                        | Annotation Type   | Classes                                                                                                 |
+| --------------------------- | ----------------- | ------------------------------------------------------------------------------------------------------- |
+| Object Detection            | Bounding Box      | car, motorcycle                                                                                         |
+| Keypoint Detection          | Keypoints         | car, motorcycle                                                                                         |
+| Color Segmentation          | Segmentation Mask | background, red, green, blue                                                                            |
+| Type Segmentation           | Segmentation Mask | background, car, motorcycle                                                                             |
+| Brand Segmentation          | Segmentation Mask | background, alfa-romeo, buick, ducati, harley, ferrari, infiniti, jeep, land-rover, rolls-royce, yamaha |
+| Binary Vehicle Segmentation | Segmentation Mask | vehicle                                                                                                 |
+
+To start, download the zipped dataset from TODO and extract it to a directory of your choice.
+
+## LuxonisDataset
+
+The first step is to create a new dataset and add some data to it.
+
+The central component of the library is the `LuxonisDataset` class.
+It serves as an abstraction over the dataset and provides methods
+for adding data and creating splits.
+You can create as many datasets as you want, each with a unique name.
+
+Datasets can be stored locally or in one of the supported cloud storage providers.
+
+First, we import `LuxonisDataset` and create a dataset with the name `"parking_lot"`.
+
+```python
+from luxonis_ml.data import LuxonisDataset
+
+dataset_name = "parking_lot"
+
+dataset = LuxonisDataset(dataset_name)
+```
+
+> \[!NOTE\]
+> By default, the dataset will be created locally. For details on different storage methods, see TODO.
+
+> \[!NOTE\]
+> If a dataset with the same name already exists, it will be loaded instead of a new one being created.
+> If you always want to create a new dataset, you can pass `delete_existing=True` to the `LuxonisDataset` constructor.
+
+### Adding Data
+
+The next step is to add data to the dataset.
+The data are provided as dictionaries with the following structure:
+
+```python
+{
+    "file": str,  # path to the image file
+    "annotation": Optional[dict]  # annotation of the file
+}
+```
+
+The content of the `"annotation"` field depends on the task type.
+
+To add the data to the dataset, we first define a generator function that yields the data.
+
+```python
+import json
+from pathlib import Path
+
+# path to the dataset, replace it with the actual path on your system
+dataset_root = Path("data/parking_lot")
+
+def generator():
+    for annotation_dir in dataset_root.iterdir():
+        with open(annotation_dir / "annotations.json") as f:
+            data = json.load(f)
+
+        # get the width and height of the image
+        W = data["dimensions"]["width"]
+        H = data["dimensions"]["height"]
+
+        image_path = annotation_dir / data["filename"]
+
+        for instance_id, bbox in data["BoundingBoxAnnotation"].items():
+
+            # get unnormalized bounding box coordinates
+            x, y = bbox["origin"]
+            w, h = bbox["dimension"]
+
+            # get the class name of the bounding box
+            class_ = bbox["labelName"]
+            yield {
+                "file": image_path,
+                "annotation": {
+                    "type": "boundingbox",
+                    "class": class_,
+
+                    # normalized bounding box
+                    "x": x / W,
+                    "y": y / H,
+                    "w": w / W,
+                    "h": h / H,
+                },
+            }
+```
+
+The generator is then passed to the `add` method of the dataset.
+
+```python
+dataset.add(generator())
+```
+
+> \[!NOTE\]
+> The `add` method accepts any iterable, not only generators.
+
+### Defining Splits
+
+The last step is to define the splits of the dataset.
+Usually, the dataset is split into training, validation, and test sets.
+
+To define splits, we use the `make_splits` method of the dataset.
+
+```python
+
+dataset.make_splits({
+    "train": 0.7,
+    "val": 0.2,
+    "test": 0.1,
+})
+```
+
+This will split the dataset into `"train"`, `"val"`, and `"test"` sets with the specified ratios.
+
+For more refined control over the splits, you can pass a dictionary with the split names as keys and lists of file names as values:
+
+```python
+
+dataset.make_splits({
+    "train": ["file1.jpg", "file2.jpg", ...],
+    "val": ["file3.jpg", "file4.jpg", ...],
+    "test": ["file5.jpg", "file6.jpg", ...],
+})
+```
+
+Calling `make_splits` with no arguments will default to an 80/10/10 split.
+
+In order for splits to be created, there must be some new data in the dataset. If no new data were added, calling `make_splits` will raise an error.
+If you wish to delete old splits and create new ones using all the data, pass `redefine_splits=True` to the method call.
+
+> \[!NOTE\]
+> There are no restrictions on the split names,
+> however for most cases one should stick to `"train"`, `"val"`, and `"test"`.
+
+### CLI Reference
+
+The `luxonis_ml` CLI provides a set of commands for managing datasets.
+These commands are accessible via the `luxonis_ml data` command.
+
+The available commands are:
+
+- `luxonis_ml data parse <dataset_dir>` - parses data in the specified directory and creates a new dataset
+- `luxonis_ml data ls` - lists all datasets
+- `luxonis_ml data info <dataset_name>` - prints information about the dataset
+- `luxonis_ml data inspect <dataset_name>` - renders the data in the dataset on screen using `cv2`
+- `luxonis_ml data delete <dataset_name>` - deletes the dataset
+
+For more information, run `luxonis_ml data --help` or pass the `--help` flag to any of the above commands.
+
+## LuxonisLoader
+
+`LuxonisLoader` provides an easy way to load datasets in the Luxonis Data Format. It is designed to work with the `LuxonisDataset` class and provides an abstraction over the dataset, allowing you to iterate over the stored data.
+
+This guide covers the loading of datasets using the `LuxonisLoader` class.
+
+The `LuxonisLoader` class can also take care of data augmentation, for more info see [Augmentation](#augmentation).
+
+## Dataset Loading
+
+To load a dataset with `LuxonisLoader`, we need an instance of `LuxonisDataset`, and we need to specify what view of the dataset we want to load.
+
+The view can be either a single split (created by `LuxonisDataset.make_splits`) or a list of splits (for more complicated datasets).
+
+```python
+from luxonis_ml.data import LuxonisLoader, LuxonisDataset
+
+dataset = LuxonisDataset("parking_lot")
+loader = LuxonisLoader(dataset, view="train")
+```
+
+The `LuxonisLoader` can be iterated over to get pairs of images and annotations.
+
+```python
+for img, labels in loader:
+    ...
+```
+
+## LuxonisParser
+
+`LuxonisParser` offers a simple API for creating datasets from several common dataset formats. All of these formats are supported by `roboflow`.
+
+The supported formats are:
+
+- COCO - We support COCO JSON format in two variants:
+  - [RoboFlow](https://roboflow.com/formats/coco-json)
+  - [FiftyOne](https://docs.voxel51.com/user_guide/export_datasets.html#cocodetectiondataset-export)
+- [Pascal VOC XML](https://roboflow.com/formats/pascal-voc-xml)
+- [YOLO Darknet TXT](https://roboflow.com/formats/yolo-darknet-txt)
+- [YOLOv4 PyTorch TXT](https://roboflow.com/formats/yolov4-pytorch-txt)
+- [MT YOLOv6](https://roboflow.com/formats/mt-yolov6)
+- [CreateML JSON](https://roboflow.com/formats/createml-json)
+- [TensorFlow Object Detection CSV](https://roboflow.com/formats/tensorflow-object-detection-csv)
+- Classification Directory - A directory with subdirectories for each class
+
+```plaintext
+dataset_dir/
+├── train/
+│   ├── class1/
+│   │   ├── img1.jpg
+│   │   ├── img2.jpg
+│   │   └── ...
+│   ├── class2/
+│   └── ...
+├── valid/
+└── test/
+```
+
+- Segmentation Mask Directory - A directory with images and corresponding masks.
+
+```plaintext
+dataset_dir/
+├── train/
+│   ├── img1.jpg
+│   ├── img1_mask.png
+│   ├── ...
+│   └── _classes.csv
+├── valid/
+└── test/
+```
+
+The masks are stored as grayscale PNG images where each pixel value corresponds to a class.
+The mapping from pixel values to class is defined in the `_classes.csv` file.
+
+```csv
+Pixel Value, Class
+0, background
+1, class1
+2, class2
+3, class3
+```
+
+### Dataset Creation
+
+The first step is to initialize the `LuxonisParser` object with the path to the dataset. Optionally, you can specify the name and the type of the dataset.
+If no name is specified, the dataset will be created with the name of the directory containing the dataset.
+If no type is specified, the parser will try to infer the type from the dataset directory structure.
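+
+For instance, when both defaults are acceptable, a minimal sketch relying purely on this inference might look as follows (the path here is a placeholder):
+
+```python
+from luxonis_ml.data import LuxonisParser
+
+# dataset_name and dataset_type are omitted on purpose:
+# the name falls back to the directory name and the type is inferred
+parser = LuxonisParser(dataset_dir="path/to/dataset")
+dataset = parser.parse()
+```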
+
+The dataset directory can either be a local directory or a directory in one of the supported cloud storage providers. The parser will automatically download the dataset if the provided path is a cloud storage URL.
+
+The directory can also be a zip file containing the dataset.
+
+```python
+from luxonis_ml.data import LuxonisParser
+from luxonis_ml.enums import DatasetType
+
+dataset_dir = "path/to/dataset"
+
+parser = LuxonisParser(
+    dataset_dir=dataset_dir,
+    dataset_name="my_dataset",
+    dataset_type=DatasetType.COCO
+)
+```
+
+After initializing the parser, you can parse the dataset to create a `LuxonisDataset` instance. The `LuxonisDataset` instance will contain the data from the dataset with splits for training, validation, and testing based on the dataset directory structure.
+
+```python
+dataset = parser.parse()
+```
+
+### CLI Reference
+
+The parser can be invoked using the `luxonis_ml data parse` command.
+
+```bash
+luxonis_ml data parse path/to/dataset --name my_dataset --type coco
+```
+
+For more detailed information, run `luxonis_ml data parse --help`.

From ffb1457b7bd2d4d35e069ad7d13fae371d4048c7 Mon Sep 17 00:00:00 2001
From: Martin Kozlovsky
Date: Thu, 31 Oct 2024 03:35:21 +0100
Subject: [PATCH 2/9] table of contents

---
 luxonis_ml/data/README.md | 15 +++++++++++++++
 1 file changed, 15 insertions(+)

diff --git a/luxonis_ml/data/README.md b/luxonis_ml/data/README.md
index 938ab08c..6599d33a 100644
--- a/luxonis_ml/data/README.md
+++ b/luxonis_ml/data/README.md
@@ -15,6 +15,21 @@ The lifecycle of an LDF dataset is as follows:
 
 Each of these steps will be explained in more detail in the following examples.
 
+## Table of Contents
+
+- [LuxonisML Data](#luxonisml-data)
+  - [Introduction](#introduction)
+  - [Prerequisites](#prerequisites)
+  - [LuxonisDataset](#luxonisdataset)
+    - [Adding Data](#adding-data)
+    - [Defining Splits](#defining-splits)
+    - [CLI Reference](#cli-reference)
+  - [LuxonisLoader](#luxonisloader)
+  - [Dataset Loading](#dataset-loading)
+  - [LuxonisParser](#luxonisparser)
+    - [Dataset Creation](#dataset-creation)
+    - [CLI Reference](#cli-reference)
+
 ## Prerequisites
 
 We will be using our toy dataset `parking_lot` in all examples. The dataset consists of images of cars and motorcycles in a parking lot. Each image has a corresponding annotation in the form of a bounding box, keypoints and several segmentation masks.

From 96f517cbdc179eea3aae7df71dc7cf4ca21efabd Mon Sep 17 00:00:00 2001
From: Martin Kozlovsky
Date: Thu, 31 Oct 2024 03:43:51 +0100
Subject: [PATCH 3/9] added annotation format

---
 luxonis_ml/data/README.md | 182 ++++++++++++++++++++++++++++++++++++++
 1 file changed, 182 insertions(+)

diff --git a/luxonis_ml/data/README.md b/luxonis_ml/data/README.md
index 6599d33a..459eade9 100644
--- a/luxonis_ml/data/README.md
+++ b/luxonis_ml/data/README.md
@@ -19,6 +19,7 @@ Each of these steps will be explained in more detail in the following examples.
 
 - [LuxonisML Data](#luxonisml-data)
   - [Introduction](#introduction)
+  - [Table of Contents](#table-of-contents)
   - [Prerequisites](#prerequisites)
   - [LuxonisDataset](#luxonisdataset)
     - [Adding Data](#adding-data)
     - [Defining Splits](#defining-splits)
     - [CLI Reference](#cli-reference)
   - [LuxonisLoader](#luxonisloader)
   - [Dataset Loading](#dataset-loading)
   - [LuxonisParser](#luxonisparser)
     - [Dataset Creation](#dataset-creation)
     - [CLI Reference](#cli-reference)
+  - [Annotation Format](#annotation-format)
+    - [Classification](#classification)
+    - [Bounding Box](#bounding-box)
+    - [Keypoints](#keypoints)
+    - [Segmentation Mask](#segmentation-mask)
+      - [Polyline](#polyline)
+      - [Binary Mask](#binary-mask)
+      - [Run-Length Encoding](#run-length-encoding)
+    - [Array](#array)
 
 ## Prerequisites
 
 We will be using our toy dataset `parking_lot` in all examples. The dataset consists of images of cars and motorcycles in a parking lot. Each image has a corresponding annotation in the form of a bounding box, keypoints and several segmentation masks.
@@ -314,3 +324,175 @@ luxonis_ml data parse path/to/dataset --name my_dataset --type coco
 ```
 
 For more detailed information, run `luxonis_ml data parse --help`.
+
+## Annotation Format
+
+The Luxonis Data Format supports several task types, each with its own annotation structure.
+Each annotation describes a single instance of the corresponding task type in the image.
+
+### Classification
+
+A single class label for the entire image.
+
+```python
+{
+    # type of the annotation, always "classification"
+    "type": "classification",
+
+    # name of the class the image belongs to
+    "class": str,
+}
+```
+
+### Bounding Box
+
+A single bounding box in the `"xywh"` format.
+The coordinates and dimensions are relative to the image size.
+
+```python
+{
+    # type of the annotation, always "boundingbox"
+    "type": "boundingbox",
+
+    # name of the class the bounding box belongs to
+    "class": str,
+
+    # unique identifier of the instance in the image
+    "instance_id": Optional[int],
+
+    # bounding box coordinates, relative to the image size
+    "x": float,  # x coordinate of the top-left corner
+    "y": float,  # y coordinate of the top-left corner
+    "w": float,  # width of the bounding box
+    "h": float,  # height of the bounding box
+}
+```
+
+### Keypoints
+
+A list of keypoint coordinates in the form of `(x, y, visibility)`.
+The coordinates are relative to the image size.
+
+The visibility can be:
+
+- 0 for keypoints outside the image
+- 1 for keypoints in the image but occluded
+- 2 for fully visible keypoints
+
+```python
+{
+    # type of the annotation, always "keypoints"
+    "type": "keypoints",
+
+    # name of the class the keypoints belong to
+    "class": str,
+
+    # unique identifier of the instance in the image
+    "instance_id": Optional[int],
+
+    # list of (x, y, visibility) coordinates of the keypoints
+    # coordinates are relative to the image size
+    # visibility is 0 for not visible, 1 for occluded and 2 for visible
+    "points": list[tuple[float, float, Literal[0, 1, 2]]],
+}
+```
+
+### Segmentation Mask
+
+There are three options for how to specify the segmentation mask:
+
+#### Polyline
+
+The mask is described as a polyline. It is assumed that the last
+point connects to the first one to form a polygon.
+The coordinates are relative to the image size.
+
+```python
+{
+    # type of the annotation, always "polyline"
+    "type": "polyline",
+
+    # name of the class this mask belongs to
+    "class": str,
+
+    # list of (x, y) coordinates forming the polyline
+    # coordinates are relative to the image size
+    # the polyline will be closed to form a polygon,
+    # i.e. the first and last point are the same
+    "polyline": list[tuple[float, float]],
+}
+```
+
+#### Binary Mask
+
+The mask is a binary 2D numpy array.
+ +```python +{ + # type of the annotation, always "mask" + "type": "mask", + + # name of the class this mask belongs to + "class": str, + + # binary mask as a 2D numpy array + # 0 for background, 1 for the object + "mask": np.ndarray, +} +``` + +#### Run-Length Encoding + +The mask is described using the [Run-Length Encoding](https://en.wikipedia.org/wiki/Run-length_encoding) compression. + +Run-length encoding compresses data by reducing the physical size +of a repeating string of characters. +This process involves converting the input data into a compressed format +by identifying and counting consecutive occurrences of each character. + +The RLE is composed of the height and width of the mask image and the counts of the pixels belonging to the positive class. + +```python +{ + # type of the annotation, always "rle" + "type": "rle", + + # name of the class this mask belongs to + "class": str, + + # height of the mask + "height": int, + + # width of the mask + "width": int, + + # counts of the pixels belonging to the positive class + "counts": list[int] | bytes, + +} +``` + +{% alert %} +The RLE format is not intended for regular use and is provided mainly to support datasets that may already be in this format. +{% /alert %} + +{% alert %} +Masks provided as numpy arrays are converted to RLE format internally. +{% /alert %} + +### Array + +An array of arbitrary data. This can be used for any custom data that doesn't fit into the other types. + +```python +{ + # type of the annotation, always "array" + "type": "array", + + # name of the class this array belongs to + "class": str, + + # path to a `.npy` file containing the array data + "path": str, +} +``` From 43a80f75219a5f7dd70aa9fcac9653f6a4b9e78c Mon Sep 17 00:00:00 2001 From: Martin Kozlovsky Date: Thu, 31 Oct 2024 03:48:41 +0100 Subject: [PATCH 4/9] fixed CLI --- luxonis_ml/data/README.md | 125 ++++++++++++++++++++++++++++++++++++-- 1 file changed, 119 insertions(+), 6 deletions(-) diff --git a/luxonis_ml/data/README.md b/luxonis_ml/data/README.md index 459eade9..17431acd 100644 --- a/luxonis_ml/data/README.md +++ b/luxonis_ml/data/README.md @@ -24,12 +24,12 @@ Each of these steps will be explained in more detail in the following examples. - [LuxonisDataset](#luxonisdataset) - [Adding Data](#adding-data) - [Defining Splits](#defining-splits) - - [CLI Reference](#cli-reference) + - [LuxonisML CLI](#luxonisml-cli) - [LuxonisLoader](#luxonisloader) - - [Dataset Loading](#dataset-loading) + - [Dataset Loading](#dataset-loading) - [LuxonisParser](#luxonisparser) - [Dataset Creation](#dataset-creation) - - [CLI Reference](#cli-reference) + - [LuxonisParser CLI](#luxonisparser-cli) - [Annotation Format](#annotation-format) - [Classification](#classification) - [Bounding Box](#bounding-box) @@ -39,6 +39,10 @@ Each of these steps will be explained in more detail in the following examples. - [Binary Mask](#binary-mask) - [Run-Length Encoding](#run-length-encoding) - [Array](#array) + - [Augmentation](#augmentation) + - [Augmentation Pipeline](#augmentation-pipeline) + - [Example](#example) + - [Usage with LuxonisLoader](#usage-with-luxonisloader) ## Prerequisites @@ -189,7 +193,7 @@ If you wish to delete old splits and create new ones using all the data, pass `r > There are no restrictions on the split names, > however for most cases one should stick to `"train"`, `"val"`, and `"test"`. -### CLI Reference +### LuxonisML CLI The `luxonis_ml` CLI provides a set of commands for managing datasets. 
These commands are accessible via the `luxonis_ml data` command. @@ -212,7 +216,7 @@ This guide covers the loading of datasets using the `LuxonisLoader` class. The `LuxonisLoader` class can also take care of data augmentation, for more info see [Augmentation](#augmentation). -## Dataset Loading +### Dataset Loading To load a dataset with `LuxonisLoader`, we need an instance of `LuxonisDataset`, and we need to specify what view of the dataset we want to load. @@ -315,7 +319,7 @@ After initializing the parser, you can parse the dataset to create a `LuxonisDat dataset = parser.parse() ``` -### CLI Reference +### LuxonisParser CLI The parser can be invoked using the `luxonis_ml data parse` command. @@ -496,3 +500,112 @@ An array of arbitrary data. This can be used for any custom data that doesn't fi "path": str, } ``` + +## Augmentation + +`LuxonisLoader` provides a way to use augmentations with your datasets. Augmentations are transformations applied to the data to increase the diversity of the dataset and improve the performance of the model. + +### Augmentation Pipeline + +The augmentations are specified as a list of dictionaries, where each dictionary represents a single augmentation. The structure of the dictionary is as follows: + +```python +{ + "name": str, # name of the augmentation + "params": dict # parameters of the augmentation +} +``` + +By default, we support most augmentations from the `albumentations` library. You can find the full list of augmentations and their parameters in the {% link href="https://albumentations.ai/docs/api_reference/augmentations/" %}Albumentations documentation{% /link %}. + +On top of that, we provide a handful of custom batch augmentations: + +- `Mosaic4` - Mosaic augmentation with 4 images. Combines crops of 4 images into a single image in a mosaic pattern. +- `MixUp` - MixUp augmentation. Overlays two images with a random weight. + +### Example + +The following example demonstrates a simple augmentation pipeline: + +```python + +[ + { + 'name': 'HueSaturationValue', + 'params': { + 'p': 0.5 + 'hue_shift_limit': 3, + 'sat_shift_limit': 70, + 'val_shift_limit': 40, + } + }, + { + 'name': 'Rotate', + 'params': { + 'p': 0.6, + 'limit': 30, + 'border_mode': 0, + 'value': [0, 0, 0] + } + }, + { + 'name': 'Perspective', + 'params': { + 'p': 0.5 + 'scale': [0.04, 0.08], + 'keep_size': True, + 'pad_mode': 0, + 'pad_val': 0, + 'mask_pad_val': 0, + 'fit_output': False, + 'interpolation': 1, + 'always_apply': False, + } + }, + { + 'name': 'Affine', + 'params': { + 'p': 0.4 + 'scale': None, + 'translate_percent': None, + 'translate_px': None, + 'rotate': None, + 'shear': 10, + 'interpolation': 1, + 'mask_interpolation': 0, + 'cval': 0, + 'cval_mask': 0, + 'mode': 0, + 'fit_output': False, + 'keep_ratio': False, + 'rotate_method': 'largest_box', + 'always_apply': False, + } + }, +] + +``` + +{% alert %} +The augmentations are **not** applied in order. Instead, an optimal order is determined based on the type of the augmentations to minimize the computational cost. +{% /alert %} + +### Usage with LuxonisLoader + +For this example, we will assume the augmentation example above is stored in a JSON file named `augmentations.json`. 
+```python
+
+from luxonis_ml.data import LuxonisDataset, LuxonisLoader, Augmentations
+import json
+
+with open("augmentations.json") as f:
+    augmentations = json.load(f)
+
+aug = Augmentations(image_size=[256, 320], augmentations=augmentations)
+dataset = LuxonisDataset("parking_lot")
+loader = LuxonisLoader(dataset, view="train", augmentations=aug)
+
+for img, labels in loader:
+    ...
+```

From 3d51b3f334ae2f5b80a33c36489a0124a298e336 Mon Sep 17 00:00:00 2001
From: Martin Kozlovsky
Date: Thu, 31 Oct 2024 11:50:57 +0100
Subject: [PATCH 5/9] remove markdoc tags

---
 luxonis_ml/data/README.md | 17 +++++++----------
 1 file changed, 7 insertions(+), 10 deletions(-)

diff --git a/luxonis_ml/data/README.md b/luxonis_ml/data/README.md
index 17431acd..21f11dbb 100644
--- a/luxonis_ml/data/README.md
+++ b/luxonis_ml/data/README.md
@@ -476,13 +476,11 @@ The RLE is composed of the height and width of the mask image and the counts of
 }
 ```
 
-{% alert %}
-The RLE format is not intended for regular use and is provided mainly to support datasets that may already be in this format.
-{% /alert %}
+> \[!NOTE\]
+> The RLE format is not intended for regular use and is provided mainly to support datasets that may already be in this format.
 
-{% alert %}
-Masks provided as numpy arrays are converted to RLE format internally.
-{% /alert %}
+> \[!NOTE\]
+> Masks provided as numpy arrays are converted to RLE format internally.
 
 ### Array
 
@@ -516,7 +514,7 @@ The augmentations are specified as a list of dictionaries, where each dictionary
 }
 ```
 
-By default, we support most augmentations from the `albumentations` library. You can find the full list of augmentations and their parameters in the {% link href="https://albumentations.ai/docs/api_reference/augmentations/" %}Albumentations documentation{% /link %}.
+By default, we support most augmentations from the `albumentations` library. You can find the full list of augmentations and their parameters in the [Albumentations documentation](https://albumentations.ai/docs/api_reference/augmentations/).
 
 On top of that, we provide a handful of custom batch augmentations:
 
@@ -586,9 +584,8 @@ The following example demonstrates a simple augmentation pipeline:
 ```
 
-{% alert %}
-The augmentations are **not** applied in order. Instead, an optimal order is determined based on the type of the augmentations to minimize the computational cost.
-{% /alert %}
+> \[!NOTE\]
+> The augmentations are **not** applied in order. Instead, an optimal order is determined based on the type of the augmentations to minimize the computational cost.
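+
+Since the pipeline is just a list of dictionaries, it does not have to be stored in a file at all. As a minimal sketch (reusing the `Augmentations` signature from the usage example below, with a single illustrative transform):
+
+```python
+from luxonis_ml.data import Augmentations
+
+# one horizontal flip defined inline rather than loaded from JSON;
+# "HorizontalFlip" is a standard albumentations transform
+augmentations = [
+    {"name": "HorizontalFlip", "params": {"p": 0.5}},
+]
+
+aug = Augmentations(image_size=[256, 320], augmentations=augmentations)
+```
+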
### Usage with LuxonisLoader From b42da4d657a2ed3783d7ae203f5cbc9e865c7802 Mon Sep 17 00:00:00 2001 From: Martin Kozlovsky Date: Thu, 31 Oct 2024 11:52:39 +0100 Subject: [PATCH 6/9] added bold text --- luxonis_ml/data/README.md | 18 +++++++++--------- 1 file changed, 9 insertions(+), 9 deletions(-) diff --git a/luxonis_ml/data/README.md b/luxonis_ml/data/README.md index 21f11dbb..aa82d8b2 100644 --- a/luxonis_ml/data/README.md +++ b/luxonis_ml/data/README.md @@ -242,16 +242,16 @@ for img, labels in loader: The supported formats are: -- COCO - We support COCO JSON format in two variants: +- **COCO** - We support COCO JSON format in two variants: - [RoboFlow](https://roboflow.com/formats/coco-json) - [FiftyOne](https://docs.voxel51.com/user_guide/export_datasets.html#cocodetectiondataset-export) -- [Pascal VOC XML](https://roboflow.com/formats/pascal-voc-xml) -- [YOLO Darknet TXT](https://roboflow.com/formats/yolo-darknet-txt) -- [YOLOv4 PyTorch TXT](https://roboflow.com/formats/yolov4-pytorch-txt) -- [MT YOLOv6](https://roboflow.com/formats/mt-yolov6) -- [CreateML JSON](https://roboflow.com/formats/createml-json) -- [TensorFlow Object Detection CSV](https://roboflow.com/formats/tensorflow-object-detection-csv) -- Classification Directory - A directory with subdirectories for each class +- [**Pascal VOC XML**](https://roboflow.com/formats/pascal-voc-xml) +- [**YOLO Darknet TXT**](https://roboflow.com/formats/yolo-darknet-txt) +- [**YOLOv4 PyTorch TXT**](https://roboflow.com/formats/yolov4-pytorch-txt) +- [**MT YOLOv6**](https://roboflow.com/formats/mt-yolov6) +- [**CreateML JSON**](https://roboflow.com/formats/createml-json) +- [**TensorFlow Object Detection CSV**](https://roboflow.com/formats/tensorflow-object-detection-csv) +- **Classification Directory** - A directory with subdirectories for each class ```plaintext dataset_dir/ @@ -266,7 +266,7 @@ dataset_dir/ └── test/ ``` -- Segmentation Mask Directory - A directory with images and corresponding masks. +- **Segmentation Mask Directory** - A directory with images and corresponding masks. 
 ```plaintext
 dataset_dir/
 ├── train/
 │   ├── img1.jpg
 │   ├── img1_mask.png
 │   ├── ...
 │   └── _classes.csv
 ├── valid/
 └── test/
 ```

From c5e1602e0329e1b47efcde38f7ea1bfc1362baac Mon Sep 17 00:00:00 2001
From: Martin Kozlovsky
Date: Thu, 31 Oct 2024 11:53:26 +0100
Subject: [PATCH 7/9] added missing commas

---
 luxonis_ml/data/README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/luxonis_ml/data/README.md b/luxonis_ml/data/README.md
index aa82d8b2..466ecf36 100644
--- a/luxonis_ml/data/README.md
+++ b/luxonis_ml/data/README.md
@@ -549,7 +549,7 @@ The following example demonstrates a simple augmentation pipeline:
     {
         'name': 'Perspective',
         'params': {
-            'p': 0.5
+            'p': 0.5,
             'scale': [0.04, 0.08],
             'keep_size': True,
             'pad_mode': 0,
@@ -563,7 +563,7 @@ The following example demonstrates a simple augmentation pipeline:
     {
         'name': 'Affine',
         'params': {
-            'p': 0.4
+            'p': 0.4,
             'scale': None,
             'translate_percent': None,
             'translate_px': None,

From 3cf7f66a41343309e70d013b1e3cae50364f8674 Mon Sep 17 00:00:00 2001
From: Martin Kozlovsky
Date: Thu, 31 Oct 2024 11:55:27 +0100
Subject: [PATCH 8/9] missing comma

---
 luxonis_ml/data/README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/luxonis_ml/data/README.md b/luxonis_ml/data/README.md
index 466ecf36..dbc4a937 100644
--- a/luxonis_ml/data/README.md
+++ b/luxonis_ml/data/README.md
@@ -531,7 +531,7 @@ The following example demonstrates a simple augmentation pipeline:
     {
         'name': 'HueSaturationValue',
         'params': {
-            'p': 0.5
+            'p': 0.5,
             'hue_shift_limit': 3,
             'sat_shift_limit': 70,
             'val_shift_limit': 40,

From 10a4f80aa689f64d5b0f9ec98984269a899c4e3e Mon Sep 17 00:00:00 2001
From: Martin Kozlovsky
Date: Thu, 31 Oct 2024 21:22:43 +0100
Subject: [PATCH 9/9] added link

---
 luxonis_ml/data/README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/luxonis_ml/data/README.md b/luxonis_ml/data/README.md
index dbc4a937..55b1fd95 100644
--- a/luxonis_ml/data/README.md
+++ b/luxonis_ml/data/README.md
@@ -59,7 +59,7 @@ We will be using our toy dataset `parking_lot` in all examples. The dataset cons
 | Brand Segmentation          | Segmentation Mask | background, alfa-romeo, buick, ducati, harley, ferrari, infiniti, jeep, land-rover, rolls-royce, yamaha |
 | Binary Vehicle Segmentation | Segmentation Mask | vehicle                                                                                                 |
 
-To start, download the zipped dataset from TODO and extract it to a directory of your choice.
+To start, download the zipped dataset from [here](https://drive.google.com/uc?export=download&id=1OAuLlL_4wRSzZ33BuxM6Uw2QYYgv_19N) and extract it to a directory of your choice.
 
 ## LuxonisDataset