Skip to content

Latest commit

 

History

History
153 lines (107 loc) · 5.82 KB

README.org

File metadata and controls

153 lines (107 loc) · 5.82 KB

Embrapa Wine Grape Instance Segmentation Dataset - Embrapa WGISD

Motivation for Dataset Creation

Why was the dataset created?

Embrapa WGISD (Wine Grape Instance Segmentation Dataset) was created to provide images and annotation to study object detection and instance segmentation in image-based monitoring and field robotics for viticulture. It provides instances from five different grape varieties taken on field. These instances shows variance in grape pose, illumination and focus, including genetic and phenological variations as shape, color and compactness.

What (other) tasks could the dataset be used for?

Possible uses include relaxations of the instance segmentation problem: classification (Is a grape in the image?), semantic segmentation (What are the “grape pixels” in the image?), object detection (Where are the grapes in the image?). The WGISD can be also in grape variety identification.

Has the dataset been used for any tasks already?

Who funded the creation of the dataset?

The building of the WGISD dataset was supported by the Embrapa SEG Project 01.14.09.001.05.04, Image-based metrology for Precision Agriculture and Phenotyping, and the CNPq PIBIC Program.

Dataset Composition

What are the instances?

Each instance consists in a RGB image and an annotation describing grape bunches locations as bounding boxes. A subset of the instances also contains binary masks identifying the pixels belonging to each grape bunch. Each image presents at least one grape bunch. Some grape bunches can appear far at the background and should be ignored.

Are relationships between instances made explicit in the data?

File names prefixes identify the variety observed in the instance:

PrefixVariety
CDYChardonnay
CFRCabernet Franc
CSVCabernet Sauvignon
SVBSauvignon Blanc
SYHSyrah

How many instances of each type are there?

The dataset consists of 300 images containing 4,398 grape bunches identified by bounding boxes. A subset of 125 images also contains binary masks identifying the pixels of each bunch. That means that from the 4,398 bunches, 1,898 of them presents binary masks for instance segmentation.

PrefixVarietyDateInstancesBoxed bunchesMasked bunches
CDYChardonnay2018-04-2765838242
CFRCabernet Franc2018-04-27651048460
CSVCabernet Sauvignon2018-04-2757640290
SVBSauvignon Blanc2018-04-27651313586
SYHSyrah2017-04-2748559256
Total30043981834

What data does each instance consist of?

Each instance contains a 2048 x 1365 pixels RGB image and a text file containing one bounding box description per line. These text files follows the “YOLO format”:

class cx cy w h

class is an integer defining the object class - the dataset presents only the grape class that is numbered 0, so every line starts with this “class zero” indicator. The center of the bounding box is the point (cx, cy), represented as float values because this format normalize the coordinates by the image dimensions. To get the absolute position, use (2048 * cx, 1365 * cy). The bounding box dimensions are given by w and h, also normalized by the image size.

The instances presenting mask data for instance segmentation contain files presenting the .npz extension. These files are compressed archives for Numpy n-dimensional arrays. Each array is a 1,365 x 2,048 x n_bunches three-dimensional array where n_bunches is the number of grape bunches observed in the image. After assigning the Numpy array to a variable M, the mask for the i-th grape bunch can be found in M[:,:,i]. The i-th mask corresponds to the i-th line in the bounding boxes file.

The dataset also includes the original image files, presenting resolutions bigger than 2048 x 1365 pixels. The normalized annotation for bounding boxes allows easy identification of bunches in the original images, but the mask data will need proper rescaling by users that wish work in the original full resolution.

Is everything included or does the data rely on external resources?

Everything is included in the dataset.

Are there recommended data splits or evaluation measures?

The dataset comes with specified train/test splits. The splits are found in lists stored as text files. There are also lists referring only to instances presenting binary masks.

ImagesBounding boxesMasks
Train/Val24035551402
Test60843432
Total30043981834
  • Precision/recall
  • AP
  • AP50

What experiments were initially run on this dataset?

Any other comments?

Data Collection Process

How was the data collected?

Who was involved in the data collection process?

How was the data associated with each instance acquired?

Does the dataset contain all possible instances?

If the dataset is a sample, then what is the population?

Is there information missing from the dataset and why?

Are there any known errors, sources of noise, or redundancies in the data?

Any other comments?

Data Preprocessing

Datset Distribution

Dataset Maintenance

Legal & Ethical Considerations