Common Semantic Segmentation

Format specification

CSS format specification is available here.

Supported annotation types:

Masks

Import Common Semantic Segmentation dataset

A Datumaro project with a CSS source can be created in the following way:

datum create
datum import --format common_semantic_segmentation <path/to/dataset>

Extra import options:

--image-prefix IMAGE_PREFIX allow to import dataset with custom image prefix (by default ‘')
--mask-prefix MASK_PREFIX allow to import dataset with custom mask prefix (by default ‘')

CSS dataset directory should have the following structure:

└─ Dataset/
    ├── dataset_meta.json # a list of labels
    ├── images/
    │   ├── <img1>.png
    │   ├── <img2>.png
    │   └── ...
    └── masks/
        ├── <img1>.png
        ├── <img2>.png
        └── ...

To describe classes and colors, you should use dataset_meta.json.

To make sure that the selected dataset has been added to the project, you can run datum project info, which will display the project information.

Export to other formats

Datumaro can convert a CSS dataset into any other format Datumaro supports. To get the expected result, convert the dataset to formats that support the segmentation task (e.g. PASCAL VOC, CamVid, Cityscapes, etc.)

There are several ways to convert a CSS dataset to other dataset formats using CLI:

datum create
datum import -f common_semantic_segmentation <path/to/dataset>
datum export -f voc -o <output/dir>

datum convert -if common_semantic_segmentation -i <path/to/dataset> \
    -f cityscapes -o <output/dir> -- --save-media

Or, using Python API:

import datumaro as dm

dataset = dm.Dataset.import_from('<path/to/dataset>', 'common_semantic_segmentation')
dataset.export('save_dir', 'camvid', save_media=True)

Examples

Examples of using this format from the code can be found in the format tests