Image zip
Format specification
The image zip format allow to export/import unannotated datasets with images to/from zip archive. The format doesn’t support any annotations or attributes.
Load Image zip dataset
Few ways to load unannotated datasets to your Datumaro project:
- From existing archive:
datum import -o project -f image_zip -i ./images.zip
- From directory with zip archives. Datumaro will loaded images from all zip files in the directory:
datum import -o project -f image_zip -i ./foo
The directory with zip archives should have the following structure:
└── foo/
├── archive1.zip/
| ├── image_1.jpg
| ├── image_2.png
| ├── subdir/
| | ├── image_3.jpg
| | └── ...
| └── ...
├── archive2.zip/
| ├── image_101.jpg
| ├── image_102.jpg
| └── ...
...
Images in a archives should have supported extension, follow the user manual to see the supported extensions.
Export to other formats
Datumaro can load dataset images from a zip archive and convert it to another supported dataset format, for example:
datum import -o project -f image_zip -i ./images.zip
datum export -f coco -o ./new_dir -- --save-images
Export unannotated dataset to zip archive
Example: exporting images from VOC dataset to zip archives:
datum import -o project -f voc -i ./VOC2012
datum export -f image_zip -o ./ --overwrite -- --name voc_images.zip \
--compression ZIP_DEFLATED
Extra options for export to image_zip format:
--save-images
allow to export dataset with saving images (default:False
);--image-ext <IMAGE_EXT>
allow to specify image extension for exporting dataset (default: use original or.jpg
, if none);--name
name of output zipfile (default:default.zip
);--compression
allow to specify archive compression method. Available methods:ZIP_STORED
,ZIP_DEFLATED
,ZIP_BZIP2
,ZIP_LZMA
(default:ZIP_STORED
). Follow zip documentation for more information.
Examples
Examples of using this format from the code can be found in the format tests