Kinetics
Format specification
Kinetics 400/600/700 is a video datasets for action recognition task. Dataset is available for downloading here
Supported media type:
Video
Supported type of annotations:
Label
Supported attributes for labels:
time_start
(integer) - time (in seconds) of the start of recognized actiontime_end
(integer) - time (in seconds) of the end of recognized action
Import Kinetics dataset
A Datumaro project with a Kinetics dataset can be created in the following way using CLI:
datum create
datum import -f kinetics <path_to_dataset>
Or using Python API:
import datumaro as dm
dataset = dm.Dataset.import_from('<path_to_dataset>', format='kinetics')
├── test.csv
├── train.json
├── train
│ ├── <name_of_video_1_with_yt_id>.avi # extension of video could be other
│ ├── <name_of_video_2_with_yt_id>.avi
│ ├── ...
└── test
├── <name_of_video_100_with_yt_id>.avi # extension of video could be other
├── <name_of_video_101_with_yt_id>.avi
├── ...
Kinetics dataset has two equivalent annotation file formats: .csv
and
.json
. Datumaro supports both, but in case when two annotation files have
same names but different extensions Datumaro will use .csv
.
Note: name of each video file must contain youtube_id of this video, that specified in annotation file. And to speed up the import, you can leave only the youtube_id in the video filename.
See the full list of supported video extensions here.