DOTA#
Format specification#
DOTA is a collection of 2K aerial images for a rotated object detection task. Each objects are annotated with 4 coordinates for representing oriented bounding boxes, a label among 15 classes (baseball-diamond, basketball-court, bridge, ground-track-field, harbor, helicopter, large-vehicle, plane, roundabout, ship, small-vehicle, soccer-ball-field, storage-tank, swimming-pool, tennis-court) and a difficulty.
Import DOTA dataset#
A Datumaro project with a DOTA source can be created in the following way:
datum project create
datum project import --format dota <path/to/dataset>
Or, using Python API:
import datumaro as dm
dataset = dm.Dataset.import_from('<path/to/dataset>', 'dota')
Directory structure#
dota/
├── train # Subset directory
│ ├── images
│ │ ├── img1.jpg # Image file
│ │ ├── img2.jpg # Image file
│ │ └── ...
│ ├── labelTxt
│ │ ├── img1.txt # Annotation file
│ │ ├── img2.txt # Annotation file
│ │ └── ...
├── val # Subset directory
│ ├── images
│ │ ├── img3.jpg # Image file
│ │ ├── img4.jpg # Image file
│ │ └── ...
│ ├── labelTxt
│ │ ├── img3.txt # Annotation file
│ │ ├── img4.txt # Annotation file
│ │ └── ...
└── ...
Annotation Txt file#
The example of <image_id>.txt
is given by DOTA annotation format.