datumaro.plugins.data_formats.arrow#
- class datumaro.plugins.data_formats.arrow.ArrowBase(root_path: str, *, file_paths: List[str], ctx: ImportContext | None = None)[source]#
Bases:
DatasetBase
- categories() Dict[AnnotationType, Categories] [source]#
Returns metainfo about dataset labels.
- get(item_id: str, subset: str | None = None) DatasetItem | None [source]#
Provides random access to dataset items.
- class datumaro.plugins.data_formats.arrow.ArrowExporter(extractor: IDataset, save_dir: str, *, save_media: bool | None = None, image_ext: str | Callable[[str], bytes] | None = None, default_image_ext: str | None = None, save_dataset_meta: bool = False, ctx: ExportContext | None = None, num_workers: int = 0, max_shard_size: int | None = 1000, num_shards: int | None = None, prefix: str = 'datum', **kwargs)[source]#
Bases:
Exporter
- AVAILABLE_IMAGE_EXTS = ('AS-IS', 'PNG', 'TIFF', 'JPEG/95', 'JPEG/75', 'NONE')#
- DEFAULT_IMAGE_EXT = 'AS-IS'#
- class datumaro.plugins.data_formats.arrow.ArrowImporter[source]#
Bases:
Importer
- classmethod detect(context: FormatDetectionContext) FormatDetectionConfidence | None [source]#
Modules