datumaro.components.annotations.matcher#

Functions

match_segments_more_than_pair(a_segms, b_segms)

Match segments and return sets of the matched segments which can be more than two

match_segments_pair(a_segms, b_segms[, ...])

Match segments and return pairs of the two matched segments

Classes

AnnotationMatcher(*[, context])

Method generated by attrs for class AnnotationMatcher.

BboxMatcher(*, context[, pairwise_dist, ...])

Method generated by attrs for class BboxMatcher.

CaptionsMatcher(*[, context])

Method generated by attrs for class CaptionsMatcher.

Cuboid3dMatcher(*, context[, pairwise_dist, ...])

Method generated by attrs for class Cuboid3dMatcher.

FeatureVectorMatcher(*[, context])

Method generated by attrs for class FeatureVectorMatcher.

HashKeyMatcher(*[, context])

Method generated by attrs for class HashKeyMatcher.

ImageAnnotationMatcher(*[, context])

Method generated by attrs for class ImageAnnotationMatcher.

LabelMatcher(*[, context])

Method generated by attrs for class LabelMatcher.

LineMatcher(*, context[, pairwise_dist, ...])

Method generated by attrs for class LineMatcher.

MaskMatcher(*, context[, pairwise_dist, ...])

Method generated by attrs for class MaskMatcher.

PointsMatcher(*, context[, pairwise_dist, ...])

Method generated by attrs for class PointsMatcher.

PolygonMatcher(*, context[, pairwise_dist, ...])

Method generated by attrs for class PolygonMatcher.

ShapeMatcher(*, context[, pairwise_dist, ...])

Method generated by attrs for class ShapeMatcher.

TabularMatcher(*[, context])

Method generated by attrs for class TabularMatcher.

datumaro.components.annotations.matcher.match_segments_pair(a_segms, b_segms, distance=<function segment_iou>, dist_thresh=1.0, label_matcher=<function <lambda>>)[source]#

Match segments and return pairs of the two matched segments

datumaro.components.annotations.matcher.match_segments_more_than_pair(a_segms, b_segms, distance=<function segment_iou>, dist_thresh=1.0, label_matcher=<function <lambda>>)[source]#

Match segments and return sets of the matched segments which can be more than two

class datumaro.components.annotations.matcher.AnnotationMatcher(*, context: IMatcherContext | IMergerContext | None = None)[source]#

Bases: object

Method generated by attrs for class AnnotationMatcher.

match_annotations(sources)[source]#
class datumaro.components.annotations.matcher.LabelMatcher(*, context: IMatcherContext | IMergerContext | None = None)[source]#

Bases: AnnotationMatcher

Method generated by attrs for class LabelMatcher.

distance(a, b)[source]#
match_annotations(sources)[source]#
class datumaro.components.annotations.matcher.ShapeMatcher(*, context: ~datumaro.components.abstracts.merger.IMatcherContext | ~datumaro.components.abstracts.merger.IMergerContext | None = None, pairwise_dist=0.9, cluster_dist=-1.0, match_segments=<function match_segments_pair>)[source]#

Bases: AnnotationMatcher

Method generated by attrs for class ShapeMatcher.

match_annotations(sources: List[List[Annotation]]) List[List[Annotation]][source]#
distance(a, b)[source]#
label_matcher(a, b)[source]#
class datumaro.components.annotations.matcher.BboxMatcher(*, context: ~datumaro.components.abstracts.merger.IMatcherContext | ~datumaro.components.abstracts.merger.IMergerContext | None = None, pairwise_dist=0.9, cluster_dist=-1.0, match_segments=<function match_segments_pair>)[source]#

Bases: ShapeMatcher

Method generated by attrs for class BboxMatcher.

class datumaro.components.annotations.matcher.PolygonMatcher(*, context: ~datumaro.components.abstracts.merger.IMatcherContext | ~datumaro.components.abstracts.merger.IMergerContext | None = None, pairwise_dist=0.9, cluster_dist=-1.0, match_segments=<function match_segments_pair>)[source]#

Bases: ShapeMatcher

Method generated by attrs for class PolygonMatcher.

class datumaro.components.annotations.matcher.MaskMatcher(*, context: ~datumaro.components.abstracts.merger.IMatcherContext | ~datumaro.components.abstracts.merger.IMergerContext | None = None, pairwise_dist=0.9, cluster_dist=-1.0, match_segments=<function match_segments_pair>)[source]#

Bases: ShapeMatcher

Method generated by attrs for class MaskMatcher.

class datumaro.components.annotations.matcher.PointsMatcher(*, context: ~datumaro.components.abstracts.merger.IMatcherContext | ~datumaro.components.abstracts.merger.IMergerContext | None = None, pairwise_dist=0.9, cluster_dist=-1.0, match_segments=<function match_segments_pair>, sigma: list | None = None, instance_map)[source]#

Bases: ShapeMatcher

Method generated by attrs for class PointsMatcher.

sigma: list | None#
distance(a, b)[source]#
class datumaro.components.annotations.matcher.LineMatcher(*, context: ~datumaro.components.abstracts.merger.IMatcherContext | ~datumaro.components.abstracts.merger.IMergerContext | None = None, pairwise_dist=0.9, cluster_dist=-1.0, match_segments=<function match_segments_pair>)[source]#

Bases: ShapeMatcher

Method generated by attrs for class LineMatcher.

distance(a, b)[source]#
class datumaro.components.annotations.matcher.CaptionsMatcher(*, context: IMatcherContext | IMergerContext | None = None)[source]#

Bases: AnnotationMatcher

Method generated by attrs for class CaptionsMatcher.

match_annotations(sources)[source]#
class datumaro.components.annotations.matcher.Cuboid3dMatcher(*, context: ~datumaro.components.abstracts.merger.IMatcherContext | ~datumaro.components.abstracts.merger.IMergerContext | None = None, pairwise_dist=0.9, cluster_dist=-1.0, match_segments=<function match_segments_pair>)[source]#

Bases: ShapeMatcher

Method generated by attrs for class Cuboid3dMatcher.

distance(a, b)[source]#
class datumaro.components.annotations.matcher.ImageAnnotationMatcher(*, context: IMatcherContext | IMergerContext | None = None)[source]#

Bases: AnnotationMatcher

Method generated by attrs for class ImageAnnotationMatcher.

match_annotations(sources)[source]#
class datumaro.components.annotations.matcher.HashKeyMatcher(*, context: IMatcherContext | IMergerContext | None = None)[source]#

Bases: AnnotationMatcher

Method generated by attrs for class HashKeyMatcher.

match_annotations(sources)[source]#
class datumaro.components.annotations.matcher.FeatureVectorMatcher(*, context: IMatcherContext | IMergerContext | None = None)[source]#

Bases: AnnotationMatcher

Method generated by attrs for class FeatureVectorMatcher.

match_annotations(sources)[source]#
class datumaro.components.annotations.matcher.Annotation(*, id: int = 0, attributes: Dict[str, Any] = _Nothing.NOTHING, group: int = 0, object_id: int = -1)[source]#

Bases: object

A base annotation class.

Derived classes must define the ‘_type’ class variable with a value from the AnnotationType enum.

Method generated by attrs for class Annotation.

id: int#
attributes: Dict[str, Any]#
group: int#
object_id: int#
property type: AnnotationType#
as_dict() Dict[str, Any][source]#

Returns a dictionary { field_name: value }

wrap(**kwargs)[source]#

Returns a modified copy of the object

class datumaro.components.annotations.matcher.IMatcherContext[source]#

Bases: ABC

abstract get_any_label_name(ann: Annotation, label_id: int) str[source]#
class datumaro.components.annotations.matcher.IMergerContext[source]#

Bases: IMatcherContext

abstract merge_infos(sources: Sequence[IDataset]) Dict[source]#
abstract merge_categories(sources: Sequence[IDataset]) Dict[source]#
abstract merge_media_types(sources: Sequence[IDataset]) Type[MediaElement] | None[source]#
abstract merge(sources: Sequence[IDataset]) DatasetItemStorage[source]#
datumaro.components.annotations.matcher.OKS(a, b, sigma=None, bbox=None, scale=None)[source]#

Object Keypoint Similarity metric. https://cocodataset.org/#keypoints-eval

class datumaro.components.annotations.matcher.TabularMatcher(*, context: IMatcherContext | IMergerContext | None = None)[source]#

Bases: AnnotationMatcher

Method generated by attrs for class TabularMatcher.

match_annotations(sources)[source]#
datumaro.components.annotations.matcher.approximate_line(points: Sequence[float], segments: int) ndarray[source]#

Approximates a 2d line to the required number of segments. The new points are distributed uniformly across the input line.

Parameters:
  • points (Sequence) – an array of line point coordinates. The size is [points * 2], the layout is [x0, y0, x1, y1, …].

  • segments (int) – the required numebr of segments in the resulting line.

Returns:

an array of new line point coordinates.

Return type:

new_points (numpy.ndarray)

Returns:

The size is [(segments + 1) * 2], the layout is [x0, y0, x1, y1, …].

datumaro.components.annotations.matcher.attrib(default=_Nothing.NOTHING, validator=None, repr=True, cmp=None, hash=None, init=True, metadata=None, type=None, converter=None, factory=None, kw_only=False, eq=None, order=None, on_setattr=None, alias=None)[source]#

Create a new attribute on a class.

Warning

Does not do anything unless the class is also decorated with attr.s / attrs.define / and so on!

Please consider using attrs.field in new code (attr.ib will never go away, though).

Parameters:
  • default

    A value that is used if an attrs-generated __init__ is used and no value is passed while instantiating or the attribute is excluded using init=False.

    If the value is an instance of attrs.Factory, its callable will be used to construct a new value (useful for mutable data types like lists or dicts).

    If a default is not set (or set manually to attrs.NOTHING), a value must be supplied when instantiating; otherwise a TypeError will be raised.

    The default can also be set using decorator notation as shown below.

    See also

    defaults

  • factory (callable) – Syntactic sugar for default=attr.Factory(factory).

  • validator (callable or a list of callables.) –

    callable that is called by attrs-generated __init__ methods after the instance has been initialized. They receive the initialized instance, the Attribute(), and the passed value.

    The return value is not inspected so the validator has to throw an exception itself.

    If a list is passed, its items are treated as validators and must all pass.

    Validators can be globally disabled and re-enabled using attrs.validators.get_disabled / attrs.validators.set_disabled.

    The validator can also be set using decorator notation as shown below.

    See also

    validators

  • repr (a bool or a callable to use a custom function.) – Include this attribute in the generated __repr__ method. If True, include the attribute; if False, omit it. By default, the built-in repr() function is used. To override how the attribute value is formatted, pass a callable that takes a single value and returns a string. Note that the resulting string is used as-is, i.e. it will be used directly instead of calling repr() (the default).

  • eq (a bool or a callable.) –

    If True (default), include this attribute in the generated __eq__ and __ne__ methods that check two instances for equality. To override how the attribute value is compared, pass a callable that takes a single value and returns the value to be compared.

    See also

    comparison

  • order (a bool or a callable.) –

    If True (default), include this attributes in the generated __lt__, __le__, __gt__ and __ge__ methods. To override how the attribute value is ordered, pass a callable that takes a single value and returns the value to be ordered.

    See also

    comparison

  • cmp (a bool or a callable.) –

    Setting cmp is equivalent to setting eq and order to the same value. Must not be mixed with eq or order.

    See also

    comparison

  • hash (bool | None) –

    Include this attribute in the generated __hash__ method. If None (default), mirror eq’s value. This is the correct behavior according the Python spec. Setting this value to anything else than None is discouraged.

    See also

    hashing

  • init (bool) –

    Include this attribute in the generated __init__ method. It is possible to set this to False and set a default value. In that case this attributed is unconditionally initialized with the specified default value or factory.

    See also

    init

  • converter (callable) –

    callable that is called by attrs-generated __init__ methods to convert attribute’s value to the desired format. It is given the passed-in value, and the returned value will be used as the new value of the attribute. The value is converted before being passed to the validator, if any.

    See also

    converters

  • metadata (dict | None) – An arbitrary mapping, to be used by third-party components. See extending-metadata.

  • type

    The type of the attribute. Nowadays, the preferred method to specify the type is using a variable annotation (see PEP 526). This argument is provided for backward compatibility. Regardless of the approach used, the type will be stored on Attribute.type.

    Please note that attrs doesn’t do anything with this metadata by itself. You can use it as part of your own code or for static type checking <types>.

  • kw_only (bool) – Make this attribute keyword-only in the generated __init__ (if init is False, this parameter is ignored).

  • on_setattr (callable, or a list of callables, or None, or attrs.setters.NO_OP) – Allows to overwrite the on_setattr setting from attr.s. If left None, the on_setattr value from attr.s is used. Set to attrs.setters.NO_OP to run no setattr hooks for this attribute – regardless of the setting in attr.s.

  • alias (str | None) – Override this attribute’s parameter name in the generated __init__ method. If left None, default to name stripped of leading underscores. See private-attributes.

New in version 15.2.0: convert

New in version 16.3.0: metadata

Changed in version 17.1.0: validator can be a list now.

Changed in version 17.1.0: hash is None and therefore mirrors eq by default.

New in version 17.3.0: type

Deprecated since version 17.4.0: convert

New in version 17.4.0: converter as a replacement for the deprecated convert to achieve consistency with other noun-based arguments.

New in version 18.1.0: factory=f is syntactic sugar for default=attr.Factory(f).

New in version 18.2.0: kw_only

Changed in version 19.2.0: convert keyword argument removed.

Changed in version 19.2.0: repr also accepts a custom callable.

Deprecated since version 19.2.0: cmp Removal on or after 2021-06-01.

New in version 19.2.0: eq and order

New in version 20.1.0: on_setattr

Changed in version 20.3.0: kw_only backported to Python 2

Changed in version 21.1.0: eq, order, and cmp also accept a custom callable

Changed in version 21.1.0: cmp undeprecated

New in version 22.2.0: alias

datumaro.components.annotations.matcher.attrs(maybe_cls=None, these=None, repr_ns=None, repr=None, cmp=None, hash=None, init=None, slots=False, frozen=False, weakref_slot=True, str=False, auto_attribs=False, kw_only=False, cache_hash=False, auto_exc=False, eq=None, order=None, auto_detect=False, collect_by_mro=False, getstate_setstate=None, on_setattr=None, field_transformer=None, match_args=True, unsafe_hash=None)[source]#

A class decorator that adds dunder methods according to the specified attributes using attr.ib or the these argument.

Please consider using attrs.define / attrs.frozen in new code (attr.s will never go away, though).

Parameters:
  • these (dict of str to attr.ib) –

    A dictionary of name to attr.ib mappings. This is useful to avoid the definition of your attributes within the class body because you can’t (e.g. if you want to add __repr__ methods to Django models) or don’t want to.

    If these is not None, attrs will not search the class body for attributes and will not remove any attributes from it.

    The order is deduced from the order of the attributes inside these.

  • repr_ns (str) – When using nested classes, there’s no way in Python 2 to automatically detect that. Therefore it’s possible to set the namespace explicitly for a more meaningful repr output.

  • auto_detect (bool) –

    Instead of setting the init, repr, eq, order, and hash arguments explicitly, assume they are set to True unless any of the involved methods for one of the arguments is implemented in the current class (i.e. it is not inherited from some base class).

    So for example by implementing __eq__ on a class yourself, attrs will deduce eq=False and will create neither __eq__ nor __ne__ (but Python classes come with a sensible __ne__ by default, so it should be enough to only implement __eq__ in most cases).

    Warning

    If you prevent attrs from creating the ordering methods for you (order=False, e.g. by implementing __le__), it becomes your responsibility to make sure its ordering is sound. The best way is to use the functools.total_ordering decorator.

    Passing True or False to init, repr, eq, order, cmp, or hash overrides whatever auto_detect would determine.

  • repr (bool) – Create a __repr__ method with a human readable representation of attrs attributes..

  • str (bool) – Create a __str__ method that is identical to __repr__. This is usually not necessary except for Exceptions.

  • eq (bool | None) –

    If True or None (default), add __eq__ and __ne__ methods that check two instances for equality.

    They compare the instances as if they were tuples of their attrs attributes if and only if the types of both classes are identical!

    See also

    comparison

  • order (bool | None) –

    If True, add __lt__, __le__, __gt__, and __ge__ methods that behave like eq above and allow instances to be ordered. If None (default) mirror value of eq.

    See also

    comparison

  • cmp (bool | None) –

    Setting cmp is equivalent to setting eq and order to the same value. Must not be mixed with eq or order.

    See also

    comparison

  • unsafe_hash (bool | None) –

    If None (default), the __hash__ method is generated according how eq and frozen are set.

    1. If both are True, attrs will generate a __hash__ for you.

    2. If eq is True and frozen is False, __hash__ will be set to None, marking it unhashable (which it is).

    3. If eq is False, __hash__ will be left untouched meaning the __hash__ method of the base class will be used (if base class is object, this means it will fall back to id-based hashing.).

    Although not recommended, you can decide for yourself and force attrs to create one (e.g. if the class is immutable even though you didn’t freeze it programmatically) by passing True or not. Both of these cases are rather special and should be used carefully.

    See also

  • hash (bool | None) – Alias for unsafe_hash. unsafe_hash takes precedence.

  • init (bool) –

    Create a __init__ method that initializes the attrs attributes. Leading underscores are stripped for the argument name. If a __attrs_pre_init__ method exists on the class, it will be called before the class is initialized. If a __attrs_post_init__ method exists on the class, it will be called after the class is fully initialized.

    If init is False, an __attrs_init__ method will be injected instead. This allows you to define a custom __init__ method that can do pre-init work such as super().__init__(), and then call __attrs_init__() and __attrs_post_init__().

    See also

    init

  • slots (bool) – Create a slotted class that’s more memory-efficient. Slotted classes are generally superior to the default dict classes, but have some gotchas you should know about, so we encourage you to read the glossary entry.

  • frozen (bool) –

    Make instances immutable after initialization. If someone attempts to modify a frozen instance, attrs.exceptions.FrozenInstanceError is raised.

    Note

    1. This is achieved by installing a custom __setattr__ method on your class, so you can’t implement your own.

    2. True immutability is impossible in Python.

    3. This does have a minor a runtime performance impact <how-frozen> when initializing new instances. In other words: __init__ is slightly slower with frozen=True.

    4. If a class is frozen, you cannot modify self in __attrs_post_init__ or a self-written __init__. You can circumvent that limitation by using object.__setattr__(self, "attribute_name", value).

    5. Subclasses of a frozen class are frozen too.

  • weakref_slot (bool) – Make instances weak-referenceable. This has no effect unless slots is also enabled.

  • auto_attribs (bool) –

    If True, collect PEP 526-annotated attributes from the class body.

    In this case, you must annotate every field. If attrs encounters a field that is set to an attr.ib but lacks a type annotation, an attr.exceptions.UnannotatedAttributeError is raised. Use field_name: typing.Any = attr.ib(...) if you don’t want to set a type.

    If you assign a value to those attributes (e.g. x: int = 42), that value becomes the default value like if it were passed using attr.ib(default=42). Passing an instance of attrs.Factory also works as expected in most cases (see warning below).

    Attributes annotated as typing.ClassVar, and attributes that are neither annotated nor set to an attr.ib are ignored.

    Warning

    For features that use the attribute name to create decorators (e.g. validators), you still must assign attr.ib to them. Otherwise Python will either not find the name or try to use the default value to call e.g. validator on it.

    These errors can be quite confusing and probably the most common bug report on our bug tracker.

  • kw_only (bool) – Make all attributes keyword-only in the generated __init__ (if init is False, this parameter is ignored).

  • cache_hash (bool) – Ensure that the object’s hash code is computed only once and stored on the object. If this is set to True, hashing must be either explicitly or implicitly enabled for this class. If the hash code is cached, avoid any reassignments of fields involved in hash code computation or mutations of the objects those fields point to after object creation. If such changes occur, the behavior of the object’s hash code is undefined.

  • auto_exc (bool) –

    If the class subclasses BaseException (which implicitly includes any subclass of any exception), the following happens to behave like a well-behaved Python exceptions class:

    • the values for eq, order, and hash are ignored and the instances compare and hash by the instance’s ids (N.B. attrs will not remove existing implementations of __hash__ or the equality methods. It just won’t add own ones.),

    • all attributes that are either passed into __init__ or have a default value are additionally available as a tuple in the args attribute,

    • the value of str is ignored leaving __str__ to base classes.

  • collect_by_mro (bool) –

    Setting this to True fixes the way attrs collects attributes from base classes. The default behavior is incorrect in certain cases of multiple inheritance. It should be on by default but is kept off for backward-compatibility.

    See also

    Issue #428

  • getstate_setstate (bool | None) –

    Note

    This is usually only interesting for slotted classes and you should probably just set auto_detect to True.

    If True, __getstate__ and __setstate__ are generated and attached to the class. This is necessary for slotted classes to be pickleable. If left None, it’s True by default for slotted classes and False for dict classes.

    If auto_detect is True, and getstate_setstate is left None, and either __getstate__ or __setstate__ is detected directly on the class (i.e. not inherited), it is set to False (this is usually what you want).

  • on_setattr (callable, or a list of callables, or None, or attrs.setters.NO_OP) –

    A callable that is run whenever the user attempts to set an attribute (either by assignment like i.x = 42 or by using setattr like setattr(i, "x", 42)). It receives the same arguments as validators: the instance, the attribute that is being modified, and the new value.

    If no exception is raised, the attribute is set to the return value of the callable.

    If a list of callables is passed, they’re automatically wrapped in an attrs.setters.pipe.

  • field_transformer (callable | None) –

    A function that is called with the original class object and all fields right before attrs finalizes the class. You can use this, e.g., to automatically add converters or validators to fields based on their types.

    See also

    transform-fields

  • match_args (bool) – If True (default), set __match_args__ on the class to support PEP 634 (Structural Pattern Matching). It is a tuple of all non-keyword-only __init__ parameter names on Python 3.10 and later. Ignored on older Python versions.

New in version 16.0.0: slots

New in version 16.1.0: frozen

New in version 16.3.0: str

New in version 16.3.0: Support for __attrs_post_init__.

Changed in version 17.1.0: hash supports None as value which is also the default now.

New in version 17.3.0: auto_attribs

Changed in version 18.1.0: If these is passed, no attributes are deleted from the class body.

Changed in version 18.1.0: If these is ordered, the order is retained.

New in version 18.2.0: weakref_slot

Deprecated since version 18.2.0: __lt__, __le__, __gt__, and __ge__ now raise a DeprecationWarning if the classes compared are subclasses of each other. __eq and __ne__ never tried to compared subclasses to each other.

Changed in version 19.2.0: __lt__, __le__, __gt__, and __ge__ now do not consider subclasses comparable anymore.

New in version 18.2.0: kw_only

New in version 18.2.0: cache_hash

New in version 19.1.0: auto_exc

Deprecated since version 19.2.0: cmp Removal on or after 2021-06-01.

New in version 19.2.0: eq and order

New in version 20.1.0: auto_detect

New in version 20.1.0: collect_by_mro

New in version 20.1.0: getstate_setstate

New in version 20.1.0: on_setattr

New in version 20.3.0: field_transformer

Changed in version 21.1.0: init=False injects __attrs_init__

Changed in version 21.1.0: Support for __attrs_pre_init__

Changed in version 21.1.0: cmp undeprecated

New in version 21.3.0: match_args

New in version 22.2.0: unsafe_hash as an alias for hash (for PEP 681 compliance).

datumaro.components.annotations.matcher.bbox_iou(a, b) Literal[-1] | float[source]#

IoU computations for simple cases with bounding boxes

datumaro.components.annotations.matcher.max_bbox(annotations: Iterable[Tuple[float, float, float, float] | Shape | Mask]) Tuple[float, float, float, float][source]#

Computes the maximum bbox for the set of spatial annotations and boxes.

Returns:

(x, y, w, h)

Return type:

bbox (tuple)

datumaro.components.annotations.matcher.mean_bbox(annotations: Iterable[Tuple[float, float, float, float] | Shape | Mask]) Tuple[float, float, float, float][source]#

Computes the mean bbox for the set of spatial annotations and boxes.

Returns:

(x, y, w, h)

Return type:

bbox (tuple)

datumaro.components.annotations.matcher.segment_iou(a, b)[source]#

Generic IoU computation with masks, polygons, and boxes. Returns -1 if no intersection, [0; 1] otherwise