Openvino Adapter#

class model_api.adapters.openvino_adapter.OpenvinoAdapter(core, model, weights_path=None, model_parameters={}, device='CPU', plugin_config=None, max_num_requests=0, precision='FP16', download_dir=None, cache_dir=None)#

Bases: InferenceAdapter

Works with OpenVINO model

precision, download_dir and cache_dir are ignored if model is a path to a file

await_all()#

In case of asynchronous execution waits the completion of all busy infer requests.

Return type:

None

await_any()#

In case of asynchronous execution waits the completion of any busy infer request until it becomes available for the data submission.

Return type:

None

copy_raw_result(request)#
embed_preprocessing(layout, resize_mode, interpolation_mode, target_shape, pad_value, dtype=<class 'int'>, brg2rgb=False, mean=None, scale=None, input_idx=0)#

Embeds OpenVINO PrePostProcessor module into the model.

Return type:

None

get_input_layers()#
Gets the names of model inputs and for each one creates the Metadata structure,

which contains the information about the input shape, layout, precision in OpenVINO format, meta (optional)

Return type:

dict[str, Metadata]

Returns:

  • the dict containing Metadata for all inputs

get_layout_for_input(input, shape=None)#
Return type:

str

get_model()#

Returns the openvino.Model object

Returns:

openvino.Model object

get_output_layers()#
Gets the names of model outputs and for each one creates the Metadata structure,

which contains the information about the output shape, layout, precision in OpenVINO format, meta (optional)

Return type:

dict[str, Metadata]

Returns:

  • the dict containing Metadata for all outputs

get_raw_result(request)#

Gets raw results from the internal inference framework representation as a dict.

Parameters:

infer_result (-) – framework-specific result of inference from the model

Returns:

{

‘output_layer_name_1’: raw_result_1, ‘output_layer_name_2’: raw_result_2, …

}

Return type:

  • raw result (dict) - model raw output in the following format

get_rt_info(path)#

Gets an attribute value from OV.model_info structure.

Parameters:

path (list[str]) – a suquence of tag names leading to the attribute.

Returns:

attribute value wrapped into OVAny object.

Return type:

OVAny

infer_async(dict_data, callback_data)#

Performs the asynchronous model inference and sets the callback for inference completion. Also, it should define get_raw_result() function, which handles the result of inference from the model.

Parameters:
  • dict_data (-) –

    it’s submitted to the model for inference and has the following format: {

    ’input_layer_name_1’: data_1, ‘input_layer_name_2’: data_2, …

    }

  • callback_data (-) – the data for callback, that will be taken after the model inference is ended

Return type:

None

infer_sync(dict_data)#

Performs the synchronous model inference. The infer is a blocking method.

Parameters:

dict_data (-) –

it’s submitted to the model for inference and has the following format: {

’input_layer_name_1’: data_1, ‘input_layer_name_2’: data_2, …

}

Returns:

{

‘output_layer_name_1’: raw_result_1, ‘output_layer_name_2’: raw_result_2, …

}

Return type:

  • raw result (dict) - model raw output in the following format

is_ready()#

In case of asynchronous execution checks if one can submit input data to the model for inference, or all infer requests are busy.

Return type:

bool

Returns:

  • the boolean flag whether the input data can be

    submitted to the model for inference or not

load_model()#

Loads the model on the device.

Return type:

None

log_runtime_settings()#
Return type:

None

operations_by_type(operation_type)#
reshape_model(new_shape)#

Reshapes the model inputs to fit the new input shape.

Parameters:

new_shape (-) –

the dictionary with inputs names as keys and list of new shape as values in the following format: {

’input_layer_name_1’: [1, 128, 128, 3], ‘input_layer_name_2’: [1, 128, 128, 3], …

}

save_model(path, weights_path=None, version=None)#

Saves OV model as two files: .xml (architecture) and .bin (weights).

Parameters:
  • path (str) – path to save the model files (.xml and .bin).

  • weights_path (str, optional) – Optional path to save .bin if it differs from .xml path. Defaults to None.

  • version (str, optional) – Output IR model version (for instance, IR_V10). Defaults to None.

set_callback(callback_fn)#

Sets callback that grabs results of async inference.

Parameters:

callback_fn (Callable) – Callback function.

update_model_info(model_info)#

Populates OV IR RT info with the given model info.

Parameters:

model_info (dict[str, Any]) – a dict representing the serialized parameters.

model_api.adapters.openvino_adapter.create_core()#
Return type:

Core

model_api.adapters.openvino_adapter.get_input_shape(input_tensor)#
Return type:

list[int]

model_api.adapters.openvino_adapter.get_user_config(flags_d, flags_nstreams, flags_nthreads=None)#
Return type:

dict[str, str]

model_api.adapters.openvino_adapter.parse_devices(device_string)#
Return type:

list[str]

model_api.adapters.openvino_adapter.parse_value_per_device(devices, values_string)#

Format: <device1>:<value1>,<device2>:<value2> or just <value>

Return type:

dict[str, int]