`nncase_runtime` Module API Manual#

Overview#

This manual introduces the nncase_runtime module of the CanMV platform, guiding developers to invoke the KPU (Neural Network Processing Unit) and AI2D modules in the MicroPython environment to implement efficient deep learning inference and image processing functions.

Abbreviation	Description
nncase_runtime	k230 nncase runtime package, includes KPU module and AI2D module
nncase_runtime.kpu	kpu module
nncase_runtime.ai2d	ai2d module

API Introduction#

from_numpy#

Description

This function is used to create a runtime_tensor from a ulab.numpy object in MicroPython.

Syntax

runtime_tensor = nncase_runtime.from_numpy(ulab.numpy)

Parameters

Parameter Name	Description	Input/Output
ulab.numpy	numpy object	Input

Return Value

Return Value	Description
runtime_tensor	Returns the created runtime_tensor object.
Other	If it fails, a C++ exception will be thrown.

to_numpy#

Description

Converts a runtime_tensor object to a ulab.numpy object.

Syntax

runtime_tensor = kpu.get_output_tensor(0)
result = runtime_tensor.to_numpy()

Parameters

None.

Return Value

Return Value	Description
ulab.numpy	Returns the ulab.numpy object converted from runtime_tensor.
Other	If it fails, a C++ exception will be thrown.

nncase_runtime.kpu#

The kpu module provides basic functions for invoking KPU hardware to perform neural network model inference, including loading models, setting input data, executing inference, and obtaining output results.

load_kmodel#

Description

Loads a neural network model in the compiled kmodel format.

Syntax

load_kmodel(read_bin)
load_kmodel(path)

Parameters

Parameter Name	Description	Input/Output
read_bin	Binary content of kmodel	Input
path	File path of kmodel	Input

Return Value

Return Value	Description
None	Loaded successfully.
Other	If it fails, a C++ exception will be thrown.

set_input_tensor#

Description

Sets the input runtime_tensor for kmodel inference.

Syntax

set_input_tensor(index, runtime_tensor)

Parameters

Parameter Name	Description	Input/Output
index	Input index of kmodel	Input
runtime_tensor	Tensor containing input data information	Input

Return Value

Return Value	Description
None	Set successfully.
Other	If it fails, a C++ exception will be thrown.

get_input_tensor#

Description

Gets the input runtime_tensor for kmodel inference.

Syntax

get_input_tensor(index)

Parameters

Parameter Name	Description	Input/Output
index	Input index of kmodel	Input

Return Value

Return Value	Description
runtime_tensor	Tensor containing input data information.
Other	If it fails, a C++ exception will be thrown.

set_output_tensor#

Description

Sets the output result after kmodel inference.

Syntax

set_output_tensor(index, runtime_tensor)

Parameters

Parameter Name	Description	Input/Output
index	Output index of kmodel	Input
runtime_tensor	Tensor of the output result	Input

Return Value

Return Value	Description
None	Set successfully.
Other	If it fails, a C++ exception will be thrown.

get_output_tensor#

Description

Gets the output result after kmodel inference.

Syntax

get_output_tensor(index)

Parameters

Parameter Name	Description	Input/Output
index	Output index of kmodel	Input

Return Value

Return Value	Description
runtime_tensor	Gets the runtime_tensor of the index-th output.
Other	If it fails, a C++ exception will be thrown.

run#

Description

Starts the kmodel inference process.

Syntax

run()

Return Value

Return Value	Description
None	Inference successful
Other	If it fails, a C++ exception will be thrown.

inputs_size#

Description

Gets the number of inputs of the kmodel.

Syntax

inputs_size()

Return Value

Return Value	Description
size_t	Number of inputs of kmodel.
Other	If it fails, a C++ exception will be thrown.

outputs_size#

Description

Gets the number of outputs of the kmodel.

Syntax

outputs_size()

Return Value

Return Value	Description
size_t	Number of outputs of kmodel.
Other	If it fails, a C++ exception will be thrown.

get_input_desc#

Description

Gets the input description information of the specified index.

Syntax

get_input_desc(index)

Parameters

Parameter Name	Description	Input/Output
index	Input index of kmodel	Input

Return Value

Return Value	Description
MemoryRange	Information of the index-th input, including `dtype`, `start`, `size`.

get_output_desc#

Description

Gets the output description information of the specified index.

Syntax

get_output_desc(index)

Parameters

Parameter Name	Description	Input/Output
index	Output index of kmodel	Input

Return Value

Return Value	Description
MemoryRange	Information of the index-th output, including `dtype`, `start`, `size`.

nncase_runtime.ai2d#

build#

Description

Construct an ai2d_builder object.

Syntax

build(input_shape, output_shape)

Parameters

Name	Description
input_shape	Input shape
output_shape	Output shape

Return Value

Return Value	Description
ai2d_builder	Returns the ai2d_builder object for execution.
Other	If failed, a C++ exception will be thrown.

run#

Description

Configure registers and start AI2D computation.

Syntax

ai2d_builder.run(input_tensor, output_tensor)

Parameters

Name	Description
input_tensor	Input tensor
output_tensor	Output tensor

Return Value

Return Value	Description
None	Success.
Other	If failed, a C++ exception will be thrown.

set_dtype#

Description

Set the data type during AI2D computation.

Syntax

set_dtype(src_format, dst_format, src_type, dst_type)

Parameters

Name	Type	Description
src_format	ai2d_format	Input data format
dst_format	ai2d_format	Output data format
src_type	datatype_t	Input data type
dst_type	datatype_t	Output data type

set_crop_param#

Description

Configure crop-related parameters.

Syntax

set_crop_param(crop_flag, start_x, start_y, width, height)

Parameters

Name	Type	Description
crop_flag	bool	Whether to enable the crop function
start_x	int	Starting pixel in the width direction
start_y	int	Starting pixel in the height direction
width	int	Width
height	int	Height

set_shift_param#

【Description】

Used to configure shift-related parameters.

【Definition】

set_shift_param(shift_flag, shift_val)

【Parameters】

Name	Type	Description
shift_flag	bool	Whether to enable the shift function
shift_val	int	Number of bits for right shift

set_pad_param#

【Description】

Used to configure pad-related parameters.

【Definition】

set_pad_param(pad_flag, paddings, pad_mode, pad_val)

【Parameters】

Name	Type	Description
pad_flag	bool	Whether to enable the pad function
paddings	list	Padding for each dimension, size=8, representing the number of front and back paddings for dim0 to dim4 respectively, where dim0/dim1 are fixed at {0, 0}
pad_mode	int	Only supports pad constant, configure 0
pad_val	list	Padding value for each channel

set_resize_param#

【Description】

Used to configure resize-related parameters.

【Definition】

set_resize_param(resize_flag, interp_method, interp_mode)

【Parameters】

Name	Type	Description
resize_flag	bool	Whether to enable the resize function
interp_method	ai2d_interp_method	Resize interpolation method
interp_mode	ai2d_interp_mode	Resize mode

set_affine_param#

【Description】

Used to configure affine-related parameters.

【Definition】

set_affine_param(affine_flag, interp_method, cord_round, bound_ind, bound_val, bound_smooth, M)

【Parameters】

Name	Type	Description
affine_flag	bool	Whether to enable the affine function
interp_method	ai2d_interp_method	Interpolation method used by Affine
cord_round	uint32_t	Integer boundary 0 or 1
bound_ind	uint32_t	Boundary pixel mode 0 or 1
bound_val	uint32_t	Boundary fill value
bound_smooth	uint32_t	Boundary smoothing 0 or 1
M	list	Vector corresponding to the affine transformation matrix. The affine transformation is \(Y=\[a_0, a_1; a_2, a_3\] \cdot X + \[b_0, b_1\] \), then M=[a_0,a_1,b_0,a_2,a_3,b_1 ]

shrink_memory_pool#

【Description】

Cleans up the memory pool generated by nncase_runtime and releases memory.

【Syntax】

import gc
import nncase_runtime

gc.collect()
nncase_runtime.shrink_memory_pool()

Reference Example#

import os
import ujson
import nncase_runtime as nn
import ulab.numpy as np
import image
import sys

# 加载kmodel
kmodel_path="/sdcard/examples/kmodel/face_detection_320.kmodel"
kpu=nn.kpu()
kpu.load_kmodel(kmodel_path)

# 读取一张图片，并将其transpose成chw数据
img_path="/sdcard/examples/utils/db_img/id_1.jpg"
img_data = image.Image(img_path).to_rgb888()
img_hwc=img_data.to_numpy_ref()
shape=img_hwc.shape
img_tmp = img_hwc.reshape((shape[0] * shape[1], shape[2]))
img_tmp_trans = img_tmp.transpose().copy()
img_chw=img_tmp_trans.reshape((shape[2],shape[0],shape[1]))

# 初始化ai2d预处理，并运行ai2d预处理，输出模型输入分辨率320*320
ai2d=nn.ai2d()
output_data = np.ones((1,3,320,320),dtype=np.uint8)
ai2d_output_tensor = nn.from_numpy(output_data)
ai2d.set_dtype(nn.ai2d_format.NCHW_FMT, nn.ai2d_format.NCHW_FMT, np.uint8, np.uint8)
ai2d.set_resize_param(True,nn.interp_method.tf_bilinear, nn.interp_mode.half_pixel)
ai2d_builder = ai2d.build([1,3,img_chw.shape[1],img_chw.shape[2]], [1,3,320,320])
ai2d_input_tensor = nn.from_numpy(img_chw)
ai2d_builder.run(ai2d_input_tensor, ai2d_output_tensor)

# kmodel运行，并获取输出，输出从tensor转成ulab.numpy.ndarray数据
kpu.set_input_tensor(0, ai2d_output_tensor)
kpu.run()
for i in range(kpu.outputs_size()):
    output_data = kpu.get_output_tensor(i)
    result = output_data.to_numpy()
    print(result.shape)

Conclusion#

By using the nncase_runtime module, developers can implement efficient deep learning inference and image processing functions on the CanMV platform. This manual provides detailed descriptions of the API and usage examples, aiming to help developers get started quickly and efficiently develop related applications.

nncase_runtime Module API Manual

Contents

`nncase_runtime` Module API Manual#

Overview#

API Introduction#

from_numpy#

to_numpy#

nncase_runtime.kpu#

load_kmodel#

set_input_tensor#

get_input_tensor#

set_output_tensor#

get_output_tensor#

run#

inputs_size#

outputs_size#

get_input_desc#

get_output_desc#

nncase_runtime.ai2d#

build#

run#

set_dtype#

set_crop_param#

set_shift_param#

set_pad_param#

set_resize_param#

set_affine_param#

shrink_memory_pool#

Reference Example#

Conclusion#

nncase_runtime Module API Manual

Contents

nncase_runtime Module API Manual#

Overview#

API Introduction#

from_numpy#

to_numpy#

nncase_runtime.kpu#

load_kmodel#

set_input_tensor#

get_input_tensor#

set_output_tensor#

get_output_tensor#

run#

inputs_size#

outputs_size#

get_input_desc#

get_output_desc#

nncase_runtime.ai2d#

build#

run#

set_dtype#

set_crop_param#

set_shift_param#

set_pad_param#

set_resize_param#

set_affine_param#

shrink_memory_pool#

Reference Example#

Conclusion#

`nncase_runtime` Module API Manual#