[onert] Export dedicated OneRT data types to Python #16210

arkq · 2025-10-15T14:39:56Z

NNFW types and numpy data types do not map one to one because NNFW has quantized types represented as uint8 or int16. Because of that it should be more efficient to export custom data type object which will map these two types.

Additionally, for convenience, dedicated types are exported in the top-level onert Python module, so one can use them as follows:

>>> import numpy as np
>>> import onert
>>> onert.
onert.bool          onert.dtype(        onert.float32       onert.int32         onert.native        onert.qint8         onert.tensorinfo(
onert.common        onert.experimental  onert.infer         onert.int64         onert.qint16sym     onert.quint8        onert.uint8
>>> np.array([2, 42, 42], dtype=onert.float32)
array([ 2., 42., 42.], dtype=float32)

ONE-DCO-1.0-Signed-off-by: Arkadiusz Bokowy [email protected]

NNFW types and numpy data types do not map one to one because NNFW has quantized types represented as uint8 or int16. Because of that it should be more efficient to export custom data type object which will map these two types. Additionally, for convenience, dedicated types are exported in the top-level onert Python module, so one can use them as follows: > import numpy as np, onert > np.array([2, 42, 42], dtype=onert.float32) ONE-DCO-1.0-Signed-off-by: Arkadiusz Bokowy <[email protected]>

runtime/onert/api/python/include/nnfw_api_wrapper.h

ragmani · 2025-10-17T04:27:06Z

While reviewing this PR I noticed a couple of pre-existing issues (not introduced by these changes):

Quantization info
There is currently no clear mechanism for handling quantization parameters (scale, zero_point). We probably need a consistent way to represent and preserve these attributes.
Data type mapping between onert and numpy
At the moment, there is no standard mapping layer between onert.dtype and numpy.dtype. This means patterns like

np.array([1, 2, 3], dtype=onert.qint8)

will not work as expected. We likely need a bidirectional converter and clear usage rules so that dtype handling is both intuitive and reliable.

These are not regressions introduced by this PR, but they directly affect how usable the Python API works. If possible, could you also consider addressing them when binding tensorinfo?

ragmani · 2025-10-17T04:29:19Z

One additional request: after the changes in this PR, could you please check that the existing python examples still run correctly? Currently our CI does not include python API validation.

arkq · 2025-10-22T07:48:33Z

At the moment, there is no standard mapping layer between onert.dtype and numpy.dtype. This means patterns like
np.array([1, 2, 3], dtype=onert.qint8)
will not work as expected. We likely need a bidirectional converter and clear usage rules so that dtype handling is both intuitive and reliable.

I've been thinking about extending numpy with OneRT quantized types, so then numpy will be able to convert between floats, integers and quantized types. I've never done that before though, so I will have to investigate the pros and cons of such approach. Anyway, I think that then the API will be clear and simple because it would be possible to use all numpy operations (hopefully) on quantized tensors.

ragmani · 2025-10-23T02:45:13Z

I also looked into this and found NumPy’s ongoing work on a new dtype system relevant here:

https://numpy.org/neps/roadmap.html#extensibility

NEP 41: outlines the shortcomings of the old monolithic dtype system and sets the direction for a modular, extensible design.
NEP 42: proposes the new DType class hierarchy and APIs so that user-defined dtypes (e.g. quantized types with scale/zero-point) can be registered cleanly.
NEP 43: extends universal functions (ufuncs) to integrate seamlessly with new dtypes, so user-defined dtypes can participate in NumPy operations.

All three are accepted or under discussion, but full implementation is still ongoing. Until this new dtype system is complete, introducing something like onert.qint8 directly into NumPy is not realistic.

Given this situation, I would suggest some practical options for now:

Provide a mapping/conversion layer between onert.dtype and numpy.dtype in the Python bindings to ensure usability.
If that turns out to be too complex or fragile, document it explicitly as a known issue and defer full support until NumPy’s extensible DType framework matures.

ragmani

LGTM

arkq · 2025-10-23T11:22:14Z

LGTM

Maybe we should wait with that PR (at least until the next week)? I will also look at the possibility to extend numpy data types, because I've seen somewhere (some time ago) that it is possible but maybe not very easy to do. I will investigate how complicated it would be right now (if possible at all). I will submit my investigation result here as well.

ragmani · 2025-10-23T11:46:12Z

That sounds like a good idea. I’ll look forward to your investigation.

hseok-oh · 2025-10-24T00:45:54Z

@arkq Please resolve conflicts

arkq force-pushed the python-onert-dtypes branch from 6f78932 to c36e868 Compare October 15, 2025 14:52

tomdol reviewed Oct 15, 2025

View reviewed changes

runtime/onert/api/python/include/nnfw_api_wrapper.h Outdated Show resolved Hide resolved

Protect internal fields to make dtype immutable

21b0b60

ragmani approved these changes Oct 23, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[onert] Export dedicated OneRT data types to Python #16210

[onert] Export dedicated OneRT data types to Python #16210

Uh oh!

arkq commented Oct 15, 2025

Uh oh!

Uh oh!

ragmani commented Oct 17, 2025

Uh oh!

ragmani commented Oct 17, 2025

Uh oh!

arkq commented Oct 22, 2025

Uh oh!

ragmani commented Oct 23, 2025

Uh oh!

ragmani left a comment

Uh oh!

arkq commented Oct 23, 2025 •

edited

Loading

Uh oh!

ragmani commented Oct 23, 2025

Uh oh!

hseok-oh commented Oct 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[onert] Export dedicated OneRT data types to Python #16210

Are you sure you want to change the base?

[onert] Export dedicated OneRT data types to Python #16210

Uh oh!

Conversation

arkq commented Oct 15, 2025

Uh oh!

Uh oh!

ragmani commented Oct 17, 2025

Uh oh!

ragmani commented Oct 17, 2025

Uh oh!

arkq commented Oct 22, 2025

Uh oh!

ragmani commented Oct 23, 2025

Uh oh!

ragmani left a comment

Choose a reason for hiding this comment

Uh oh!

arkq commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ragmani commented Oct 23, 2025

Uh oh!

hseok-oh commented Oct 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

arkq commented Oct 23, 2025 •

edited

Loading