Tensor Mapping APIs

Warning

Experimental features:

  • May be replaced, updated, or removed at any time.

  • Do not require maintaining API/ABI stability of their own additions over time.

  • Do not require conformance testing of their own additions.

Motivation

This extension targets the CUDA entry points cuTensorMapEncodeIm2col and cuTensorMapEncodeTiled.

Some tensor core operations on NVIDIA devices of sm_90 and later require a tensor descriptor that is built on the host and passed to the kernel. This extension maps the interfaces mentioned above to UR, providing the APIs needed to create these tensor descriptor objects, which can then be passed to kernels.
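As an illustration of the host-side work involved, the sketch below prepares the shape arguments that a tiled descriptor encode call such as cuTensorMapEncodeTiled takes for a rank-2 tensor. The type TiledArgs2D and the functions makeTiledArgs2D and satisfiesTiledConstraints are hypothetical helpers, not part of UR or CUDA. The checks reflect constraints documented for cuTensorMapEncodeTiled at the time of writing and cover only a subset of them; they should be verified against the CUDA version in use.

```cpp
#include <array>
#include <cstdint>

// Hypothetical helper type (not part of UR or CUDA): host-side shape
// arguments for a rank-2 tiled tensor descriptor.
struct TiledArgs2D {
  std::array<std::uint64_t, 2> globalDim;      // extent in elements, innermost dimension first
  std::array<std::uint64_t, 1> globalStrides;  // byte strides, rank - 1 entries
  std::array<std::uint32_t, 2> boxDim;         // tile extent in elements
  std::array<std::uint32_t, 2> elementStrides; // traversal stride per dimension
};

// Fills the arguments for a densely packed, row-major width x height tensor.
TiledArgs2D makeTiledArgs2D(std::uint64_t width, std::uint64_t height,
                            std::uint32_t tileW, std::uint32_t tileH,
                            std::uint64_t elemSize) {
  TiledArgs2D a{};
  a.globalDim = {width, height};
  a.globalStrides = {width * elemSize}; // bytes between consecutive rows
  a.boxDim = {tileW, tileH};
  a.elementStrides = {1u, 1u};
  return a;
}

// Checks a subset of the documented constraints (assumed, non-interleaved
// layout): row stride is a multiple of 16 bytes, each box extent is in
// [1, 256], and the innermost box extent spans a multiple of 16 bytes.
bool satisfiesTiledConstraints(const TiledArgs2D &a, std::uint64_t elemSize) {
  if (a.globalStrides[0] % 16 != 0)
    return false;
  for (std::uint32_t d : a.boxDim) {
    if (d == 0 || d > 256)
      return false;
  }
  return (a.boxDim[0] * elemSize) % 16 == 0;
}
```

In a real program, the data() pointers of these arrays would be passed, together with the device pointer and data type, to the encode entry point that produces the descriptor handed to the kernel.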

Changelog

Revision    Changes

1.0         Initial Draft

Support

This extension is only supported on the UR_PLATFORM_BACKEND_CUDA backend.

Contributors