Intel MKL-DNN functionality supports a number of numerical data types. IEEE single precision floating point (f32) is considered the gold standard in deep learning applications and is supported by all library functions. The purpose of supporting lower precision data types is to improve the performance of compute-intensive operations, such as convolutions, inner products, and recurrent neural network cells, relative to f32.
Data type | Description |
---|---|
f32 | IEEE single precision floating point |
bf16 | non-IEEE 16-bit floating point (bfloat16) |
f16 | IEEE half precision floating point |
s8/u8 | signed/unsigned 8-bit integer |
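In the C++ API these names map onto the `mkldnn::memory::data_type` enumeration. The following minimal sketch, assuming the v1.x `mkldnn.hpp` header, describes the same tensor at several precisions:

```cpp
#include <mkldnn.hpp>

int main() {
    using namespace mkldnn;

    engine eng(engine::kind::cpu, 0);
    memory::dims shape = {128, 256}; // a 2D activations tensor

    // f32 is supported by all primitives on all platforms.
    auto md_f32 = memory::desc(shape, memory::data_type::f32,
            memory::format_tag::nc);

    // Lower precision descriptors can always be created; primitives
    // built from them succeed only where hardware acceleration exists.
    auto md_bf16 = memory::desc(shape, memory::data_type::bf16,
            memory::format_tag::nc);
    auto md_s8 = memory::desc(shape, memory::data_type::s8,
            memory::format_tag::nc);

    memory m_f32(md_f32, eng); // allocates an f32 buffer on the engine
    (void)md_bf16; (void)md_s8;
    return 0;
}
```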
Intel MKL-DNN supports training and inference with the following data types:
Usage mode | CPU | GPU |
---|---|---|
Inference | f32, bf16, s8/u8 | f32, f16 |
Training | f32, bf16 | f32 |
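As a rough illustration of this table, a hypothetical helper (not part of the library) could choose a low precision inference data type by engine kind; f32 remains the safe default on both devices:

```cpp
#include <mkldnn.hpp>
using namespace mkldnn;

// Hypothetical helper: pick a low precision inference data type
// following the usage-mode table above.
memory::data_type inference_low_precision(engine::kind kind) {
    // GPU inference: f16; CPU inference: bf16 (or s8/u8 for
    // quantized models). Training narrows this to f32/bf16 on CPU.
    return kind == engine::kind::gpu ? memory::data_type::f16
                                     : memory::data_type::bf16;
}

int main() {
    engine eng(engine::kind::cpu, 0);
    memory::data_type dt = inference_low_precision(eng.get_kind());
    (void)dt;
    return 0;
}
```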
See the topics dedicated to the individual data types for details.
Individual primitives may have additional limitations on data type support based on their precision requirements. The list of data types supported by each primitive is included in the corresponding sections of the developer guide.
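One portable way to discover such limitations at run time is to attempt to create a primitive descriptor and treat failure as "not supported". A minimal sketch, assuming the v1.x C++ API and a hypothetical `is_supported` helper:

```cpp
#include <mkldnn.hpp>
using namespace mkldnn;

// Hypothetical probe: returns true if a small inner product with the
// given data type can be implemented on the engine.
bool is_supported(const engine &eng, memory::data_type dt) {
    try {
        memory::desc src({2, 16}, dt, memory::format_tag::nc);
        memory::desc wei({4, 16}, dt, memory::format_tag::oi);
        memory::desc dst({2, 4}, dt, memory::format_tag::nc);
        inner_product_forward::desc d(
                prop_kind::forward_inference, src, wei, dst);
        inner_product_forward::primitive_desc pd(d, eng);
        return true;
    } catch (const error &) {
        // Typically an "unimplemented" status: no kernel for this
        // data type / primitive combination on this engine.
        return false;
    }
}
```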
While all platforms Intel MKL-DNN supports provide hardware acceleration for f32 arithmetic, this is not the case for the other data types. Because performance is the main purpose of low precision support, Intel MKL-DNN implements this functionality only for platforms with hardware acceleration for the given data type. The table below summarizes the current support matrix:
Data type | CPU | GPU |
---|---|---|
f32 | any | any |
bf16 | Intel(R) DL Boost with bfloat16 | not supported |
f16 | not supported | any |
s8/u8 | Intel AVX-512, Intel DL Boost | not supported |
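The int8 path also mixes data types within a single primitive: on CPU, the quantized convolution kernels typically expect unsigned 8-bit activations paired with signed 8-bit weights. A sketch of such a descriptor (v1.x C++ API; the shapes are illustrative only):

```cpp
#include <mkldnn.hpp>
using namespace mkldnn;

int main() {
    engine eng(engine::kind::cpu, 0);

    // u8 activations, s8 weights, s32 accumulation output.
    // format_tag::any lets the implementation pick optimal layouts.
    memory::desc src({1, 3, 227, 227}, memory::data_type::u8,
            memory::format_tag::any);
    memory::desc wei({96, 3, 11, 11}, memory::data_type::s8,
            memory::format_tag::any);
    memory::desc dst({1, 96, 55, 55}, memory::data_type::s32,
            memory::format_tag::any);

    convolution_forward::desc d(prop_kind::forward_inference,
            algorithm::convolution_direct, src, wei, dst,
            {4, 4}, {0, 0}, {0, 0}); // strides, padding_l, padding_r
    try {
        convolution_forward::primitive_desc pd(d, eng);
        // Success: this CPU has int8 acceleration (see table above).
    } catch (const error &) {
        // Unimplemented on CPUs without Intel AVX-512 / Intel DL Boost.
    }
    return 0;
}
```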