A subset of Basic Linear ALgebra (BLAS) functions to perform matrix-matrix multiplication. More...
Functions | |
mkldnn_status_t MKLDNN_API | mkldnn_sgemm (char transa, char transb, mkldnn_dim_t M, mkldnn_dim_t N, mkldnn_dim_t K, float alpha, const float *A, mkldnn_dim_t lda, const float *B, mkldnn_dim_t ldb, float beta, float *C, mkldnn_dim_t ldc) |
SGEMM performs a matrix-matrix multiplication operation defined as. More... | |
mkldnn_status_t MKLDNN_API | mkldnn_gemm_u8s8s32 (char transa, char transb, char offsetc, mkldnn_dim_t M, mkldnn_dim_t N, mkldnn_dim_t K, float alpha, const uint8_t *A, mkldnn_dim_t lda, uint8_t ao, const int8_t *B, mkldnn_dim_t ldb, int8_t bo, float beta, int32_t *C, mkldnn_dim_t ldc, const int32_t *co) |
mkldnn_gemm_u8s8s32() and mkldnn_gemm_s8s8s32() perform a matrix-matrix multiplication operation and add the result to a scalar-matrix product. More... | |
mkldnn_status_t MKLDNN_API | mkldnn_gemm_s8s8s32 (char transa, char transb, char offsetc, mkldnn_dim_t M, mkldnn_dim_t N, mkldnn_dim_t K, float alpha, const int8_t *A, mkldnn_dim_t lda, int8_t ao, const int8_t *B, mkldnn_dim_t ldb, int8_t bo, float beta, int32_t *C, mkldnn_dim_t ldc, const int32_t *co) |
mkldnn_gemm_u8s8s32() and mkldnn_gemm_s8s8s32() perform a matrix-matrix multiplication operation and add the result to a scalar-matrix product. More... | |
A subset of Basic Linear ALgebra (BLAS) functions to perform matrix-matrix multiplication.
mkldnn_status_t MKLDNN_API mkldnn_sgemm | ( | char | transa, |
char | transb, | ||
mkldnn_dim_t | M, | ||
mkldnn_dim_t | N, | ||
mkldnn_dim_t | K, | ||
float | alpha, | ||
const float * | A, | ||
mkldnn_dim_t | lda, | ||
const float * | B, | ||
mkldnn_dim_t | ldb, | ||
float | beta, | ||
float * | C, | ||
mkldnn_dim_t | ldc | ||
) |
SGEMM performs a matrix-matrix multiplication operation defined as.
C := alpha*op( A )*op( B ) + beta*C
where
The matrices are assumed to be stored in row-major order (the elements in a matrix rows are contiguous in memory).
mkldnn_status_t MKLDNN_API mkldnn_gemm_u8s8s32 | ( | char | transa, |
char | transb, | ||
char | offsetc, | ||
mkldnn_dim_t | M, | ||
mkldnn_dim_t | N, | ||
mkldnn_dim_t | K, | ||
float | alpha, | ||
const uint8_t * | A, | ||
mkldnn_dim_t | lda, | ||
uint8_t | ao, | ||
const int8_t * | B, | ||
mkldnn_dim_t | ldb, | ||
int8_t | bo, | ||
float | beta, | ||
int32_t * | C, | ||
mkldnn_dim_t | ldc, | ||
const int32_t * | co | ||
) |
mkldnn_gemm_u8s8s32() and mkldnn_gemm_s8s8s32() perform a matrix-matrix multiplication operation and add the result to a scalar-matrix product.
For the final result, a vector is added to each row or column of the output matrix.
The operation is defined as:
C := alpha*(op(A) - A_offset) * (op(B) - B_offset) + beta*C + C_offset
where
op( X ) = X
or op( X ) = X**T
,A_offset
is an m-by-k matrix with every element equal to the value ao
,B_offset
is an k-by-n matrix with every element equal to the value bo
,C_offset
is an m-by-n matrix defined by the co
array of size len
:offsetc = F
: len
must be at least 1
,offsetc = C
: len
must be at least max(1, m)
,offsetc = R
: len
must be at least max(1, n)
,alpha
and beta
are scalars, andA
, B
and C
are matrices, with op( A )
an m-by-k matrix, op( B )
a k-by-n matrix and C
an m-by-n matrix.The matrices are assumed to be stored in row-major order (the elements in a matrix rows are contiguous in memory).
mkldnn_status_t MKLDNN_API mkldnn_gemm_s8s8s32 | ( | char | transa, |
char | transb, | ||
char | offsetc, | ||
mkldnn_dim_t | M, | ||
mkldnn_dim_t | N, | ||
mkldnn_dim_t | K, | ||
float | alpha, | ||
const int8_t * | A, | ||
mkldnn_dim_t | lda, | ||
int8_t | ao, | ||
const int8_t * | B, | ||
mkldnn_dim_t | ldb, | ||
int8_t | bo, | ||
float | beta, | ||
int32_t * | C, | ||
mkldnn_dim_t | ldc, | ||
const int32_t * | co | ||
) |
mkldnn_gemm_u8s8s32() and mkldnn_gemm_s8s8s32() perform a matrix-matrix multiplication operation and add the result to a scalar-matrix product.
For the final result, a vector is added to each row or column of the output matrix.
For full description, see mkldnn_gemm_u8s8s32().