Perform a quantized matrix multiplication of  a by the matrix b.
tf.raw_ops.QuantizedMatMul(
    a, b, min_a, max_a, min_b, max_b, Toutput=tf.dtypes.qint32, transpose_a=False,
    transpose_b=False, Tactivation=tf.dtypes.quint8, name=None
)
The inputs must be two-dimensional matrices and the inner dimension of
a (after being transposed if transpose_a is non-zero) must match the
outer dimension of b (after being transposed if transposed_b is
non-zero).
Args | |
|---|---|
a
 | 
A Tensor. Must be one of the following types: qint8, quint8, qint32, qint16, quint16.
Must be a two-dimensional tensor.
 | 
b
 | 
A Tensor. Must be one of the following types: qint8, quint8, qint32, qint16, quint16.
Must be a two-dimensional tensor.
 | 
min_a
 | 
A Tensor of type float32.
The float value that the lowest quantized a value represents.
 | 
max_a
 | 
A Tensor of type float32.
The float value that the highest quantized a value represents.
 | 
min_b
 | 
A Tensor of type float32.
The float value that the lowest quantized b value represents.
 | 
max_b
 | 
A Tensor of type float32.
The float value that the highest quantized b value represents.
 | 
Toutput
 | 
An optional tf.DType from: tf.qint8, tf.quint8, tf.qint32, tf.qint16, tf.quint16. Defaults to tf.qint32.
 | 
transpose_a
 | 
An optional bool. Defaults to False.
If true, a is transposed before multiplication.
 | 
transpose_b
 | 
An optional bool. Defaults to False.
If true, b is transposed before multiplication.
 | 
Tactivation
 | 
An optional tf.DType from: tf.qint8, tf.quint8, tf.qint32, tf.qint16, tf.quint16. Defaults to tf.quint8.
The type of output produced by activation function
following this operation.
 | 
name
 | 
A name for the operation (optional). | 
Returns | |
|---|---|
A tuple of Tensor objects (out, min_out, max_out).
 | 
|
out
 | 
A Tensor of type Toutput.
 | 
min_out
 | 
A Tensor of type float32.
 | 
max_out
 | 
A Tensor of type float32.
 |