BLAS Level 3 routines perform matrix-matrix operations. Table “BLAS Level 3 Routine Groups and Their Data Types” lists the BLAS Level 3 routine groups and the data types associated with them.
Routine Group |
Data Types |
Description |
---|---|---|
s, d, c, z |
Matrix-matrix product of general matrices |
|
c, z |
Matrix-matrix product of Hermitian matrices |
|
c, z |
Rank-k update of Hermitian matrices |
|
c, z |
Rank-2k update of Hermitian matrices |
|
s, d, c, z |
Matrix-matrix product of symmetric matrices |
|
s, d, c, z |
Rank-k update of symmetric matrices |
|
s, d, c, z |
Rank-2k update of symmetric matrices |
|
s, d, c, z |
Matrix-matrix product of triangular matrices |
|
s, d, c, z |
Linear matrix-matrix solution for triangular matrices |
Many applications spend considerable time for executing BLAS level 3 routines. This time can be scaled by the number of processors available on the system through using the symmetric multiprocessing(SMP) feature built into the Intel MKL Library. The performance enhancements based on the parallel use of the processors are available without any programming effort on your part.
To enhance performance, the library uses the following methods:
The operation of BLAS level 3 matrix-matrix functions permits to restructure the code in a way which increases the localization of data reference, enhances cache memory use, and reduces the dependency on the memory bus.
Once the code has been effectively blocked as described above, one of the matrices is distributed across the processors to be multiplied by the second matrix. Such distribution ensures effective cache management which reduces the dependency on the memory bus performance and brings good scaling results.
Copyright © 1994 - 2010, Intel Corporation. All rights reserved.