FM

Configuration options for the full matrices. [Edit on GitHub]

Keywords

Keyword descriptions

FORCE_BLOCK_SIZE: logical = F

Lone keyword: T

Usage: FORCE_BLOCK_SIZE

Ensure for small matrices that the layout is compatible with bigger ones, i.e. no subdivision is performed (can break LAPACK!!!). [Edit on GitHub]

NCOL_BLOCKS: integer = 64

Defines the number of columns per scalapack block in the creation of vlock cyclic dense matrices [Edit on GitHub]

NROW_BLOCKS: integer = 64

Defines the number of rows per scalapack block in the creation of block cyclic dense matrices [Edit on GitHub]

TYPE_OF_MATRIX_MULTIPLICATION: enum = COSMA

Valid values:

Allows to switch between scalapack pxgemm and COSMA pxgemm. COSMA reduces the communication costs but increases the memory demands. The performance of Scalapack’s pxgemm on GPU’s depends crucially on the BLOCK_SIZES. Make sure optimized kernels are available. [Edit on GitHub]